Abstract
This study investigates radiomic efficacy in post-surgical traumatic spinal cord injury (SCI), overcoming MRI limitations from metal artifacts to enhance diagnosis, severity assessment, and lesion characterization or prognosis and therapy guidance. Traumatic spinal cord injury (SCI) causes severe neurological deficits. While MRI allows qualitative injury evaluation, standard imaging alone has limitations for precise SCI diagnosis, severity stratification, and pathology characterization, which are needed to guide prognosis and therapy. Radiomics enables quantitative tissue phenotyping by extracting a high-dimensional set of descriptive texture features from medical images. However, the efficacy of postoperative radiomic quantification in the presence of metal-induced MRI artifacts from spinal instrumentation has yet to be fully explored. A total of 50 healthy controls and 12 SCI patients post-stabilization surgery underwent 3D multi-spectral MRI. Automated spinal cord segmentation was followed by radiomic feature extraction. Supervised machine learning categorized SCI versus controls, injury severity, and lesion location relative to instrumentation. Radiomics differentiated SCI patients (Matthews correlation coefficient (MCC) 0.97; accuracy 1.0), categorized injury severity (MCC: 0.95; ACC: 0.98), and localized lesions (MCC: 0.85; ACC: 0.90). Combined T1 and T2 features outperformed individual modalities across tasks with gradient boosting models showing the highest efficacy. The radiomic framework achieved excellent performance, differentiating SCI from controls and accurately categorizing injury severity. The ability to reliably quantify SCI severity and localization could potentially inform diagnosis, prognosis, and guide therapy. Further research is warranted to validate radiomic SCI biomarkers and explore clinical integration.
Keywords: radiomics, spinal cord injury, multi-spectral imaging, magnetic resonance imaging, metal artifact
1. Introduction
Traumatic spinal cord injury (SCI) is a devastating condition affecting millions of individuals worldwide. SCI profoundly impacts physical, psychological, and socioeconomic well-being [1]. In the United States alone, approximately 17,800 new SCI cases occur annually [2]. SCI can damage axons, neurons, glia, and blood vessels, resulting in temporary or permanent sensory and motor deficits below the lesion level [3]. Most SCIs occur at cervical levels, with common causes being motor vehicle collisions, falls, violence, and sports activities [2].
Acute SCI can present with varying clinical manifestations depending on injury level and severity. These include tetraplegia or paraplegia, sensory deficits, autonomic dysfunction affecting cardiovascular, respiratory, and bowel/bladder function, and neuropathic pain. The severity and extent of these manifestations significantly influence patient outcomes and rehabilitation strategies [4].
Magnetic resonance imaging (MRI) is the preferred modality for visualizing the spinal cord and soft tissues [5]. Conventional MRI protocols enable the detection of cord compression, signal changes, edema, hemorrhage, and morphologic alterations after injury. However, qualitative image evaluation has limitations in providing microstructural and functional details needed to guide SCI prognosis and management [6].
Recent advances in quantitative MRI techniques have shown promise in characterizing SCI pathology. Diffusion tensor imaging has demonstrated utility in assessing white matter integrity and predicting functional outcomes [7,8]. Magnetization transfer imaging provides insights into myelin content [7,9], and functional MRI has revealed changes in brain activity following spinal cord injury [10]. While these methods can extract precise biomarkers of post-traumatic cord integrity and function, they each probe specific physiological phenomena in isolation.
Machine learning methods are increasingly being applied to spinal cord imaging analysis. Deep learning approaches have improved segmentation accuracy [11] and pathologic finding detection [12]. Radiomics offers a more holistic approach by extracting multiple descriptive features from medical images through high-throughput data characterization [13]. These methods have shown promise for prognosis in oncology [14]. Recently, radiomic techniques have been explored in spinal cord studies, with Okimatsu et al. developing a model using T2*-weighted MRI to predict neurological outcomes after acute cervical SCI [15]. However, a key challenge is that SCI frequently requires surgical stabilization involving metallic instrumentation. This hardware produces artifacts on conventional postoperative MRI [16] that disrupt quantitative radiomic analyses.
This study aimed to implement a radiomic modeling approach to analyze MRI of the instrumented spinal cord in SCI subjects. Multi-spectral imaging sequences were leveraged to suppress metal artifacts and enable unobstructed radiomic feature extraction at instrumented levels [17]. We hypothesized that radiomic signatures could reliably categorize SCI severity and lesion location. Successfully quantifying MRI traits in instrumented cords could ultimately enable monitoring of traumatic changes to inform SCI diagnosis and therapeutic regimens.
2. Materials and Methods
Reporting and analysis in this study followed the CheckList for EvaluAtion of Radiomics research (CLEAR) documentation standard focusing on repeatability, reproducibility, and transparency of radiomic studies [18].
2.1. Study Cohorts
This study involved 12 subjects with traumatic SCI who underwent MRI scans at 1-14 months (mean 7.08 ± 4.03) following surgical stabilization at cervical levels using metallic instrumentation. SCI severity was graded using the American Spinal Injury Association (ASIA) Impairment Scale (AIS). The study also included 50 healthy controls with no SCI or cord disorder history. Informed consent was obtained from all participants per our Institutional Review Board protocol. Table 1 summarizes the cohort demographics.
Table 1.
Cohorts | Gender | Count | Age | BMI | ASIA: A | ASIA: B | ASIA: C | ASIA: D |
---|---|---|---|---|---|---|---|---|
Healthy | Female | 25 | 47.52 ± 15.23 | 27.22 ± 7.18 | - | - | - | - |
Male | 25 | 48.50 ± 16.92 | 27.84 ± 4.67 | - | - | - | - | |
Total | 50 | 48.02 ± 15.96 | 27.54 ± 5.98 | - | - | - | - | |
SCI | Female | 6 | 59.50 ± 18.62 | 25.57 ± 5.90 | 0 | 1 | 2 | 3 |
Male | 6 | 48.50 ± 21.95 | 23.52 ± 2.68 | 2 | 1 | 1 | 2 | |
Total | 12 | 54.00 ± 20.24 | 24.54 ± 4.50 | 2 | 2 | 3 | 5 |
Imaging was performed at 3T (GE Signa Premier) using a 21-channel neurovascular coil. Multi-spectral 3D fast spin echo MRI was acquired to suppress metal artifacts [17]. Isotropic 1.2 mm resolution T1- and T2-weighted volumes were obtained with 8 spectral bins. Imaging parameters were as follows: TR\TE for T1—750/8 ms; for T2—2100/60 ms; and ARC 2 × 2 acceleration.
2.2. Image Analysis
The spinal cord was automatically segmented using a multi-step pipeline (Figure 1) as described in [19]. First, N4 bias field correction was applied to remove image shading artifacts. The Spinal Cord Toolbox (SCT) deep learning model [20] then performed initial segmentation independently on both T1 and T2 volumes. However, these initial segmentations exhibited intermittent failures near metallic instrumentation where image artifacts were present. To address these failures, the T1 data were registered to the T2 space using SCT’s registration module. The segmentations were then integrated using a radial basis function (RBF) algorithm, which creates a smooth interpolation between the T1 and T2 segmentations. The RBF approach was chosen for its robustness to local failures; when one modality’s segmentation fails due to metal artifacts, the algorithm appropriately weights the more reliable segmentation from the other modality in that region. This integrated segmentation leverages complementary information from both imaging sequences to produce a more robust final result. The improved segmentation was then used by SCT to automatically label the vertebral levels [19].
Radiomic feature extraction was performed within cord segmentation using PyRadiomics [21]. A total of 1374 features describing intensity, shape, and texture patterns were generated from original and filtered images, including wavelet, square, square root, logarithm, exponential, gradient, and local binary patterns.
2.3. Classification Framework
Three classification tasks were the following: (1) differentiating SCI cases from healthy controls, (2) SCI severity (severe AIS A-B vs. non-severe AIS C-D), and (3) lesion zone (above, at, or below instrumentation level). For each target, models were trained using T1, T2, or combined T1 + T2 radiomic features to compare performance. Evaluation metrics were accuracy, the Matthews correlation coefficient (MCC), the F1 score, and the area under the ROC (receiver operating characteristic) curve (AUC).
The radiomic feature sets were input into a supervised machine learning pipeline to differentiate SCI subjects from controls and categorize injury severity and cord location relative to the injury site. The dataset was divided into training (70%), validation (15%), and testing (15%) subsets. An automated modeling framework (H2O AutoML [21]) evaluated various classifiers (random forest, XGBoost, neural networks, etc.) using 5-fold cross-validation on the training data. To address class imbalance in the dataset, oversampling of minority classes was applied during the training process, ensuring more equitable representation and mitigating bias toward the majority class. This approach improves the generalization and reliability of the models, especially for minority class predictions.
Feature selection was automatically performed within the H2O AutoML framework, which identified GBM as the optimal model. The framework leveraged GBM’s inherent feature importance calculation, where features were weighted based on their contribution to decision tree splits and the associated reduction in squared error across the ensemble. Feature importance scores were normalized to a [0, 1] range, enabling systematic identification of the most influential radiomic markers while automatically suppressing less informative features during model optimization. This automated feature selection streamlined the modeling process while maintaining robust predictive performance [22,23].
3. Results
Figure 2 depicts sample T1- and T2-weighted MRI images and the segmented cord on an instrumented slice. As shown in Figure 3, a combined T1 and T2 feature set achieved strong performance in discriminating between healthy controls and SCI patients, with 0.97 MCC, 0.98 F1 score, 1.00 accuracy, and 1.00 AUC. For predicting injury severity, the T1 + T2 model again achieved robust performance with 0.95 MCC, 0.98 F1 score, 0.98 accuracy, and 0.99 AUC. The T2 model achieved 0.86 MCC and 0.94 accuracy. For lesion zone classification, the T1 + T2 model performed best with 0.85 MCC, 0.90 F1 score, 0.90 accuracy, and 0.98 AUC. The T2 model achieved 0.81 MCC and 0.88 accuracy.
Gradient boosting machine (GBM) models achieved the top performance for most tasks. The only exception was the zone classification task using T1 features, for which XGBoost was optimal.
Overall, the combined T1 + T2 models outperformed individual modalities across tasks. The models demonstrated excellent discrimination for SCI vs. controls and good predictive performance for injury severity. The results were strong but comparatively lower for the more challenging three-class zone classification task.
4. Discussion
This preliminary study demonstrates the potential of a radiomic modeling approach for instrumented spinal cord MRI analysis in traumatic SCI. A key advance was the use of multi-spectral imaging to suppress instrumentation artifacts that can distort quantitative feature extraction. The study’s reproducibility is strengthened by the implementation of standardized tools and protocols, including multi-spectral imaging optimized for metal artifact suppression [17], validated automated spinal cord segmentation [19], and standardized radiomic feature extraction following established guidelines [18]. Radiomic SCI characterization could offer advantages over both qualitative evaluation alone and standard diffusion/functional MRI methods that assess specific microstructural or physiological properties in isolation. The high throughput radiomic feature set provides a more comprehensive phenotypic profiling of overall cord tissue traits linked to injury.
The radiomic framework reliably differentiated severe and non-severe SCI categories, achieving robust classification performance. This ability to determine injury severity, which has significant implications for prognosis and therapy, demonstrates clinical utility.
While global accuracy metrics were relatively high across tasks, lower MCC and F1 scores imply that some degree of inter-class imbalance likely exists in the dataset. This imbalance means majority classes were more successfully predicted than minority classes. Techniques such as oversampling of the minority classes or cost-sensitive learning could address this and improve MCC and F1 metrics. Additionally, discrimination power was weaker for more nuanced tasks like severity level or subtle zone differences. These findings warrant focused efforts on feature engineering and model tuning targeting MCC and F1 improvements.
When assessing the advantages of combined T1 and T2 features versus prolonged scan times, the MCC is particularly informative in the presence of class imbalance. For cohort differentiation, the MCC increase from 0.92 to 0.97 with combined features is substantial. However, the 0.92 baseline already indicates robust predictive power. In efficiency-focused clinical settings, marginal T1 + T2 benefits may not outweigh longer scans, especially for resource optimization.
For severity classification, the MCC rose slightly from 0.86 to 0.95 with combined features. Although showing an increase, the 0.86 T2 baseline is respectable. The slight absolute MCC increase may have limited utility depending on clinical use. T2 could suffice when efficiency is critical and acceptable severity discrimination is achievable. However, for applications where severity subtleties carry high stakes, the T1 + T2 approach may provide value despite a longer scan time.
For multiclass zone classification, the more substantial MCC boost from 0.81 to 0.85 with T1 + T2 features could justify extra scan time. While context-dependent, this degree of performance lift may warrant dual-acquisition protocols.
The ability to quantitatively characterize post-surgical cord changes through radiomics could provide valuable prognostic information. By reliably categorizing injury severity and location, this approach could help predict functional outcomes, guide rehabilitation strategies, and monitor treatment response. Future longitudinal studies could establish relationships between radiomic features and functional recovery patterns, potentially enabling more personalized treatment planning.
A limitation of this study is the small cohort size, which may impact the generalizability of the results. Future studies with larger and more diverse patient cohorts are warranted to validate the findings and further enhance model robustness and accuracy across different populations. Moreover, although the multi-spectral imaging technique used in this study effectively reduces metallic artifacts, it is important to acknowledge its limitations, particularly in cases with complex or extensive metallic instrumentation. Residual artifacts, while minimized, can still interfere with image quality and may impact the accuracy of radiomic feature extraction. This is especially relevant for texture-based features, which are sensitive to subtle intensity and spatial variations. Such artifacts could introduce variability or bias in the radiomic data, potentially affecting the reliability of the extracted features and subsequent model predictions. While the results of this study demonstrate the potential of radiomic analysis in postoperative spinal cord injury assessment, the influence of residual artifacts warrants further investigation. Future work could explore advanced artifact reduction techniques or post-processing methods to enhance radiomic feature robustness, particularly in challenging cases. Additionally, incorporating quality assurance metrics to quantify artifact levels could help identify and mitigate their impact during the analysis pipeline.
In summary, T1 + T2 improved performance metrics across tasks. However, clinical value versus efficiency tradeoffs depend on the classification specifics and performance requirements. Further feature engineering or integrating other imaging modalities could refine model performance. More extensive longitudinal studies are essential to fully explore clinical utility. Overall, radiomic modeling shows promise for quantitative SCI MRI, potentially guiding diagnosis and management.
Author Contributions
Conceptualization, A.S. and K.M.K.; data curation, A.S.; formal analysis, A.S.; funding acquisition, K.M.K.; investigation, K.M.K.; methodology, A.S.; project administration, K.M.K.; resources, K.M.K.; software, A.S.; supervision, K.M.K.; validation, A.S. and K.M.K.; visualization, A.S.; writing—original draft, A.S.; writing—review and editing, A.P.K. and K.M.K. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
This study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Medical College of Wisconsin (protocol code PRO00034880, date of approval: 15 February 2024).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
The data presented in this study are openly available in Mendeley Data at https://data.mendeley.com/datasets/sjpx7md7cf/1 (accessed on 25 October 2024).
Conflicts of Interest
The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of the data; in the writing of the manuscript; or in the decision to publish the results.
Funding Statement
This research was funded by the Department of Defense’s Congressionally Directed Medical Research Program and Spinal Cord Injury Research Program, award number W81XWH1910273.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Lee B.B., Cripps R.A., Fitzharris M., Wing P.C. The global map for traumatic spinal cord injury epidemiology: Update 2011, global incidence rate. Spinal Cord. 2014;52:110–116. doi: 10.1038/sc.2012.158. [DOI] [PubMed] [Google Scholar]
- 2.Fehlings M.G., Singh A., Tetreault L., Kalsi-Ryan S., Nouri A. Global prevalence and incidence of traumatic spinal cord injury. Clin. Epidemiol. 2014;6:309–331. doi: 10.2147/CLEP.S68889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Dulin J.N., Adler A.F., Kumamaru H., Poplawski G.H.D., Lee-Kubli C., Strobl H., Gibbs D., Kadoya K., Fawcett J.W., Lu P., et al. Injured adult motor and sensory axons regenerate into appropriate organotypic domains of neural progenitor grafts. Nat. Commun. 2018;9:84. doi: 10.1038/s41467-017-02613-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ahuja C.S., Wilson J.R., Nori S., Kotter M., Druschel C., Curt A., Fehlings M.G. Traumatic spinal cord injury. Nat. Rev. Dis. Primers. 2017;3:1–21. doi: 10.1038/nrdp.2017.18. [DOI] [PubMed] [Google Scholar]
- 5.Shabani S., Meyer B.P., Budde M.D., Wang M.C. Diagnostic Imaging in Spinal Cord Injury. Neurosurg. Clin. N. Am. 2021;32:323–331. doi: 10.1016/j.nec.2021.03.004. [DOI] [PubMed] [Google Scholar]
- 6.Martin A.R., Aleksanderek I., Cohen-Adad J., Tarmohamed Z., Tetreault L., Smith N., Cadotte D.W., Crawley A., Ginsberg H., Mikulis D.J., et al. Translating state-of-the-art spinal cord MRI techniques to clinical use: A systematic review of clinical studies utilizing DTI, MT, MWF, MRS, and fMRI. NeuroImage Clin. 2016;10:192–238. doi: 10.1016/j.nicl.2015.11.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cohen-Adad J., El Mendili M.-M., Lehéricy S., Pradat P.-F., Blancho S., Rossignol S., Benali H. Demyelination and degeneration in the injured human spinal cord detected with diffusion and magnetization transfer MRI. NeuroImage. 2011;55:1024–1033. doi: 10.1016/j.neuroimage.2010.11.089. [DOI] [PubMed] [Google Scholar]
- 8.Madden D.J., Bennett I.J., Burzynska A., Potter G.G., Chen N.-K., Song A.W. Diffusion tensor imaging of cerebral white matter integrity in cognitive aging. Biochim. Et Biophys. Acta (BBA)-Mol. Basis Dis. 2012;1822:386–400. doi: 10.1016/j.bbadis.2011.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sled J.G. Modelling and interpretation of magnetization transfer imaging in the brain. NeuroImage. 2018;182:128–135. doi: 10.1016/j.neuroimage.2017.11.065. [DOI] [PubMed] [Google Scholar]
- 10.Athanasiou A., Klados M.A., Pandria N., Foroglou N., Kavazidi K.R., Polyzoidis K., Bamidis P.D. A Systematic Review of Investigations into Functional Brain Connectivity Following Spinal Cord Injury. Front. Hum. Neurosci. 2017;11:517. doi: 10.3389/fnhum.2017.00517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Fei N., Li G., Wang X., Li J., Hu X., Hu Y. Deep Learning-Based Auto-Segmentation of Spinal Cord Internal Structure of Diffusion Tensor Imaging in Cervical Spondylotic Myelopathy. Diagnostics. 2023;13:817. doi: 10.3390/diagnostics13050817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Merali Z., Wang J.Z., Badhiwala J.H., Witiw C.D., Wilson J.R., Fehlings M.G. A deep learning model for detection of cervical spinal cord compression in MRI scans. Sci. Rep. 2021;11:10473. doi: 10.1038/s41598-021-89848-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lambin P., Leijenaar R.T.H., Deist T.M., Peerlings J., de Jong E.E.C., van Timmeren J., Sanduleanu S., Larue R.T.H.M., Even A.J.G., Jochems A., et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 2017;14:749–762. doi: 10.1038/nrclinonc.2017.141. [DOI] [PubMed] [Google Scholar]
- 14.Aerts H.J.W.L., Velazquez E.R., Leijenaar R.T.H., Parmar C., Grossmann P., Carvalho S., Bussink J., Monshouwer R., Haibe-Kains B., Rietveld D., et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 2014;5:4006. doi: 10.1038/ncomms5006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Okimatsu S., Maki S., Furuya T., Fujiyoshi T., Kitamura M., Inada T., Aramomi M., Yamauchi T., Miyamoto T., Inoue T., et al. Determining the short-term neurological prognosis for acute cervical spinal cord injury using machine learning. J. Clin. Neurosci. 2022;96:74–79. doi: 10.1016/j.jocn.2021.11.037. [DOI] [PubMed] [Google Scholar]
- 16.Hargreaves B.A., Worters P.W., Pauly K.B., Pauly J.M., Koch K.M., Gold G.E. Metal-Induced Artifacts in MRI. Am. J. Roentgenol. 2011;197:547–555. doi: 10.2214/AJR.11.7364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Koch K.M., Lorbiecki J.E., Hinks R.S., King K.F. A multispectral three-dimensional acquisition technique for imaging near metal implants. Magn. Reson. Med. 2009;61:381–390. doi: 10.1002/mrm.21856. [DOI] [PubMed] [Google Scholar]
- 18.Kocak B., Baessler B., Bakas S., Cuocolo R., Fedorov A., Maier-Hein L., Mercaldo N., Müller H., Orlhac F., dos Santos D.P., et al. CheckList for EvaluAtion of Radiomics research (CLEAR): A step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Into Imaging. 2023;14:75. doi: 10.1186/s13244-023-01415-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Koch K.M., Nencka A.S., Klein A., Wang M., Kurpad S., Vedantam A., Budde M. Diffusion-weighted MRI of the spinal cord in cervical spondylotic myelopathy after instrumented fusion. Front. Neurol. 2023;14:1172833. doi: 10.3389/fneur.2023.1172833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.De Leener B., Lévy S., Dupont S.M., Fonov V.S., Stikov N., Collins D.L., Callot V., Cohen-Adad J. SCT: Spinal Cord Toolbox, an open-source software for processing spinal cord MRI data. NeuroImage. 2017;145:24–43. doi: 10.1016/j.neuroimage.2016.10.009. [DOI] [PubMed] [Google Scholar]
- 21.van Griethuysen J.J.M., Fedorov A., Parmar C., Hosny A., Aucoin N., Narayan V., Beets-Tan R.G.H., Fillion-Robin J.-C., Pieper S., Aerts H.J.W.L. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017;77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.LeDell E., Poirier S. 7th ICML Workshop on Automated Machine Learning (AutoML) ICML; San Diego, CA, USA: 2020. H2O AutoML: Scalable automatic machine learning. [Google Scholar]
- 23.Ryan R., Klautau A. In Defense of One-Vs-All Classification. [(accessed on 5 November 2024)];J. Mach. Learn. Res. 2004 5:101–141. Available online: http://www.jmlr.org/papers/v5/rifkin04a.html. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data presented in this study are openly available in Mendeley Data at https://data.mendeley.com/datasets/sjpx7md7cf/1 (accessed on 25 October 2024).