Abstract
Automatic analysis of pathologic vertebrae from computed tomography (CT) scans could significantly improve the diagnostic management of patients with metastatic spine disease. We provide the first publicly available annotated imaging dataset of cancerous CT spines to help develop artificial intelligence frameworks for automatic vertebrae segmentation and classification. This collection contains a dataset of 55 CT scans collected on patients with various types of primary cancers at two different institutions. In addition to raw images, data include manual segmentations and contours, vertebral level labeling, vertebral lesion-type classifications, and patient demographic details. Our automated segmentation model uses nnU-Net, a freely available open-source framework for deep learning in healthcare imaging, and is made publicly available. This data will facilitate the development and validation of models for predicting the mechanical response to loading and the resulting risk of fractures and spinal deformity.
Introduction
After the liver and the lungs, bones are the third most frequent target site of metastases in the United States (1), with the spine being the most frequent site of bone metastases (2). Up to 50% of spinal metastases receive treatment, and 5–10% receive surgical management (3). Patients are at high risk for skeletal adverse events, including bone pain, fracture, hypercalcemia, and neurological dysfunction secondary to the spinal cord or cauda equina compression (4), that profoundly and negatively impact patients’ lives (24), requiring high doses of narcotics and reduced quality of life (5). Although the impact of spinal metastasis on the vertebral column anatomical integrity is readily imaged with MRI and CT (6,7,8), in the absence of quantitative assessment (9, 10, 11), standard-of-care imaging modalities (12, 13, 14) do not provide precise estimates of the structural competence of the spine. This limited ability profoundly negatively impacts patients’ clinical care, with clinicians forced to decide without objective quantitative criteria whether surgery or vertebral augmentation should be offered or withheld in attempts to treat to reduce the risk of fracture or to control pain. This choice can be particularly tragic when treatment could have been offered before the development of complications.
Implementing routine quantitative measurements from clinical imaging, such as vertebral segmentation, biomechanical quantification (26), lesion classification, longitudinal lesion growth monitoring (25), and risk progression estimates, would guide individualized earlier intervention. Recently, deep learning has been successfully applied to vertebral segmentation and classification tasks, demonstrating high accuracy and outperforming conventional methods (15). This success is due, in part, to the Large Scale Vertebrae Segmentation Challenge (VerSe) (16) and dataset (26), containing a total of 374 multi-detector CT scans with voxel-level annotations publically released. Before VerSe, only small public imaging datasets with vertebral segmentations existed (17,18,19,20,21,22), hindering efforts to develop robust and accurate segmentation and classification methods. Nevertheless, one limitation of VerSe is the exclusion of scans with vertebral abnormalities, such as bone metastasis and primary bone tumors, from their dataset.
In this work, we provide the first publicly available imaging dataset of vertebral segmentation masks for spine CT images of subjects with metastatic spine disease with annotations of vertebral lesions per vertebral level. This dataset is aimed to allow the development of automated large-scale deep learning methods for spine segmentation, vertebral level, and lesion classification and to facilitate longitudinal studies for monitoring lesion growth and predicting fracture risk.
Materials and Methods
Patient Overview
This dataset contains 55 metastatic spine disease patients CT scans (Male: Female 36:19; mean age: 64.70, range: 33–82, see Table 1) treated with radiotherapy or Stereotactic Body Radiation Therapy between September 2020 and October 2021 due to documented spine metastases (CT, MR, or bone scan imaging).
Table 1:
Patient demographics and primary cancer categories.
| Demographics | N = 55 |
|---|---|
| Age | Mean: 64.7 years (Range: 33–82) |
| Sex | Female: 34.54% | Male: 65.45% |
| Primary Cancer | Instances |
| - Prostate | 24 |
| - Renal | 8 |
| - Lung | 7 |
| - Breast | 5 |
| - Colon | 2 |
| - Skin | 2 |
| - Uterus, Cervical, Neuro, Soft Tissue Sarcoma, Urothelial, Skeletal, Bile Ducts | 1 each |
Patients were identified for the study by the attending clinician according to the following inclusion criteria: (1) Histologically or cytologically documented stage IV bone metastases and radiographic (CT scan or bone scan) evidence of bone metastases, and (2) Karnofsky Performance Status >70. Patients were excluded if they had abnormal bone metabolism Paget disease, and untreated hyperthyroidism, hyperprolactinemia, or Cushing disease, or had previous surgery or vertebral augmentation at the site of radiation. Patients with prior radiotherapy treatment were excluded only if the patient had prior radiotherapy treatment at the spinal region under consideration. All contributions to this study were based on approval by Institutional Review Boards and were conducted following the 1964 Declaration of Helsinki.
Image Acquisition
Clinical CT data was obtained on a General Electric scanner (LightSpeed RT16) or a Siemens scanner (SOMATOM Confidence). While acquisitions vary by patient, CT imaging was typically performed with an in-plane resolution of 0.3125×0.3125mm, in-plane matrix of 512×512, and slice thickness of 1.0–1.5mm. Data were fully de-identified with scans including no anatomy above the neck. All data were visually inspected before release. With the de-identified patients selected retrospectively, the study did not require informed consent from individual participants.
Vertebral Segmentation, Level Annotation, and Lesion Classification
The CT data was imported into the 3DSlicer imaging platform. For each spine, eighteen vertebrae (T1 to L5) were considered for annotation depending on the field of view, with labels from 6 to 25 in the original segmentation label maps. Note that partially visible vertebrae at the top or bottom of the scan (or both) were not annotated. Each vertebral level was segmented using “Grow from Seeds” method, resulting in volumetric label maps by a spine biomechanics expert (RNA 25 years of experience). The anatomical accuracy of the segmentation and leabling were confirmed by an expert radiologist (DBH, 35 years of experience). In addition to the delineations, DBH and RNA classified vertebral metastatic lesions as Not-involved, Lytic, Blastic, or Mixed, as illustrated in Figure 2.
Figure 2:
Illustrative example dataset of a patient with vertebral lesions with the raw CT, annotated segmentation and identification of vertebral levels, and lesion classification (Lytic, Blastic, and Mixed).
Technical Validation: nnU-Net for Baseline Segmentation Performance
We trained a deep convolutional neural network using the dataset. Each vertebra was automatically segmented from the background, localized, and labeled, and 3D surfaces were generated using 3DSlicer. We used the nnU-Net framework (25), a self-adapting framework for 3D full-resolution image segmentation based on the U-Net architecture. We obtained an average segmentation accuracy of 94.33% ±2.24% Dice score on the spine CTs dataset not used in the training (3-fold validation). Segmentation results are illustrated in Figure 3.
Figure 3:
Segmentation results on one patient using the trained nnU-Net model. From left to right: Axial, Coronal, Sagittal, and 3D views of the spine CT with segmentation masks; segmentation results overlaid with ground truth annotation of one vertebra with a blastic lesion.
We then compared our model with the TotalSegmentator framework (15), trained on a spine CT dataset that excludes vertebral lesions. Our comparison exhibited an average improvement in Dice score of 3.32% across all spines and an average improvement in Dice score of 5.18% in vertebrae with bone metastasis, further validating the importance of our dataset.
Resulting Dataset
De-identified data was submitted to The Cancer Imaging Archive (TCIA) through its established selection and governance processes. The CT images exclude the head and, thus, any facial features. The images and associated files are publicly available at:https://doi.org/10.7937/kh36-ds04.
Both CT image files and segmentation are provided in DICOM format for each case, compatible with TCIA requirements and interoperating with PACS and commercial tools. We followed the SNOMED® CT coding scheme as the controlled terminology to label each vertebra. Lesion classification is provided in a Microsoft Excel format, and maps every vertebra level for each scan with a lesion class is also provided. In addition to the CT images, the dataset contains patient demographics data and primary cancer sites.
The dataset comprises 55 CT image series from 55 patients and 787 vertebrae, encompassing 543 thoracic and 244 lumbar vertebrae. Among these vertebrae, 183 contain lesions, 111 thoracic and 72 lumbar, classified as 58 blastic, 54 lytic, and 71 mixed (see Table 2). This represents a considerable increase in available annotated data—particularly for lesion vertebrae—compared with previously available datasets (17,18,19,20,21,22,26).
Table 2:
Classification of vertebral lesions by level.
| Classification (Lesion) | Thoracic | Lumbar | Total |
|---|---|---|---|
| Vertebra with Lesion | 111 | 72 | 183 |
| - Blastic | 33 | 25 | 58 |
| - Lytic | 30 | 24 | 54 |
| - Mixed | 48 | 23 | 71 |
| Vertebra Not Involved | 432 | 172 | 604 |
| All vertebrae | 543 | 244 | 787 |
Discussion
We present a unique dataset of spine CTs featuring metastatic bone lesions with annotated vertebral segmentation and lesion classifications. While public datasets of spine CTs, such as VerSe, are available, they primarily focus on healthy subjects and are dedicated to voxel-based segmentation and classification. With this dataset, we aim to enhance existing resources by specifically highlighting bone lesions to encourage further research in this domain.
We acknowledge that the dataset includes a limited number of cases, largely due to the complexity of lesion type classification, and does not contain follow-up scans. Nevertheless, we are confident that this resource will advance research in deep-learning approaches for detecting, segmenting and classifying metastatic bone lesions in spine CT. Ultimately, this dataset may support the development of objective quantitative criteria to guide treatment decisions aimed at reducing fracture risk and controlling pain in affected patients.
Figure 1:
Example lesions and segmentations found in the dataset. Top row: Spine CTs with lytic, blastic, and mixed lesions and without lesions. Bottom row: Spine CTs with masks visualized as colored overlays.
Acknowledgments
This work was supported by the NIAMS and the NIBIB of the National Institutes of Health under award numbers R01AR075964, 3R01AR075964-03S1, and K25EB035166.
References
- 1.U.S. Cancer Statistics Working Group. U.S. Cancer Statistics Data Visualizations Tool bosd. Cancer Facts & Figures 2022. In: Institute CfDCaPaNC, editor.: U.S. Department of Health and Human Services; 2022. [Google Scholar]
- 2.Siegel R, DeSantis C, Virgo K, Stein K, Mariotto A, Smith T, Cooper D, Gansler T, Lerro C, Fedewa S, Lin C, Leach C, Cannady RS, Cho H, Scoppa S, Hachey M, Kirch R, Jemal A, Ward E. Cancer treatment and survivorship statistics, 2012. CA Cancer J Clin. 2012;62(4):220–41. [DOI] [PubMed] [Google Scholar]
- 3.Bilsky MH, Lis E, Raizer J, Lee H, Boland P. The diagnosis and treatment of metastatic spinal tumor. Oncologist 1999;4(6):459–69. [PubMed] [Google Scholar]
- 4.Sahgal A, Whyne CM, Ma L, Larson DA, Fehlings MG. Vertebral compression fracture after stereotactic body radiotherapy for spinal metastases. Lancet Oncol 2013;14(8):e310–20 [DOI] [PubMed] [Google Scholar]
- 5.Rustoen T, Moum T, Padilla G, Paul S, Miaskowski C. Predictors of quality of life in oncology outpatients with pain from bone metastasis. J Pain Symptom Manage. 2005;30(3):234–42. [DOI] [PubMed] [Google Scholar]
- 6.Algra PR, Bloem JL, Tissing H, Falke TH, Arndt JW, Verboom LJ. Detection of vertebral metastases: comparison between MR imaging and bone scintigraphy. Radiographics. 1991;11(2):219–32 [DOI] [PubMed] [Google Scholar]
- 7.Avrahami E, Tadmor R, Dally O, Hadar H. Early MR demonstration of spinal metastases in patients with normal radiographs and CT and radionuclide bone scans. J Comput Assist Tomogr. 1989;13(4):598–602 [DOI] [PubMed] [Google Scholar]
- 8.Saip P, Tenekeci N, Aydiner A, Dincer M, Inanc S, Demir C, Oral EN, Topuz E. Response evaluation of bone metastases in breast cancer: value of magnetic resonance imaging. Cancer Invest. 1999;17(8):575–80 [DOI] [PubMed] [Google Scholar]
- 9.Boehling NS, Grosshans DR, Allen PK, McAleer MF, Burton AW, Azeem S, Rhines LD, Chang EL. Vertebral compression fracture risk after stereotactic body radiotherapy for spinal metastases. J Neurosurg Spine. 2012;16(4):379–86. [DOI] [PubMed] [Google Scholar]
- 10.Ejima Y, Matsuo Y, Sasaki R. The current status and future of radiotherapy for spinal bone metastases. J Orthop Sci. 2015;20(4):585–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Fourney DR, Frangou EM, Ryken TC, Dipaola CP, Shaffrey CI, Berven SH, Bilsky MH, Harrop JS, Fehlings MG, Boriani S, Chou D, Schmidt MH, Polly DW, Biagini R, Burch S, Dekutoski MB, Ganju A, Gerszten PC, Gokaslan ZL, Groff MW, Liebsch NJ, Mendel E, Okuno SH, Patel S, Rhines LD, Rose PS, Sciubba DM, Sundaresan N, Tomita K, Varga PP, Vialle LR, Vrionis FD, Yamada Y, Fisher CG. Spinal instability neoplastic score: an analysis of reliability and validity from the spine oncology study group. J Clin Oncol. 2011;29(22):3072–7 [DOI] [PubMed] [Google Scholar]
- 12.Walls J, Bundred N, Howell A. Hypercalcemia and bone resorption in malignancy. Clin Orthop. 1995;312(51):51–63 [PubMed] [Google Scholar]
- 13.Aizer-Dannon A, Bar-Am A, Ron IG, Flusser G, Even-Sapir E. Fused functional-anatomic images of metastatic cancer of cervix obtained by a combined gamma camera and an X-ray tube hybrid system with an illustrative case and review of the 18F-fluorodeoxyglucose literature. Gynecol Oncol. 2003;90(2):453–7 [DOI] [PubMed] [Google Scholar]
- 14.Roque M, Martinez M, Alonso P, Catala E, Garcia J, Ferrandiz M. Radioisotopes for metastatic bone pain. Cochrane Database Syst Rev. 2003;4:CD003347. [DOI] [PubMed] [Google Scholar]
- 15.Wasserthal J., Meyer M., Breit H., Cyriac J., Yang S., Segeroth M. TotalSegmentator: robust segmentation of 104 anatomical structures in CT images, 2022. URL: https://arxiv.org/abs/2208.05868. arXiv: 2208.05868 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sekuboyina Anjany, Husseini Malek E., Bayat Amirhossein, Löffler Maximilian, Liebl Hans, Li Hongwei, Tetteh Giles et al. “VerSe: a vertebrae labelling and segmentation benchmark for multi-detector CT images.” Medical image analysis 73 (2021): 102166. [DOI] [PubMed] [Google Scholar]
- 17.Yao J., Burns J. E., Munoz H., & Summers R. M. (2012). Detection of vertebral body fractures based on cortical shell unwrapping. In International Conference on Medical Image Computing and Computer-Assisted Intervention (pp. 509–516). Springer. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Yao J., Burns J. E., Forsberg D., Seitel A., Rasoulian A., Abolmaesumi P., Hammernik K., Urschler M., Ibragimov B., Korez R., Vrtovec T., Castro-Mateos I., Pozo J. M., Frangi A. F., Summers R. M., & Li S. (2016). A multi-center milestone study of clinical vertebral ct segmentation. Computerized Medical Imaging and Graphics, 49, 16–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Korez R., Ibragimov B., Likar B., Pernuš F., & Vrtovec T. (2015). A framework for automated spine and vertebrae interpolation-based detection and model-based segmentation. IEEE transactions on medical imaging, 34, 1649–1662. [DOI] [PubMed] [Google Scholar]
- 20.Korez R., Likar B., Pernus F., & Vrtovec T. (2016). Model-based segmentation of vertebral bodies from MR images with 3d CNNs. In International conference on medical image computing and computer-assisted intervention (pp. 433–441). Springer. [Google Scholar]
- 21.Ibragimov B., Likar B., Pernus F., & Vrtovec T. (2014). Shape representation for efficient landmark-based segmentation in 3-d. IEEE Transactions on Medical Imaging, 33, 861–874. [DOI] [PubMed] [Google Scholar]
- 22.Glocker B., Zikic D., Konukoglu E., Haynor D. R., & Criminisi A. (2013). Vertebrae localization in pathological spine CT via dense classification from sparse annotations. In International conference on medical image computing and computer-assisted intervention (pp. 262–270). Springer. [DOI] [PubMed] [Google Scholar]
- 23.Zhu XC, Zhang JL, Ge CT, Yu YY, Wang P, Yuan TF, Fu CY. Advances in cancer pain from bone metastasis. Drug Des Devel Ther. 2015;9:4239–45. Epub 2015/09/01. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sanhinova M, Haouchine N, Pieper SD, Wells III WM, Balboni TA, Spektor A, Huynh MA, Guenette JP, Czajkowski B, Caplan S, Doyle P. Registration of longitudinal spine CTs for monitoring lesion growth. InMedical Imaging 2024: Image Processing 2024. Apr 2 (Vol. 12926, pp. 658–661). SPIE. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Isensee F., Jaeger P. F., Kohl S. A., Petersen J., & Maier-Hein K. H. (2021). nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2), 203–211. [DOI] [PubMed] [Google Scholar]
- 26.Löffler Maximilian T., Sekuboyina Anjany, Jacob Alina, Grau Anna-Lena, Scharr Andreas, Husseini Malek El, Kallweit Mareike, Zimmer Claus, Baum Thomas, and Kirschke Jan S.. “A vertebral segmentation dataset with fracture grading.” Radiology: Artificial Intelligence 2, no. 4 (2020): e190138. [DOI] [PMC free article] [PubMed] [Google Scholar]



