Abstract
Background
To develop a machine learning model based on tumor-to-bone distance and radiomic features derived from preoperative MRI images to distinguish intramuscular (IM) lipomas and atypical lipomatous tumors/well-differentiated liposarcomas (ALTs/WDLSs) and compared with radiologists.
Methods
The study included patients with IM lipomas and ALTs/WDLSs diagnosed between 2010 and 2022, and with MRI scans (sequence/field strength: T1-weighted (T1W) imaging at 1.5 or 3.0 Tesla MRI). Manual segmentation of tumors based on the three-dimensional T1W images was performed by two observers to appraise the intra- and interobserver variability. After radiomic features and tumor-to-bone distance were extracted, it was used to train a machine learning model to distinguish IM lipomas and ALTs/WDLSs. Both feature selection and classification steps were performed using Least Absolute Shrinkage and Selection Operator logistic regression. The performance of the classification model was assessed using a tenfold cross-validation strategy and subsequently evaluated using the receiver operating characteristic curve (ROC) analysis. The classification agreement of two experienced musculoskeletal (MSK) radiologists was assessed using the kappa statistics. The diagnosis accuracy of each radiologist was evaluated using the final pathological results as the gold standard. Additionally, we compared the performance of the model and two radiologists in terms of the area under the receiver operator characteristic curves (AUCs) using the Delong’s test.
Results
There were 68 tumors (38 IM lipomas and 30 ALTs/WDLSs). The AUC of the machine learning model was 0.88 [95% CI 0.72–1] (sensitivity, 91.6%; specificity, 85.7%; and accuracy, 89.0%). For Radiologist 1, the AUC was 0.94 [95% CI 0.87–1] (sensitivity, 97.4%; specificity, 90.9%; and accuracy, 95.0%), and as to Radiologist 2, the AUC was 0.91 [95% CI 0.83–0.99] (sensitivity, 100%; specificity, 81.8%; and accuracy, 93.3%). The classification agreement of the radiologists was 0.89 of kappa value (95% CI 0.76–1). Although the AUC of the model was lower than of two experienced MSK radiologists, there was no statistically significant difference between the model and two radiologists (all P > 0.05).
Conclusions
The novel machine learning model based on tumor-to-bone distance and radiomic features is a noninvasive procedure that has the potential for distinguishing IM lipomas from ALTs/WDLSs. The predictive features that suggested malignancy were size, shape, depth, texture, histogram, and tumor-to-bone distance.
Supplementary Information
The online version contains supplementary material available at 10.1186/s13018-023-03718-4.
Keywords: Artificial intelligence, Machine learning, Lipoma, Liposarcoma, Magnetic resonance imaging, Radiomics
Introduction
Lipomatous soft tissue tumors are a group of tumors that exhibit a variety of clinical behaviors. Lipomas are the most common soft tissue tumor, accounting for one-third of soft tissue tumors [1, 2]. Lipomas are benign adipocytic tumors, and they can be treated conservatively with careful observation. Surgical excision is necessary when a patient is symptomatic [3]. Local recurrence may occur if the surgical margin is not clear, but the chance is very low. However, intramuscular (IM) lipomas that are larger than 5 cm, deep-seated, and symptomatic can sometimes be difficult to distinguish from atypical lipomatous tumors or well-differentiated liposarcomas (ALTs/WDLSs). The World Health Organization uses the terms ALT and WDLS to represent tumors with identical histology but different anatomical locations and clinical outcomes [4, 5]. ALTs characterize extremity or upper trunk lesions, and WDLSs represent retroperitoneal or mediastinal lesions. ALTs/WDLSs are low-grade, malignant, adipocytic tumors that recur locally or dedifferentiate to high-grade sarcomas, but they rarely metastasize [6, 7]. The gold standard for the diagnosis of ALTs/WDLSs is histopathological evidence of lipoblasts and lipocytes, with immunohistopathological staining positive for murine double minute 2 (MDM2) or cyclin-dependent kinase 4 (CDK4) [8, 9]. Establishing a diagnosis before surgery is crucial to dictate the urgency of surgery timing, to have a well-planned incision (a larger wound for ALTs/WDLSs), and to manage surgical margins (intracapsular incisions for lipomas, and capsular removals for ALTs/WDLSs). However, computed tomography-guided biopsies and incisional biopsies often cannot identify the malignant cell foci from the small core of a tissue sample [10]. Frozen sections of tissue are also considered unsuitable because fatty tissue is often too friable and cannot be fixed.
Magnetic resonance imaging (MRI) is the most useful diagnostic tool for lipomatous soft tissue tumors, and it can distinguish IM lipomas from ALTs/WDLSs. MR images of IM lipomas show a uniform structure of adipose tissue with high intensity in both T1- and T2-weighted images, and low signal intensity with fat suppression. The MRI features of ALTs/WDLSs are similar to those of lipomas. However, features that suggest malignancy are (1) masses that are deep to the fascia; (2) masses that are larger than 10 cm; (3) the presence of thick septa (> 2 mm); (4) an enhancement on the post-contrast sequence; (5) a nodular or non-adipose mass-like area, and (6) a decreased percentage of fat [11–13]. In contrast, high-grade liposarcomas (including myxoid/round cell, pleomorphic, and dedifferentiated liposarcomas) show low signal intensity in T1-weighted images and heterogeneous high signal intensity in T2-weighted images. However, the interobserver reliability for diagnosis of IM lipomas and ALTs/WDLSs showed slight to the substantial agreement, based on Cohen’s kappa coefficient [14]. To reduce diagnostic uncertainty in medical image classifications using MRI and to prevent inadequate or excessive treatments, diagnoses must be made by an experienced musculoskeletal (MSK) radiologist.
Currently, interest in artificial intelligence is strong and is growing rapidly. To remedy the image classification problem, several methods have been proposed, such as machine learning and deep learning [15–23]. Recently, Tang et al. [20] reported that performing machine learning in distinguishing between all types of lipomas (subcutaneous and IM lipomas) and ALTs on preoperative MRI has shown greater precision than MSK radiologists. However, no studies have yet focused on IM lipomas and ALTs/WDLSs.
Consequently, the primary objective of this study was to develop a machine learning model based on tumor-to-bone distance and radiomic features derived from preoperative MRI images to distinguish IM lipomas from ALTs/WDLSs and compared with radiologists. Regarding feature importance insights in the ultimate model, we evaluated the radiomic features that have significant in differentiating IM lipomas from ALTs/WDLSs. Along with, we examined whether a decrease in the tumor-to-bone distance could be used to pathologically differentiate ALTs/WDLSs from IM lipomas and whether it affected the clinical behavior of WDLS.
Materials and methods
Patients
This retrospective study was carried out after receiving a certificate of approval from the ethics committee of our institute, with a waiver of informed consent. Data sets were obtained from 68 patients with IM lipomas or ALTs/WDLSs. All had undergone an MRI examination and a total excision, with the diagnosis confirmed by histopathology and immunohistochemistry or fluorescence in situ hybridization (FISH) examination (The tumor with MDM2 and CDK4 amplification revealed by immunohistochemistry/FISH was diagnosed as ALT/WDLS finally. On the other hand, the tumor that had no MDM2 and CDK4 amplification was defined as IM lipoma.), at our institution between 2010 and 2022. The final pathological results were used as the gold standard for the classification process. The inclusion criteria were defined as follows: (1) the final pathological diagnosis confirmed as IM lipoma or ALT/WDLS; (2) received total excision of primary tumors; and (3) received preoperative MRI protocol that including T1-weighted image (T1WI) sequence. The exclusion criteria were listed as follows: (1) the patients received anticancer treatments before MRI scan; (2) having a history of other cancers; and (3) poor quality image. According to these criteria, 38 patients with IM lipomas and 30 patients with ALTs/WDLSs were included in this study.
MRI acquisition
The preoperative MRI of the patients was obtained using a variety of scanners and sequences. In this study, the T1WI sequence was the most available since it was part of the routine clinical protocol of our institution. Thus, we focused on this sequence. The MRI data sets comprised the axial, coronal, and sagittal planes of T1WI, all of which reveal soft tissue tumors most clearly. All images were acquired by 1.5 or 3.0 Tesla MRI scanners (Philips Healthcare; GE Medical System; Magnetom Siemens Healthineers). The MRI slice thickness was 5 mm (median, range 3–8). The repetition time (TR) and echo time (TE) were 590 ms (median, range 425–2259) and 12 ms (median, range 7–25), respectively. The detailed parameters for T1WI sequence are listed in Additional file 1: Table S1.
Three-dimensional (3D) tumor segmentation
The open-source software 3D Slicer version 4.11.20210226 r29738/7a593c8 (https://download.slicer.org/) with AI-assisted Nvidia Clara was used for manual segmentation of soft tissue tumors (Fig. 1). In all cases, lesions were segmented using the original T1WI sequences. Tumor region of interests (ROIs) were drawn on the entire volume of the lesion. Additionally, a reference region of interest (ROI) was drawn in fat on T1W MRI for image-intensity-normalization procedure [24]. All cases were selected randomly and blindly for repetitive the segmented ROI by two observers (statistician and research scientist) and subsequently confirmed their precision by experts in musculoskeletal radiology and orthopedic oncology. (One observer repeated the segmentation of all cases after a paused of 2 weeks to evaluate intraobserver variability. A second observer analyzed all cases to appraise interobserver variability.)
Image-intensity normalization
After segmentation, the next step was image-intensity normalization. The purpose of this step was to adjust for differences in T1W MRI protocols. The normalized intensity value (NIV) [24] was determined as follows: .
In all cases, T1W MRI was normalized before radiomic feature extraction.
Feature extraction
Radiomic features (107 features)
The radiomic features of a segmented ROI were extracted from the normalized T1W intensity using PyRadiomics version 3.0.1. Total radiomic features included first-order features (18 features); gray-level co-occurrence matrix (GLCM, 24 features); gray-level run length matrix (GLRLM, 16 features); gray-level size zone matrix (GLSZM, 16 features); gray-level dependence matrix (GLDM, 14 features); neighboring gray-tone difference matrix (NGTDM, 5 features); and shape-based features (3D, 14 features). These were classified as histogram-based features (i.e., first-order features), texture-based features (i.e., GLCM, GLRLM, GLSZM, GLDM, and NGTDM features), and shape-based features (Fig. 1) [25].
-
2.
Radiographic feature: tumor-to-bone distance (1 feature)
The distance from each tumor to bone was measured in 3D (Figs. 1 and 2). The distance from A to B was from the closest aspect of the soft tissue tumor (A) to the bone (B).
Classification by machine learning model
To predict soft tissue tumor differentiation, a classification model was constructed with supervised machine learning. Firstly, the data sets were randomly split into two subsets: the learning and testing data sets. (70% of data were used for the learning data set, and 30% for the testing data set.) Then, we applied a tenfold cross-validation strategy to the learning data set, which was repeatedly subdivided into training sets (ninefold) and validation sets (onefold). The training sets were used for “Feature selection” using LASSO logistic regression, and subsequently built models using LASSO logistic regression. This method is suitable for high-dimensional data [26]. For the validation sets were used to test the performance of the built model. Finally, the best model is selected and then evaluated on the testing data set (Fig. 3) [26, 27]. The performance of the classification model was assessed using the area under the curve (AUC), sensitivity, specificity, and accuracy (Fig. 4C) [27].
Classification by radiologists
Two MSK radiologists served as readers, blindly and independently, to label the same lipomatous tumor MRI test set above. Radiologist 1 and 2 had 7 and 22 years of experience retrospectively. The readers were shown T1W MRI sequences in portable network graphics (PNG) format with an image size of 1024 × 1024 pixels; they were able to zoom in and out using the software. They made binary decisions on the 68 MR images and were blinded to the reports. The ratio of normal to abnormal radiographs was also not revealed to the readers. The classification agreement of the 2 radiologists was evaluated with kappa statistics [28].
Statistical analysis
Mean and standard deviation (SD) or median and range were used to describe the continuous data, and frequency and percentage were used for categorical data. Comparison of the characteristics and the properties of the acquisition protocols of T1W MRI sequences between 2 groups (IM lipomas vs. ALTs/WDLSs) were analyzed using an independent sample t test or the Mann–Whitney U test for continuous data, and the chi-squared test or Fisher’s exact test for categorical data.
Regarding assessing intra- and interobserver variability of manual segmentation, the reproducibility of the radiomic features and tumor-to-bone distance extracted from normalized images were used to appraise the agreement of the feature values concerning intra- and interobserver variability by using the intraclass correlation coefficient (ICC). In the analysis, we chose a two-way random-effects model with an absolute agreement (ICC (2,1)) [29]. Interpretation of ICC value is as follows: poor (ICC < 0.50); moderate (0.50 ≤ ICC < 0.75); good (0.75 ≤ ICC < 0.90); and excellent (ICC ≥ 0.90) [30]. In this study, we considered a feature reproducible if the ICC value was more than 0.75 [24, 31–35]. Additionally, Pearson correlation coefficient (r)/Spearman’s rank-order coefficient () were also computed. The only features with ICC > 0.75 were utilized for machine learning procedure and statistical analysis.
Both the machine learning model and MSK radiologists, the area under the receiver operator characteristic curve (AUC), sensitivity, specificity, and accuracy values with 95% confidence interval (CI) were used to assess the performance of classification [27]. In addition, we compared the performance of the model and two radiologists in terms of the AUCs using the Delong’s test [36, 37].
Analysis of feature importance, differences in radiomic and radiographic features between two groups (IM lipomas vs. ALTs/WDLSs) were performed using the Bonferroni-corrected Mann–Whitney U test [22, 38, 39].
Statistical analysis and graphs were performed using R version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria). A P value < 0.05 was considered statistically significant.
Results
Patients
Sixty-eight patients were identified: 38 with IM lipomas and 30 with ALTs/WDLSs (Table 1). Most were women (61.8%) with deep-seated soft tissue tumors located in the thigh. The proportion of women in the IM lipoma group was significantly higher than in the ALT/WDLS group (P = 0.02). The mean age of the 68 patients was 59.3 years (range 38–80). There was no statistical difference in the mean ages of the IM lipoma and ALT/WDLS groups (P = 0.5).
Table 1.
Variable | Total subjects (n = 68) |
IM lipomas (n = 38) |
ALTs/WDLSs (n = 30) |
P value |
---|---|---|---|---|
Gender | ||||
Male | 26 (38.2%) | 10 (26.3%) | 16 (53.3%) | 0.02 |
Female | 42 (61.8%) | 28 (73.7%) | 14 (46.7%) | |
Age (years) | ||||
Mean (SD) | 59.3 (9.7) | 60.0 (9.1) | 58.1 (10.6) | 0.5 |
Range | 38–80 | 38–80 | 39–80 | |
Tumor location | ||||
Thigh | 34 | 14 | 20 | |
Arm | 15 | 11 | 4 | |
Shoulder | 9 | 9 | 0 | |
Leg | 7 | 1 | 6 | |
Back/trunk | 2 | 2 | 0 | |
Buttock | 1 | 1 | 0 |
IM lipomas intramuscular lipomas, ALTs/WDLSs atypical lipomatous tumors/well-differentiated liposarcomas
Intra- and interobserver manual segmentation variability in the feature reproducibility
Intra- and interobserver variability of manual segmentation was appraised using ICC. The only features with ICC > 0.75 (78.7%, 85 of 108 features) were considered for feature selection and model building steps. Additionally, Pearson’s correlation coefficient and Spearman’s rank-order coefficient were more than 0.7 (Additional file 1: Table S2).
The selected features in the ultimate model
The feature selection process with LASSO method revealed 14 significant features including 13 radiomic features (i.e., 3 shape-based features, 2 histogram-based features, and 8 texture-based features) and tumor-to-bone distance (Fig. 4A, B). The definition and formula of these features were described according to PyRadiomics’s documentation. (https://pyradiomics.readthedocs.io/en/latest/features.html) (Additional file 1: Table S3). All 14 were used to build the ultimate model in the next step.
Performance of machine learning model
The classification performance of the machine learning approach used in this study is summarized in Table 2 and Fig. 4C, which performed well in differentiating IM lipomas from ALTs/WDLSs. The AUC, sensitivity, specificity, and accuracy were 0.88 (95% CI 0.72–1), 91.6% (62.0–100%), 85.7% (42.0–100%), and 89.0% (67.0–98.6%), respectively (Table 2).
Table 2.
AUC (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | Accuracy (95% CI) | P value* | |
---|---|---|---|---|---|
Machine learning model | 0.88 (0.72–1) | 91.6 (62.0–100) | 85.7 (42.0–100) | 89.0 (67.0–98.6) | |
MSK radiologist 1 | 0.94 (0.87–1) | 97.4 (86.0–100) | 90.9 (71.0–99.0) | 95.0 (86.1–98.9) | 0.4 |
MSK radiologist 2 | 0.91 (0.83–0.99) | 100 (91.0–100) | 81.8 (60.0–95.0) | 93.3 (83.8–98.1) | 0.7 |
AUC The area under the receiver operator characteristic curve, CI Confidence interval
*P value based on the Delong’s test, which were used to compare the AUCs between the machine learning model and radiologists
Comparison between the machine learning model and two experienced MSK radiologists
The classification performances of the 2 MSK radiologists are detailed in Table 2. The AUCs obtained by the radiologist 1 and 2 were 0.94 (0.87–1) and 0.91 (0.83–0.99). Their respective sensitivities, specificities, and accuracies were 97.4% (86.0–100%), 90.9% (71.0–99.0%), and 95.0% (86.1–98.9%), respectively, for radiologist 1 and 100% (91.0–100%), 81.8% (60.0–95.0%), and 93.3% (83.8–98.1%), respectively, for radiologist 2 (Table 2). Although the AUC of the machine learning model was lower than that from the two MSK radiologists, there was no statistically significant difference between the model and two radiologists (all P > 0.05) (Table 2 and Fig. 5).
Fourteen feature importance insights in the ultimate model
Regarding analysis of feature importance, the 14 features were found to be significant after Bonferroni-corrected Mann–Whitney U test (Fig. 6, supporting information).
Three important shape-based features were (1) shape_SurfaceVolumeRatio (Fig. 6A), (2) shape_Sphericity (Fig. 6D), and (3) shape_MajorAxisLength (Fig. 6I), all of which suggested that ALTs/WDLS were more spherical and larger than IM lipomas.
Two histogram-based features of significance were (1) firstorder_Skewness (Fig. 6E) and (2) firstorder_TotalEnergy (Fig. 6N). The voxel intensity distribution showed negative skewness for IM lipomas and close to zero for ALTs/WDLSs. And the firstorder_TotalEnergy value for ALTs/WDLSs was higher than those for IM lipomas.
The 8 texture-based features indicated the degree of homogeneity and heterogeneity in the texture patterns in an image (Fig. 6B, C, F, H, J–M). These findings supported the position that IM lipomas exhibited more homogeneity in T1WI sequences than ALTs/WDLSs.
The tumor-to-bone distance (Fig. 6G) for IM lipomas was greater than those for ALTs/WDLSs.
Discussion
This study revealed that regular MRI attainment facilitates the diagnosis of IM lipomas and WDLSs with credible results using the machine learning model based on tumor-to-bone distance and radiomic features derived from T1-weighted, non-contrast-enhanced MR images, with AUC 0.88 (95% CI 0.72–1), 91.6% sensitivity, 85.7% specificity, and 89% accuracy.
In this cohort of patients (Table 1), the mean ages of the patients with IM lipomas and ALTs/WDLSs were comparable. Women had IM lipomas (28 of 38 cases; 73.7%) more frequently than ALTs/WDLSs (14 of 30 cases; 46.7%), with statistically significance (P = 0.02). The most common location of the IM lipomas and ALTs/WDLSs was the thigh. Most calf tumors were ALTs/WDLSs, whereas tumors located in the shoulder, arm, back, trunk, and pelvis were almost always IM lipomas. Patient demographic data, including age, gender, and localization, may suggest malignancy. However, without a combination of clinical data, the machine learning approach based on tumor-to-bone distance and radiomic features of the T1W images themselves offer promising predictions of sarcoma.
Moreover, the present radiomics model required only T1W images without T2W or gadolinium contrast, which makes this method generalizable and reproducible for use in diverse centers equipped with various MRI scanners. No vendor effect was observed between the General Electric, Siemens, and Philips MR systems for any of the radiomic features in this study (Additional file 1: Table S1).
Three previous studies [17, 18, 21] (Table 3) evaluated all types of liposarcomas using a radiomic approach on MRI, including myxoid and dedifferentiated liposarcomas and a limited number of ALTs/WDLSs. Radiologists could easily differentiate the distinct radiological features of lipomas and high-grade liposarcomas on conventional MRI, resulting in high diagnostic accuracy (91–95%). In addition, Doyle et al. [13] showed that 2 radiologists achieved 94% and 100% sensitivities as well as 64% and 76% specificities for T1-weighted images when distinguishing between lipomas and atypical lipomatous tumor/WDLS. Focusing on studies of radiomics to differentiate lipomas from ALTs/WDLSs [19, 20, 22, 23], the accuracy dropped to 67%–96% because the lipomas and ALTs/WDLSs shared common MRI features. The results of our study are comparable with those of Vos et al. [22], who applied a radiomic model to differentiate both subcutaneous and IM lipomas from ATLs/WDLSs, using shape and texture analysis. Their design evaluated 58 lipomas and 58 WDLSs, with an accuracy of 67% when using T1W images and 75% when using combined T1W and T2W images. Recently, Tang et al. [20] reported that a radiomic models with T1WI, FS T2 WI, and T1&T2WI achieved 88%, 96%, and 92% accuracies when distinguishing between lipomas and ALTs of the extremities. Our non-contrast-enhanced T1W MRI achieved equivalent performance compared with those of Tang et al. [20], with an accuracy of 89%. This may be explained by using 3D segmentation to improve accuracy and usefulness while trading with a longer analysis time and GPU space [40].
Table 3.
Study | Number of patients with lipomas in benign group | Number of patients with ALTs/WDLSs in malignant group | Classify | Machine learning algorithm | AUC | Sensitivity (%) | Specificity (%) | Accuracy (%) |
---|---|---|---|---|---|---|---|---|
Cay, 2022 [15] | 45 | 20/20 (100%) | Lipomas vs. ALTs/WDLSs | SVM | 0.98 | 96.8 | 93.7 | – |
Fradet, 2022 [16] | 40 | 45/45 (100%) | Lipomas vs. ALTs | Logistic regression | 0.50 | 100 | 0.0 | – |
SVM | 0.47 | 70.0 | 32.0 | – | ||||
Random forest | 0.71 | 64.0 | 68.0 | – | ||||
Gradient boosting | 0.70 | 67.0 | 64.0 | – | ||||
Tang, 2022 [20] | 90 | 32/32 (100%) | Lipomas vs. ALTs | Random forest | 0.94 | 85.7 | 100 | 96.0 |
Yang, 2022 [23] | 69 | 58/58 (100%) | Lipomas vs. WDLSs | SVM | 0.95 | 95.0 | 88.9 | 92.1 |
Malinauskaite, 2020 [18] | 24 | 5/14 (35.7%) | Lipomas vs. Liposarcomas | SVM | 0.93 | 88.8 | 100 | 94.7 |
LDA | 0.93 | – | – | 89.5 | ||||
Naïve Bayes | 0.81 | – | – | 79.0 | ||||
Logistic regression | 0.812 | – | – | 73.7 | ||||
Leporq, 2020 [17] | 40 | ?/41 | Lipomas vs. Liposarcomas | SVM | 0.96 | 100 | 90 | 95.0 |
Vos, 2019 [22] | 58 | 58/80 (71.6%) | Lipomas vs. WDLSs | Various* | 0.83 | 68 | 84 | 67.0 |
Thornhill, 2014 [21] | 24 | ?/20 | Lipomas vs. Liposarcomas | LDA | N/A | 85 | 96 | 91.0 |
Current study | ||||||||
Machine learning model | 38 | 30/30 (100%) | IM lipomas vs. ALTs/WDLSs | LASSO LR | 0.88 | 91.6 | 85.7 | 89.0 |
MSK radiologist 1 | 38 | 30/30 (100%) | IM lipomas vs. ALTs/WDLSs | - | 0.94 | 97.4 | 90.9 | 95.0 |
MSK radiologist 2 | 38 | 30/30 (100%) | IM lipomas vs. ALTs/WDLSs | - | 0.91 | 100 | 81.8 | 93.3 |
IM lipomas intramuscular lipomas, ALTs/WDLSs atypical lipomatous tumors/well-differentiated liposarcomas, MSK radiologist musculoskeletal radiologist, LASSO LR Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression, AUC area under the curve, N/A not available
*The model was constructed using various methods including logistic regression, support vector machine (SVM), random forests, naïve Bayes, linear discriminant analysis (LDA), and quadratic discriminant analysis (QDA)
Our work compared the classification performances of the machine learning approaches and the radiologists. When looking at the prediction accuracy, the experienced MSK radiologists demonstrated superior performance to the machine learning model because of their great expertise. However, even though there was a small set of MRIs available for the learning process, the machine learning models required less time than the radiologists to perform their readings after the segmentation process was done. Additionally, when comparing the AUCs between the model and two radiologists in Fig. 5, it can be seen that there was no statistically significant difference. Therefore, the model based on tumor-to-bone distance and radiomic features has the potential to distinguish between IM lipomas and ALTs/WDLSs and its discrimination ability is close to the radiologists.
Regarding feature importance insights in the ultimate model, the 14 features importance indicated that shape-based, histogram-based, and texture-based features were of foremost importance (Figs. 4B and 6).
Shape-based features or size features were found to be related to the prediction of malignancy and were largely integrated into the models. The ALTs/WDLSs were generally larger than IM lipomas. This illustrated that tumor size may help to characterize malignancy, as reported by previous investigations based on conventional imaging characteristics [11, 12, 41, 42]. However, in some specific areas, such as an ALT/WDLS of the hand, the lesion is not large, and a large IM lipoma of the thigh is typically not difficult to find, depending on the location and looseness of the surrounding tissue. Therefore, radiographic size is not always a reliable radiomic feature for diagnosis. Although ovoid/nodular configurations cannot distinguish the grade of malignancy (i.e., high-, intermediate, and low-grade soft tissue tumors) on MRI [43], it may differentiate IM lipomas from ALTs/WDLSs. Based on our results, ALTs/WDLSs had a more compact (sphere-like) shape than IM lipomas, as evidenced by the surface area-to-volume ratio. This radiomic feature may be correlated to ovoid/nodular configurations on MRI [43, 44]. We can observe that this shape difference may also reflect excessive cell proliferation and/or non-lipomatous or nodular fibrous septa, which showed a fluffier consistency in ALTs/WDLSs [45, 46]. Tumor shape was also a key characteristic that was visually evaluated in clinical practice [12, 42]. Further studies with expanded data sets that include larger IM lipomas or smaller ALTs/WDLSs should be performed to identify important radiomic features in addition to the size- or volume-dependent features.
Regarding the histogram features, the higher heterogeneity in the skewness map of T1W values from tumor ROI correlated with a higher grade of malignancy. The ALTs/WDLSs had a greater value of energy feature than the IM lipomas.
Texture features (particularly the features of GLSZM, GLRLM, and GLCM) were largely reported herein as relevant features. This is also consistent with clinical practice based on visual characterization of tumor heterogeneity and the presence of thick and nodular septa as key characteristics [12, 47].
In addition to histogram and texture features, we found that depth played an important role. By measuring the tumor-to-bone distance on 3D reconstructions of T1W MR images from the closest aspect of the tumor to the bone (Fig. 2), we showed that the closer a tumor was to the bone, the higher the probability that it was an ALT/WDLS rather than an IM lipoma. This implies either that IM lipomas are often found in the more superficial muscle groups or that WDLS are larger, as mentioned earlier.
Because of the higher cost of FISH and its longer test turnaround time, molecular testing may be performed only in cases of clinical suspicion. Most ALT/WDLS cases were diagnosed by evidence of mature adipocytes with pleomorphism, nuclear atypia, and hyperchromatic stromal cells on hematoxylin and eosin stain. This method is widely accepted in our country. Using the machine learning approach may not replace molecular testing for MDM2 and CDK4. However, it may help to alert physicians to the need to refer a patient to a sarcoma center at the earliest opportunity. Thus, the use of the machine learning approach may also facilitate the development of plans for surgical treatment and follow-up.
We demonstrated that machine learning approach based on tumor-to-bone distance and radiomic features retrieved from 3D segmentation MRI can distinguish between IM lipomas and ALTs/WDLSs and provide quantitative information for their differential diagnosis. There are 3 main advantages of using the machine learning approach to differentiate between IM lipomas and ALTs/WDLSs before surgery. First, it obviates the need for a patient to undergo a biopsy. As mentioned above, biopsies of lipomatous tumors can produce unfavorable results because the foci of malignant cells that lie in the septum are difficult to reach. Another advantage is that this approach helps surgeons decide whether to wait or to accelerate the timing of surgery. This is because surgery on an IM lipoma can wait, but an ALT/WDLS cannot be left untreated for a long time as it has a 1–5% chance of becoming a dedifferentiated liposarcoma [5, 48, 49]. Furthermore, the use of the machine learning approach can guide the decision by a surgeon to use a minimally invasive incision (for an IM lipoma) or a larger incision with wider margins (for ALT/WDLS). Once an IM lipoma or ALT/WDLS has been resected and its diagnosis has been confirmed by histopathology, a different follow-up protocol is used for each. For an ALT/WDLS, a serial physical examination and/or MRI is performed every 3–4 months during the first 2 postoperative years and once a year for the following 3 years. This protocol is employed because WDLSs have a greater chance of recurrence and dedifferentiation to sarcomas than IM lipomas, but they rarely metastasize [6, 50, 51]. In contrast, IM lipomas require much less frequent follow-up. No additional treatment, including irradiation or chemotherapy, will be administered to either group.
Our study has some limitations. First, most of our IM lipomas were smaller in size. This volume bias was mentioned earlier in this article. Additionally, the sample sizes of 38 IM lipomas and 30 ALTs/WDLSs were relatively small, especially for diagnostic efficiency. We will continually evaluate the preoperative MRI examination of IM lipomas and ALTs/WDLSs to progressively refine our model. The model was based solely on selected retrospective data related to masses that had been surgically removed and for which pathological reports were available. However, small tumors treated with observation or masses that had not been referred to the sarcoma center were excluded from the data sets. Furthermore, we used manual 3D segmentation with semiautomatic contoured images by two observers (statistician and research scientist) which were confirmed by experienced MSK radiologists. This may be subject to intra- and interobserver variability; however, the results showed no significant difference between observers. Finally, we did not compare the different pathological subtypes of lipomas and ALTs/WDLSs (e.g., the sclerosing type). However, the subtype may not alter the surgical procedure.
Conclusions
Quantitative analysis of an artificial intelligence-based system using machine learning approach based on tumor-to-bone distance and radiomic features derived from MRI images can differentiate between IM lipomas and ALTs/WDLSs. The predictive features that suggested malignancy were size, shape, depth, texture, histogram, and tumor-to-bone distance. This comprehensive study of multiple features contributes to the diagnosis of IM lipomas and ALTs/WDLSs. The further development of artificial intelligence-based systems that will be gained through training with a larger number of patients and improving the mathematical methods will improve the accuracy of these systems in diagnosing IM lipomas and ALTs/WDLSs.
Supplementary Information
Acknowledgements
The authors thank Supani Duangkaew for her research coordination and Pakorn Yodprom for physics consultation and image retrieval. The authors also gratefully acknowledge the professional English editing by David Park.
Author contributions
All authors contributed to the study’s conception and design. CC contributed to administrative support. Material preparation, data collection and analysis, and interpretation were performed by all authors. The first draft of the manuscript was written by NS and CC. All authors finally read and approved the manuscript.
Funding
The research was supported by the Siriraj Research Fund (Grant No. [IO] R016332025), Faculty of Medicine Siriraj Hospital, Mahidol University), and the Siriraj Foundation Fund for advanced sarcoma research (D004146).
Declarations
Ethics approval and consent to participate
The study was approved by the Institutional Review Board of the Faculty of Medicine Siriraj Hospital, Mahidol University (COA No. SI 874/2020).
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.McTighe S, Chernev I. Intramuscular lipoma: a review of the literature. Orthop Rev (Pavia) 2014;6(4):5618. doi: 10.4081/or.2014.5618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Ramos-Pascua LR, Guerra-Álvarez OA, Sánchez-Herráez S, Izquierdo-García FM, Maderuelo-Fernández J. Intramuscular lipomas: Large and deep benign lumps not to underestimated. Review of a series of 51 cases. Rev Esp Cir Ortop Traumatol. 2013;57(6):391–397. doi: 10.1016/j.recot.2013.09.010. [DOI] [PubMed] [Google Scholar]
- 3.Su CH, Hung JK, Chang IL. Surgical treatment of intramuscular, infiltrating lipoma. Int Surg. 2011;96(1):56–59. doi: 10.9738/1396.1. [DOI] [PubMed] [Google Scholar]
- 4.Sbaraglia M, Dei Tos, A P Pedeutour F. Atypical lipomatous tumor/well-differentiated liposarcoma. In: The WHO classification of tumours editorial board, editor. Soft Tissue and Bone Tumours. WHO Classification of Tumours. 3. 5th ed. Lyon, France: IARC Press; 2020. p. 36–8.
- 5.Johnson CN, Ha AS, Chen E, Davidson D. Lipomatous soft-tissue tumors. J Am Acad Orthop Surg. 2018;26(22):779–788. doi: 10.5435/JAAOS-D-17-00045. [DOI] [PubMed] [Google Scholar]
- 6.Lazarides AL, Ferlauto HR, Burke ZDC, Griffin AM, Leckey BD, Bernthal NM, et al. The utility of chest imaging for surveillance of atypical lipomatous tumors. Sarcoma. 2021;2021:4740924. doi: 10.1155/2021/4740924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yamamoto N, Hayashi K, Tanzawa Y, Kimura H, Takeuchi A, Igarashi K, et al. Treatment strategies for well-differentiated liposarcomas and therapeutic outcomes. Anticancer Res. 2012;32(5):1821–1825. [PubMed] [Google Scholar]
- 8.Binh MB, Sastre-Garau X, Guillou L, de Pinieux G, Terrier P, Lagacé R, et al. MDM2 and CDK4 immunostainings are useful adjuncts in diagnosing well-differentiated and dedifferentiated liposarcoma subtypes: a comparative analysis of 559 soft tissue neoplasms with genetic data. Am J Surg Pathol. 2005;29(10):1340–1347. doi: 10.1097/01.pas.0000170343.09562.39. [DOI] [PubMed] [Google Scholar]
- 9.Thway K, Flora R, Shah C, Olmos D, Fisher C. Diagnostic utility of p16, CDK4, and MDM2 as an immunohistochemical panel in distinguishing well-differentiated and dedifferentiated liposarcomas from other adipocytic tumors. Am J Surg Pathol. 2012;36(3):462–469. doi: 10.1097/PAS.0b013e3182417330. [DOI] [PubMed] [Google Scholar]
- 10.Thavikulwat AC, Wu JS, Chen X, Anderson ME, Ward A, Kung J. Image-guided core needle biopsy of adipocytic tumors: diagnostic accuracy and concordance with final surgical pathology. AJR Am J Roentgenol. 2021;216(4):997–1002. doi: 10.2214/AJR.20.23080. [DOI] [PubMed] [Google Scholar]
- 11.Nagano S, Yokouchi M, Setoguchi T, Ishidou Y, Sasaki H, Shimada H, et al. Differentiation of lipoma and atypical lipomatous tumor by a scoring system: implication of increased vascularity on pathogenesis of liposarcoma. BMC Musculoskelet Disord. 2015;16(1):36. doi: 10.1186/s12891-015-0491-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wang S, Chan LW, Tang X, Su C, Zhang C, Sun K, et al. A weighted scoring system to differentiate malignant liposarcomas from benign lipomas. J Orthop Surg (Hong Kong) 2016;24(2):216–221. doi: 10.1177/1602400219. [DOI] [PubMed] [Google Scholar]
- 13.Doyle AJ, Pang AK, Miller MV, French JG. Magnetic resonance imaging of lipoma and atypical lipomatous tumour/well-differentiated liposarcoma: observer performance using T1-weighted and fluid-sensitive MRI. J Med Imaging Radiat Oncol. 2008;52(1):44–48. doi: 10.1111/j.1440-1673.2007.01910.x. [DOI] [PubMed] [Google Scholar]
- 14.O'Donnell PW, Griffin AM, Eward WC, Sternheim A, White LM, Wunder JS, et al. Can experienced observers differentiate between lipoma and well-differentiated liposarcoma using only MRI? Sarcoma. 2013;2013:982784. doi: 10.1155/2013/982784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cay N, Mendi BAR, Batur H, Erdogan F. Discrimination of lipoma from atypical lipomatous tumor/well-differentiated liposarcoma using magnetic resonance imaging radiomics combined with machine learning. Jpn J Radiol. 2022;40(9):951–960. doi: 10.1007/s11604-022-01278-x. [DOI] [PubMed] [Google Scholar]
- 16.Fradet G, Ayde R, Bottois H, El Harchaoui M, Khaled W, Drapé JL, et al. Prediction of lipomatous soft tissue malignancy on MRI: comparison between machine learning applied to radiomics and deep learning. Eur Radiol Exp. 2022;6(1):41. doi: 10.1186/s41747-022-00295-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Leporq B, Bouhamama A, Pilleul F, Lame F, Bihane C, Sdika M, et al. MRI-based radiomics to predict lipomatous soft tissue tumors malignancy: a pilot study. Cancer Imaging. 2020;20(1):78. doi: 10.1186/s40644-020-00354-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Malinauskaite I, Hofmeister J, Burgermeister S, Neroladaki A, Hamard M, Montet X, et al. Radiomics and machine learning differentiate soft-tissue lipoma and liposarcoma better than musculoskeletal radiologists. Sarcoma. 2020;2020:7163453. doi: 10.1155/2020/7163453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Pressney I, Khoo M, Endozo R, Ganeshan B, O'Donnell P. Pilot study to differentiate lipoma from atypical lipomatous tumour/well-differentiated liposarcoma using MR radiomics-based texture analysis. Skelet Radiol. 2020;49(11):1719–1729. doi: 10.1007/s00256-020-03454-4. [DOI] [PubMed] [Google Scholar]
- 20.Tang Y, Cui J, Zhu J, Fan G. Differentiation between lipomas and atypical lipomatous tumors of the extremities using radiomics. J Magn Reson Imaging. 2022;56(6):1746–1754. doi: 10.1002/jmri.28167. [DOI] [PubMed] [Google Scholar]
- 21.Thornhill RE, Golfam M, Sheikh A, Cron GO, White EA, Werier J, et al. Differentiation of lipoma from liposarcoma on MRI using texture and shape analysis. Acad Radiol. 2014;21(9):1185–1194. doi: 10.1016/j.acra.2014.04.005. [DOI] [PubMed] [Google Scholar]
- 22.Vos M, Starmans MPA, Timbergen MJM, van der Voort SR, Padmos GA, Kessels W, et al. Radiomics approach to distinguish between well differentiated liposarcomas and lipomas on MRI. Br J Surg. 2019;106(13):1800–1809. doi: 10.1002/bjs.11410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Yang Y, Zhou Y, Zhou C, Ma X. Novel computer aided diagnostic models on multimodality medical images to differentiate well differentiated liposarcomas from lipomas approached by deep learning methods. Orphanet J Rare Dis. 2022;17(1):158. doi: 10.1186/s13023-022-02304-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sudjai N, Siriwanarangsun P, Lektrakul N, Saiviroonporn P, Maungsomboon S, Phimolsarnti R, et al. Robustness of radiomic features: two-dimensional versus three-dimensional MRI-based feature reproducibility in lipomatous soft-tissue tumors. Diagnostics. 2023;13(2):258. doi: 10.3390/diagnostics13020258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mayerhoefer ME, Materka A, Langs G, Häggström I, Szczypiński P, Gibbs P, et al. Introduction to radiomics. J Nucl Med. 2020;61(4):488–495. doi: 10.2967/jnumed.118.222893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B Methodol. 1996;58(1):267–288. [Google Scholar]
- 27.Dinov ID. Data science and predictive analytics. 1. Cham: Springer; 2018. p. 832. [Google Scholar]
- 28.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. doi: 10.2307/2529310. [DOI] [PubMed] [Google Scholar]
- 29.Traverso A, Kazmierski M, Welch ML, Weiss J, Fiset S, Foltz WD, et al. Sensitivity of radiomic features to inter-observer variability and image pre-processing in Apparent Diffusion Coefficient (ADC) maps of cervix cancer patients. Radiother Oncol. 2020;143:88–94. doi: 10.1016/j.radonc.2019.08.008. [DOI] [PubMed] [Google Scholar]
- 30.Koo TK, Li MY. A Guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–163. doi: 10.1016/j.jcm.2016.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Schmidt B, Dick A, Treutlein M, Schiller P, Bunck AC, Maintz D, et al. Intra- and inter-observer reproducibility of global and regional magnetic resonance feature tracking derived strain parameters of the left and right ventricle. Eur J Radiol. 2017;89:97–105. doi: 10.1016/j.ejrad.2017.01.025. [DOI] [PubMed] [Google Scholar]
- 32.Kocak B, Durmaz ES, Kaya OK, Ates E, Kilickesmez O. Reliability of single-slice-based 2D CT texture analysis of renal masses: influence of intra- and interobserver manual segmentation variability on radiomic feature reproducibility. AJR Am J Roentgenol. 2019;213(2):377–383. doi: 10.2214/AJR.19.21212. [DOI] [PubMed] [Google Scholar]
- 33.Rizzetto F, Calderoni F, De Mattia C, Defeudis A, Giannini V, Mazzetti S, et al. Impact of inter-reader contouring variability on textural radiomics of colorectal liver metastases. Eur Radiol Exp. 2020;4(1):62. doi: 10.1186/s41747-020-00189-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zwanenburg A, Leger S, Agolli L, Pilz K, Troost EGC, Richter C, et al. Assessing robustness of radiomic features by image perturbation. Sci Rep. 2019;9(1):614. doi: 10.1038/s41598-018-36938-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Cattell R, Chen S, Huang C. Robustness of radiomic features in magnetic resonance imaging: review and a phantom study. Vis Comput Ind Biomed Art. 2019;2(1):19. doi: 10.1186/s42492-019-0025-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–845. doi: 10.2307/2531595. [DOI] [PubMed] [Google Scholar]
- 37.Lu J, Zhou Y, Lv W, Zhu H, Tian T, Yan S, et al. Identification of early invisible acute ischemic stroke in non-contrast computed tomography using two-stage deep-learning model. Theranostics. 2022;12(12):5564–5573. doi: 10.7150/thno.74125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Mitchell PJ. Experimental design and statistical analysis for pharmacology and the biomedical sciences. 1. Chichester: Wiley-Blackwell; 2022. [Google Scholar]
- 39.Xue Z, Wang L, Sun Q, Xu J, Liu Y, Ai S, et al. Radiomics analysis using MR imaging of subchondral bone for identification of knee osteoarthritis. J Orthop Surg Res. 2022;17(1):414. doi: 10.1186/s13018-022-03314-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dong X, Li H, Jiang Z, Grünleitner T, Güler İ, Dong J, et al. 3D deep learning enables accurate layer mapping of 2D materials. ACS Nano. 2021;15(2):3139–3151. doi: 10.1021/acsnano.0c09685. [DOI] [PubMed] [Google Scholar]
- 41.Coran A, Ortolan P, Attar S, Alberioli E, Perissinotto E, Tosi AL, et al. Magnetic resonance imaging assessment of lipomatous soft-tissue tumors. In Vivo. 2017;31(3):387–395. doi: 10.21873/invivo.11071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jaovisidha S, Suvikapakornkul Y, Woratanarat P, Subhadrabandhu T, Nartthanarung A, Siriwongpairat P. MR imaging of fat-containing tumours: the distinction between lipoma and liposarcoma. Singap Med J. 2010;51(5):418–423. [PubMed] [Google Scholar]
- 43.Sedaghat S, Salehi Ravesh M, Sedaghat M, Both M, Jansen O. Configuration of soft-tissue sarcoma on MRI correlates with grade of malignancy. Radiol Oncol. 2021;55(2):158–163. doi: 10.2478/raon-2021-0007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Sedaghat S, Salehi Ravesh M, Sedaghat M, Meschede J, Jansen O, Both M. Does the primary soft-tissue sarcoma configuration predict configuration of recurrent tumors on magnetic resonance imaging? Acta Radiol. 2022;63(5):642–651. doi: 10.1177/02841851211008381. [DOI] [PubMed] [Google Scholar]
- 45.Burt AM, Huang BK. Imaging review of lipomatous musculoskeletal lesions. Sicot J. 2017;3:34. doi: 10.1051/sicotj/2017015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Drevelegas A, Pilavaki M, Chourmouzi D. Lipomatous tumors of soft tissue: MR appearance with histological correlation. Eur J Radiol. 2004;50(3):257–267. doi: 10.1016/j.ejrad.2004.01.022. [DOI] [PubMed] [Google Scholar]
- 47.Brisson M, Kashima T, Delaney D, Tirabosco R, Clarke A, Cro S, et al. MRI characteristics of lipoma and atypical lipomatous tumor/well-differentiated liposarcoma: retrospective comparison with histology and MDM2 gene amplification. Skelet Radiol. 2013;42(5):635–647. doi: 10.1007/s00256-012-1517-z. [DOI] [PubMed] [Google Scholar]
- 48.Lu J, Wood D, Ingley E, Koks S, Wong D. Update on genomic and molecular landscapes of well-differentiated liposarcoma and dedifferentiated liposarcoma. Mol Biol Rep. 2021;48(4):3637–3647. doi: 10.1007/s11033-021-06362-5. [DOI] [PubMed] [Google Scholar]
- 49.Dei Tos A, Marino-Enriquez A, Pedeutour F. Dedifferentiated liposarcoma. In: The WHO classification of tumours editorial board, editor. Soft Tissue and Bone Tumours. WHO Classification of Tumours. 3. 5th ed ed. Lyon, France: IARC Press; 2020. p. 39–41.
- 50.Kiatisevi P, Asavamongkolkul A, Phimolsarnti R, Waikakul S, Benjarassamerote S. The outcomes and prognostic factors of patients with soft-tissue sarcoma. J Med Assoc Thai. 2006;89(3):334–342. [PubMed] [Google Scholar]
- 51.Presman B, Jauffred SF, Kornø MR, Petersen MM. Low recurrence rate and risk of distant metastases following marginal surgery of intramuscular lipoma and atypical lipomatous tumors of the extremities and trunk wall. Med Princ Pract. 2020;29(3):203–210. doi: 10.1159/000503621. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.