Utility of artificial intelligence in a binary classification of soft tissue tumors

Jing Di; Caylin Hickey; Cody Bumgardner; Mustafa Yousif; Mauricio Zapata; Therese Bocklage; Bonnie Balzer; Marilyn M Bui; Jerad M Gardner; Liron Pantanowitz; Shadi A Qasem

doi:10.1016/j.jpi.2024.100368

. 2024 Feb 15;15:100368. doi: 10.1016/j.jpi.2024.100368

Utility of artificial intelligence in a binary classification of soft tissue tumors

Jing Di ^a, Caylin Hickey ^a, Cody Bumgardner ^a, Mustafa Yousif ^b, Mauricio Zapata ^c, Therese Bocklage ^a, Bonnie Balzer ^d, Marilyn M Bui ^e, Jerad M Gardner ^f, Liron Pantanowitz ^g, Shadi A Qasem ^a,^h,^⁎

PMCID: PMC10940995 PMID: 38496781

Abstract

Soft tissue tumors (STTs) pose diagnostic and therapeutic challenges due to their rarity, complexity, and morphological overlap. Accurate differentiation between benign and malignant STTs is important to set treatment directions, however, this task can be difficult. The integration of machine learning and artificial intelligence (AI) models can potentially be helpful in classifying these tumors. The aim of this study was to investigate AI and machine learning tools in the classification of STT into benign and malignant categories. This study consisted of three components: (1) Evaluation of whole-slide images (WSIs) to classify STT into benign and malignant entities. Five specialized soft tissue pathologists from different medical centers independently reviewed 100 WSIs, representing 100 different cases, with limited clinical information and no additional workup. The results showed an overall concordance rate of 70.4% compared to the reference diagnosis. (2) Identification of cell-specific parameters that can distinguish benign and malignant STT. Using an image analysis software (QuPath) and a cohort of 95 cases, several cell-specific parameters were found to be statistically significant, most notably cell count, nucleus/cell area ratio, nucleus hematoxylin density mean, and cell max caliper. (3) Evaluation of machine learning library (Scikit-learn) in differentiating benign and malignant STTs. A total of 195 STT cases (156 cases in the training group and 39 cases in the validation group) achieved approximately 70% sensitivity and specificity, and an AUC of 0.68. Our limited study suggests that the use of WSI and AI in soft tissue pathology has the potential to enhance diagnostic accuracy and identify parameters that can differentiate between benign and malignant STTs. We envision the integration of AI as a supportive tool to augment the pathologists' diagnostic capabilities.

Keywords: Artificial intelligence, Digital pathology, Sarcoma, Soft tissue tumors, Whole-slide images, Deep learning, Diagnosis

Introduction

Soft tissue tumors (STT) represent a complex diagnostic area within oncology due to their rarity and heterogeneity. The diagnostic process is fraught with challenges, often resulting from the limited availability of specialized expertise and the broad spectrum of tumor subtypes, which can result in delayed or incorrect diagnoses.¹ A study reviewing second opinion diagnosis in STT found a discordance rate of 38%, with 25% of these cases being classified as major diagnostic error impacting patient management. This has also been shown to increase the number and cost of malpractice cases in sarcoma care.²^,³ Emerging technologies such as digital pathology and radiomics show promise in improving the accuracy of cancer diagnosis, characterization, and monitoring.4, 5, 6, 7, 8

The utilization of whole-slide images (WSIs) has made it easier to obtain consultations, even across different institutions.9, 10, 11 Soft tissue pathology, particularly, presents significant challenges for most pathologists, often necessitating expert consultations. Molecular techniques, including next-generation sequencing, have contributed significantly to better classification of certain soft tissue sarcomas.¹² However, getting a timely and accurate initial working diagnosis mainly based on histology will expedite the initiation of appropriate management; hence the need for additional tools to aid in that initial assessment.

Recent advancements in digital pathology and artificial intelligence (AI) have begun to show potential in enhancing diagnostic precision. Studies such as that by Foersch et al¹³ have illustrated significant advancements in the performance of AI-assisted diagnoses in soft tissue sarcoma, emphasizing the progressive nature of this field. This underscores the need for ongoing research to refine AI applications in pathology, ensuring they are robust across various studies and datasets. Distinguishing between benign/reactive processes and malignant ones is a complex task in soft tissue pathology. The large number of entities involved, and the rarity of these tumors make it challenging to apply existing machine learning and AI models to this field.⁴^,⁶^,¹³

The current investigation aims at exploring the application of AI in the classification of STT into benign and malignant entities, as a step towards the incorporation of AI into the clinical workflow. The goals of this study are: (1) identify cell-specific parameters that can aid in the classification of STTs as benign vs malignant, and (2) explore the capabilities of AI, in comparison to expert pathologists, in evaluating benign and malignant STT. By addressing these goals, the study aims to develop AI techniques that aid in accurately diagnosing STT.

Methods

For all the experiments below, the ground truth was the original diagnosis obtained utilizing all glass slide, immunostains and molecular techniques, if necessary (Table 4).

Table 4.

List of cases used for all study arms (Study arms 1, 2, and 3). Cases with * are not considered soft tissue tumors but have either presented in a location and/or showed morphological overlap with soft tissue tumors warranting them being submitted to the soft tissue service for consultation.

Case	Age/Sex	Location	Diagnosis
1	45 F	Esophagus	Ewing/PNET
2	88 F	Chest wall	Elastofibroma
3	3m M	Flank	Kaposiform hemangioendothelioma
5	8m M	Thigh	Juvenile xanthogranuloma
6	63 F	Vulva	Angiomyofibroblastoma
7	53 M	Leg	Cutaneous leiomyosarcoma
8	54 M	Intestinal	Malignant GIST
9	26 M	Rectum	*Balloon cell melanoma
11	31 M	Thumb	Perineurioma
12	49 M	Mediastinal	*Type-A thymoma
14	34 F	Retroperitoneal	Ewing/PNET
15	76 M	Elbow	*Late stage erythema elevatum diutinum
17	42 M	Arm	Nodular fasciitis
20	78 M	Shoulder	Ischemic fasciitis
21	95 F	Intra-abdominal	*Granulosa cell tumor
22	15 F	Subcutaneous	Malignant giant cell tumor of soft parts
26	42 M	Hip	Schwannoma
27	45 F	Arm	MPNST arising in a neurofibroma
28	36 M	Ankle	Clear cell sarcoma
29	2 F	Thigh	Sclerosing rhabdomyosarcoma
31	57 F	Vagina	Benign genital stromal polyp
34	50 F	Thigh	Biphasic synovial sarcoma, grade 2
35	55 M	Shoulder	Pleomorphic rhabdomyosarcoma, high grade
36	66 F	Scapula	HPC/SFT with malignant potential
38	79 F	Groin	*Metastatic melanoma
39	13 M	Leg	DFSP
40	47 M	Retroperitoneal	Schwannoma
42	7 F	Thigh	Pleomorphic sarcoma with giant cells (malignant giant cell tumor of soft parts)
45	8 M	Shoulder	Granular cell tumor
46	62 M	Nose	Well differentiated fibrosarcoma (grade 1)
48	29 M	Preauricular	solitary fibrous tumor/hemangiopericytoma
49	16 M	Tongue	Granular cell tumor
51	19 M	Intra-ventricular	Hemangioma, cavernous/capillary type
52	59 F	Pelvis	*High grade carcinosarcoma
53	49 F	Uterus	Epithelioid leiomyoma
56	24 M	Trunk	Dermatofibrosarcoma protuberans
58	38 F	Foot	Marked Stasis changes
60	42 F	Uterine serosa	Mesenchymal tumor, favor unusual smooth muscle tumor of uncertain malignant potential
63	78 M	Arm	Kaposi sarcoma
68	66 M	Abdomen	*Diffuse follicle center lymphoma
69	8 F	Small bowel	Reactive changes
70	39 F	Forearm	Sarcoma with myofibroblastic features
74	55 F	Axillary	*Malignant S100 positive tumor, favor melanoma
77	23 F	Retroperitoneal	Angiolymphoid hyperplasia (with eosinophilia)
78	59 M	Back	*Adnexal neoplasm of at least low grade malignancy
79	52 M	Arm	Myxoid variant of hemangiopericytoma/SFT
80	66 M	Parotid	Pleomorphic sarcoma with myoid differentiation
81	42 F	Arm	Epithelioid sarcoma
82	25 M	Periumbilical	Myoepithelial tumor of soft tissue, histologically benign
84	69 F	Omentum	*Low grade endometrial stromal sarcoma
86	42 M	Mesenteric	Follicular dendritic cell tumor
87	52 F	Breast	Histiocytic tumor of uncertain malignant potential
90	1 F	Abdominal wall	Juvenile xanthogranuloma
92	6 F	Finger	Cellular juvenile aponeurotic fibroma
93	84 M	Temple	*Ulcerating carcinoma
94	20 F	Chest wall	Monophasic synovial sarcoma, high grade
95	5 F	R foot	Fibrous histiocytoma
97	41 F	R tongue	Myofibroma
99	67 M	Toe	Low grade pleomorphic sarcoma
103	8 M	Hand	*Cellular blue nevus
104	30 M	Shoulder	*Soft tissue chordoma with bone erosion
105	42 F	Lower abdomen	*Malignant tumor c/w myoepithelial carcinoma, high grade
107	1 M	R chest	Infantile fibromatosis
108	35 M	Spine (T1)	Metastatic malignant peripheral nerve sheath tumor (MPNST), epithelioid type
109	43 M	Flank	Angiosarcoma, high grade
120	41 F	Breast	Leiomyoma
122	39 M	Stomach	Malignant gastrointestinal stromal tumor (GIST)
124	41 M	Rectum	Gastrointestinal stromal tumor
127	47 M	Subcutaneous	Low grade fibromyxoid sarcoma
129	12 M	Toe	Benign mesenchymal tumor with features of angiomatosis and myofibromatosis
130	67 F	Vulva	Benign genital stromal tumor
135	43 F	Wrist	Pleomorphic undifferentiated sarcoma, high grade
139	42 F	Arm	Extranodal Rosai Dorfman disease
141	75 M	Sinonasal	Hemangiopericytoma-like tumor of the nasal passages
142	45 M	Kidney	Malignant glomus tumor
143	16 M	Knee	Necrobiotic granuloma
145	51 M	Neck	Benign mesenchymal tumor, favor spindle cell lipoma
148	85 M	Scalp	*Poorly differentiated carcinoma, probably metastatic
151	45 M	Retroperitoneal	Monophasic synovial sarcoma, high grade
153	47 M	Upper leg	Dermatofibrosarcoma protuberans
157	51 F	Mediastinal	Myxoid/round cell liposarcoma, high grade
158	64 F	Toe	Ewing sarcoma
160	30 F	Paraspinal	Sclerosing epithelioid fibrosarcoma
164	14 M	L4 vertebra	Langerhans cell histiocytosis
165	1 M	Hip	Calcifying aponeurotic fibroma
167	31 M	Scalp	Alveolar soft part sarcoma. Rule out metastasis
168	5 M	Tongue	Reactive myofibroblastic proliferation
169	58 F	Sternocleidomastoid muscle	Soft tissue myoepithelioma, histologically benign
171	26 M	Omentum	Benign fibroblastic proliferation, favor reactive
174	71 F	Neck	Malignant hemangiopericytoma/solitary fibrous tumor, high grade
176	92 M	Neck	*Desmoplastic melanoma
177	33 F	Calf	Neuroblastoma-like schwannoma (schwannoma with collagen rosettes)
178	66 M	Patella	Glomus tumor
179	32 F	Knee	Angiomatoid fibrous histiocytoma
180	39 M	Shoulder	Kaposi sarcoma
184	59 F	Orbit	Hemangiopericytoma
186	18 M	Scrotum	Embryonal rhabdomyosarcoma
188	62 F	Leg	Fibrous histiocytoma with atypical (monster) cells
191	34 F	Calf	*Paraganglioma-like dermal melanocytic tumor
193	47 F	Uterine serosa	Leiomyoma
195	3 F	labia majora	Lipoblastoma
196	19 F	Brachial plexus	Epithelioid nerve sheath tumor, probably of low grade malignancy
198	60 F	Mesenteric/small bowel	Bacillary angiomatosis
199	44 F	Calf	Extraskeletal myxoid chondrosarcoma
200	50 F	Anus	*Malignant melanoma

Open in a new tab

Expert pathologist review (study arm 1)

Five soft tissue pathologists from five different medical centers independently reviewed 100 WSI of hematoxylin and eosin (H&E)-stained slides representing 100 different soft tissue cases, following the institutional research protocol approved by the Institutional Review Board. Only one slide per case was provided with limited clinical information (patient age, gender, and anatomic location). Immunohistochemical and molecular information was not provided to the reviewing pathologists. Deidentified slides were scanned at 20× magnification using an Aperio scanner (Aperio AT2, Leica Biosystems, Illinois), and acquired digital files were converted from .svs to a DICOM format. They were uploaded to a locally hosted, externally accessible compute node for review using a web-based WSI viewer system (Orthanc v1.3.2 WSI Plugin v0.5). Pathologists were asked to choose one of four diagnostic categories (benign, intermediate/borderline, malignant, and uncertain) and to provide up to three differential diagnoses. A REDCap survey (https://www.project-redcap.org/) was used to record the answers.

Answers were compared to the original “ground-truth” diagnoses. Major and minor discordances, as well as uncertainty were defined as follows. Major discordance referred to discrepancies that could change patient management or prognosis, such as mistaking a benign tumor for a malignant one, or vice versa. Minor discordance involved differences that would not affect overall treatment, such as subclassification within the same category of benign or malignant tumors. Uncertainty was used for cases where a definitive diagnosis could not be reached due to insufficient information or ambiguous histological features.

Identifying cell specific parameters (study arm 2)

A cohort of 95 STT cases was utilized from the “expert pathologist review” experiment described above, including 60 benign and 35 malignant cases, encompassing 68 distinct STT entities. The cases were scanned at 40× magnification using a high-throughput scanner (Aperio AT2) and uploaded to an OMERO server¹⁴ for annotation. The regions of interest (ROIs) were marked on all slides by the pathologist (Fig. 1). At the highest layer of resolution, the slide images were divided into 768-pixel tiles (Fig. 2). Each tile was evaluated for tissue percentage and color factors using pre-processing software developed as part of the Deep HistoPath project (https://github.com/CODAIT/deep-histopath). The metadata associated with tile and cell metrics was used to select the top 500 tiles per case within ROI. Cell detection was performed on each tile with at least 25% tissue using QuPath,¹⁵ providing 38 cell-specific parameters, which were averaged per tile. The cell-based metrics were averaged both across the top tiles per case and at the case-level. Welch’s t test was used, with p < 0.05 considered significant.

Fig. 1 — Representative whole-slide image uploaded to OMERO for annotation of ROI.

Fig. 2 — Representative case divided into 768-pixel tiles at the highest layer of resolution. 38 cell-specific parameters were detected using QuPath, which were averaged per tile.

Employing an AI model to differentiate benign vs malignant soft tissue tumors (study arm 3)

A total of 195 soft tissue cases were collected from the files of one of the authors (SQ), encompassing the 95 cases from the “cell-specific parameters” experiment described above. These cases were divided into a training group (156) and validation group (39). A free software machine learning library for Python programming language (Scikit-learn) was employed to analyze these cases into benign and malignant STT (https://scikit-learn.org/stable/). Scikit-learn, an open-source machine learning library for Python programming language, offers a wide range of classification, regression, and clustering algorithms, including support-vector machines, random forests, and gradient boosting.

Results

Pathologist diagnostic concordance rate using WSI

The concordance rate of the pathologists when assessed against the reference diagnosis, made using traditional microscope, was 70.4% across the four diagnostic categories. In detail, minor discordances were observed in 11.6% of the cases, where the pathologists’ diagnoses were close but not exactly the same as the reference. Major discordances, wherein the diagnoses substantially differed, occurred in 5% of the cases. Additionally, there was a 13% rate of uncertainty where the pathologists could not reach a definitive diagnosis (Table 1). Analyzing the cases further, malignant tumors were most accurately diagnosed with an 81.7% concordance rate, whereas benign cases exhibited the highest rate of major discordance at 7.7%. Intermediate cases showed the highest rates of minor discordance (28%) and uncertainty (22.4%) (Fig. 3). Overall, a correct differential diagnosis was established in 63% of the cases. Additionally, the pathologists reported either excellent or satisfactory quality of the scanned images for 96.4% of the cases.

Table 1.

Summary of study findings (study arm 1). “Concordance” indicates agreement with the reference diagnosis with regards to the diagnostic categories (benign, intermediate/borderline, and malignant). Correct diagnosis is recorded when one of the differentials provided by the pathologist matches the reference diagnosis (e.g., myxoid liposarcoma). Please note that this arm of the study was performed using one slide only, no ancillary testing, and limited clinical data.

Pathologists (P)	Overall concordance rate (%)	Major discordance (%)	Minor discordance (%)	Uncertain diagnosis (%)	Correct DX (%)
P1	70	5	10	15	66
P2	71	3	8	18	70
P3	70	4	9	17	63
P4	76	9	15	0	37
P5	65	4	16	15	79
Overall	70.4	5	11.6	13	63

Open in a new tab

Fig. 3 — Expert pathologists’ review (study arm 1). This was performed using one slide only, no ancillary testing, and limited clinical data. Malignant cases had the highest concordance rate (82%), benign cases had the highest major discordance rate (7%), and intermediate cases had the highest minor discordance (28%) and uncertainty rates (22%).

Cell-specific parameters and their significance in benign vs malignant STT

Within the scope of 95 cases, encompassing 52 females and 43 males aged from 9 months to 90 years (with an average age of 41 and a median of 42), image analysis highlighted several cell-specific parameters that showed statistically significant differences (p < 0.05). These parameters included cell count, nucleus/cell area ratio, nucleus hematoxylin OD* mean, cell max caliper, cell area, cell perimeter, cell circularity, and cell min caliper (Table 2). The Welch’s t-test confirmed significant distinctions in the mean values of these parameters between benign and malignant groups, suggesting their potential utility in creating machine learning models for aiding with a soft tissue diagnosis.

Table 2.

Cell-specific parameters of soft tissue tumors and their p values (Study arm 2).

	Mean		Difference between means ± SEM	95% confidence interval	p value
	0	1
Cases (n)	60	35
Nucleus/Cell area ratio	0.218	0.2731	0.05504 ± 0.008515	0.03804 to 0.07204	<0.0001
Cell count	171.9	278.9	107.0 ± 18.51	70.08 to 143.9	<0.0001
Nucleus area	28.41	29.7	1.287 ± 0.7150	−0.1327 to 2.708	0.075
Nucleus perimeter	23.16	23.21	0.04822 ± 0.3044	−0.5563 to 0.6527	0.8745
Nucleus: Circularity	0.665	0.6748	0.009753 ± 0.008050	−0.006236 to 0.02574	0.2288
Nucleus: Max caliper	8.984	8.748	−0.2359 ± 0.1266	−0.4874 to 0.01556	0.0656
Nucleus: Hematoxylin OD mean	0.3127	0.3741	0.06141 ± 0.02067	0.02032 to 0.1025	0.0039
Cell: Max caliper	16.33	14.95	−1.381 ± 0.2853	−1.949 to −0.8130	<0.0001
Nucleus: Eccentricity	0.809	0.7885	−0.02050 ± 0.005593	−0.03161 to −0.009397	0.0004
Nucleus: Min caliper	4.579	4.854	0.2756 ± 0.06869	0.1391 to 0.4120	0.0001
Nucleus: Hematoxylin OD sum	36.71	47.15	10.44 ± 2.362	5.738 to 15.15	<0.0001
Nucleus: Hematoxylin OD std dev	0.1034	0.1144	0.01096 ± 0.004120	0.002775 to 0.01914	0.0092
Nucleus: Hematoxylin OD max	0.5784	0.6721	0.09370 ± 0.02779	0.03846 to 0.1489	0.0011
Nucleus: Hematoxylin OD min	0.1047	0.1376	0.03292 ± 0.01320	0.006608 to 0.05923	0.0149
Nucleus: Hematoxylin OD range	0.4736	0.5344	0.06078 ± 0.01833	0.02436 to 0.09719	0.0013
Nucleus: Eosin OD mean	0.2425	0.2352	−0.007304 ± 0.01187	−0.03094 to 0.01633	0.5401
Nucleus: Eosin OD sum	27.54	28.55	1.012 ± 1.403	−1.803 to 3.827	0.4738
Nucleus: Eosin OD std dev	0.06607	0.0649	−0.001174 ± 0.003765	−0.008651 to 0.006303	0.7559
Nucleus: Eosin OD max	0.3925	0.3862	−0.006242 ± 0.01772	−0.04147 to 0.02899	0.7255
Nucleus: Eosin OD min	0.08946	0.08085	−0.008608 ± 0.006045	−0.02072 to 0.003505	0.1601
Nucleus: Eosin OD range	0.303	0.3054	0.002360 ± 0.01477	−0.02697 to 0.03169	0.8734
Cell: Area	135.1	110.7	−24.42 ± 5.009	−34.38 to −14.46	<0.0001
Cell: Perimeter	44.33	40.5	−3.831 ± 0.7991	−5.422 to −2.240	<0.0001
Cell: Circularity	0.8166	0.7968	−0.01988 ± 0.004069	−0.02796 to −0.01180	<0.0001
Cell: Min caliper	10.88	9.898	−0.9780 ± 0.2126	−1.401 to −0.5548	<0.0001
Cell: Eccentricity	0.6993	0.7058	0.006489 ± 0.003267	1.171e-006 to 0.01298	0.05
Cell: Eosin OD mean	0.179	0.1718	−0.007139 ± 0.009776	−0.02658 to 0.01230	0.4672
Cell: Eosin std dev	0.08495	0.08461	−0.0003421 ± 0.005141	−0.01056 to 0.009875	0.9471
Cell: Eosin OD max	0.4168	0.4017	−0.01506 ± 0.01941	−0.05364 to 0.02351	0.4397
Cell: Eosin OD min	0.01473	0.007824	−0.006906 ± 0.002861	−0.01259 to −0.001224	0.0178
Cytoplasm: Hematoxylin OD mean	0.08647	0.122	0.03549 ± 0.009522	0.01644 to 0.05454	0.0004
Cytoplasm: Hematoxylin OD std dev	0.06128	0.07984	0.01855 ± 0.004686	0.009220 to 0.02789	0.0002
Cytoplasm: Hematoxylin OD max	0.3381	0.4205	0.08239 ± 0.02081	0.04094 to 0.1238	0.0002
Cytoplasm: Hematoxylin OD min	−0.01809	−0.00513	0.01296 ± 0.004507	0.003989 to 0.02194	0.0052
Cytoplasm: Eosin OD mean	0.1613	0.1477	−0.01362 ± 0.009514	−0.03254 to 0.005288	0.1558
Cytoplasm: Eosin OD std dev	0.07641	0.074	−0.002417 ± 0.004904	−0.01216 to 0.007325	0.6233
Cytoplasm: Eosin OD max	0.3729	0.35	−0.02283 ± 0.01892	−0.06043 to 0.01477	0.2309
Cytoplasm: Eosin OD min	0.01692	0.009997	−0.006924 ± 0.002960	−0.01280 to −0.001046	0.0215

Open in a new tab

The bolded entities represent statistically significant values.

AI model performance relative to expert pathologists

The machine learning models that achieved the best performance in distinguishing benign from malignant STTs included gradient boosting, neural network, xgboost, random forest, bagging, histgradientboosting, sgdclassifier, and logistic regression. For instance, logistic regression exhibited a sensitivity of 0.737 and a specificity of 0.8, whereas random forest showed a sensitivity of 0.864 and a specificity of 0.75. The average sensitivity among these models was 0.60, specificity was 0.75, and overall accuracy was 0.68. The area under the receiver operating characteristic curve (AUC) for these models was also 0.68 (Table 3). These results indicate that the performance of the AI models is on par with that of the expert pathologists, suggesting a promising role for AI in supporting diagnostic processes in soft tissue pathology.

Table 3.

Representative AI models metrics, classifying benign and malignant soft tissue tumors (Study arm 3).

Metric	AUC	Accuracy	Sensitivity	Specificity
Gradientboosting	0.664	0.667	0.579	0.75
Neuralnetwork	0.638	0.641	0.526	0.75
Xgboost	0.639	0.641	0.579	0.7
Randomforest	0.717	0.718	0.684	0.75
Bagging	0.666	0.667	0.632	0.7
Tabpfn	0.743	0.744	0.737	0.75
Histgradientboosting	0.664	0.667	0.579	0.75
Sgdclassifier	0.584	0.59	0.368	0.8
Logisticregression	0.768	0.769	0.737	0.8

Open in a new tab

Discussion

Accurate diagnosis of STT and their subtypes is crucial in determining effective personalized oncology treatment plans for the best patient outcomes. There is scarcity of studies focusing on diagnostic discrepancies, especially in cases that have received second opinion in soft tissue pathology.¹^,¹⁶ One notable study examining the diagnosis of sarcoma through histopathology review revealed a substantial 24% discordance in diagnoses between community pathologists and an expert sarcoma reference pathology group. Sixty-six percent of these discordant cases had clinically significant implications for treatment recommendations.¹⁶ Interestingly, for all major discordant cases, excluding non-mesenchymal lesions, the diagnosis could have been made through conventional H&E-stained slides. The primary reason for diagnostic errors was the limited experience of non-specialized surgical pathologists with uncommon and atypical neoplasms.

Whole-slide imaging has emerged as a powerful tool for enhancing the care of cancer patients, accelerated by the COVID-19 pandemic and the CMS approval of remote sign out, thereby increasing the adoption of the technology.¹⁷ This technology enables timely assessment of tumor tissue by experts in STT, it fosters collaboration among specialists in sarcoma management, and it provides pathologists in underserved regions, without access to sarcoma centers, the opportunity to consult with a sarcoma specialist.¹⁸ Previous studies have also highlighted the utility of WSI technology in diagnosing STT. For instance, Sargen et al reported a notable diagnostic accuracy of 89% using WSI for STT by two experienced soft tissue pathologists.¹⁹ In another study, nine pathologists, with different levels of expertise, assessed 291 STT using WSI, and demonstrated a substantial increase in accuracy from 46.3% (±15.5%) to 87.1% (±11.1%) with the assistance of deep machine learning.¹³ These findings emphasize the pivotal role of specialized pathology assessment in sarcoma diagnosis, and the need to leverage WSI for this purpose.

The expert review experiment in this project aimed to establish a baseline of what is achievable with limited clinical information and lack of ancillary studies, for comparison with an AI-assisted scenario using H&E-stained slides only. It is important to recognize that a comprehensive diagnostic evaluation necessitates the review of all case slides, along with access to clinical information, imaging data, and often additional tests like immunohistochemistry and molecular studies.²⁰ In particular, intermediate (borderline) lesions continue to pose challenges in classification, underscoring the need for supplementary tools in the diagnostic process.

This study identified several cell-specific parameters as being statistically significant (p<0.05) in distinguishing between benign and malignant STT. Recent research has also shown the potential of nuclear morphology as a deep learning biomarker for cellular senescence, which can be applied to cancer.²¹ In another study, deep learning algorithms significantly enhanced pathologists' accuracy in diagnosing leiomyosarcomas and predicting outcomes, increasing accuracy from 46.3% to 87.1%.⁸ These findings suggest that cell-specific parameters hold promise in aiding pathologists in distinguishing benign from malignant STT and hence improving diagnostic accuracy.

In this study, Scikit-learn²² AI software, was utilized, employing gradient boosting, neural networks, xgboost, and logistic regression algorithms. The successful classification of benign and malignant STT, in our study, underscores the potential of an AI-based approach to enhance diagnostic accuracy in soft tissue pathology. A major limitation of this study is the relatively small number of cases analyzed and the inclusion of a large number of STT entities. Unfortunately, this is an expected challenge in studies involving STT, as the majority of these tumors are rare, and the differential diagnosis can be quite broad. In addition, the boundaries between benign, reactive, intermediate-, and low-grade malignancy can be blurry. Another limitation is the lack of validation using an independent dataset.

Despite the aforementioned success yielded in our pilot study, it is crucial to acknowledge that the role of AI in STT diagnosis and treatment is still premature, and that further research is imperative to validate its practical applicability in the clinical setting. Nevertheless, we believe that expanding the dataset with more cases has the potential to significantly improve the performance of the AI model. Finally, the implementation of AI in pathology mandates careful consideration of various technical and ethical aspects, such as patient data privacy, data interpretability, model transparency, and potential biases.

Conclusion

This study demonstrates the potential of AI techniques to enhance the diagnosis and classification of STTs, with promising results in distinguishing benign from malignant cases and highlighting the relevance of cell-specific parameters. A comprehensive diagnostic evaluation, encompassing all slides and access to immunohistochemical and molecular studies, remains indispensable for accurate soft tissue diagnosis. The use of large heterogeneous, well-curated and annotated/labelled datasets will be essential to bolster AI model training and accuracy in the field of soft tissue pathology. Nevertheless, the integration of AI into the diagnostic process offers substantial promise, poised to elevate our capacity to differentiate between these tumors, ultimately leading to heightened diagnostic precision and improved patient outcomes.

We would like to emphasize that this is a proof-of-concept small study, and larger studies are needed to validate and expand on the findings. The current study provides a foundation upon which future studies can be built. Future research should focus on expanding the dataset and refining AI algorithms to potentially improve diagnostic sensitivity and specificity. Additionally, ongoing efforts must be made to address technical and ethical considerations, such as ensuring data privacy and the accuracy of AI-generated results.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

The authors would like to acknowledg the generous support of the Sanders-Brown Center at the University of Kentucky through their staff and equipment for whole slide imaging.

References

1.Arbiser Z.K., Folpe A.L., Weiss S.W. Consultative (expert) second opinions in soft tissue pathology. Analysis of problem-prone diagnostic situations. Am J Clin Pathol. 2001;116(4):473–476. doi: 10.1309/425H-NW4W-XC9A-005H. [DOI] [PubMed] [Google Scholar]
2.Rupani A., Hallin M., Jones R.L., Fisher C., Thway K. Diagnostic differences in expert second-opinion consultation cases at a tertiary sarcoma center. Sarcoma. 2020;2020:9810170. doi: 10.1155/2020/9810170. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Mesko N.W., Mesko J.L., Gaffney L.M., Halpern J.L., Schwartz H.S., Holt G.E. Medical malpractice and sarcoma care--a thirty-three year review of case resolutions, inciting factors, and at risk physician specialties surrounding a rare diagnosis. J Surg Oncol. 2014;110(8):919–929. doi: 10.1002/jso.23770. [DOI] [PubMed] [Google Scholar]
4.Ailia M.J., Thakur N., Abdul-Ghafar J., Jung C.K., Yim K., Chong Y. Current trend of artificial intelligence patents in digital pathology: a systematic evaluation of the patent landscape. Cancers (Basel) 2022;14(10) doi: 10.3390/cancers14102400. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Borowsky A.D., Glassy E.F., Wallace W.D., et al. Digital whole slide imaging compared with light microscopy for primary diagnosis in surgical pathology. Arch Pathol Lab Med. 2020;144(10):1245–1253. doi: 10.5858/arpa.2019-0569-OA. [DOI] [PubMed] [Google Scholar]
6.Crombé A., Roulleau-Dugage M., Italiano A. The diagnosis, classification, and treatment of sarcoma in this era of artificial intelligence and immunotherapy. Cancer Commun (Lond) 2022;42(12):1288–1313. doi: 10.1002/cac2.12373. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Hanahan D., Weinberg R.A. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–674. doi: 10.1016/j.cell.2011.02.013. [DOI] [PubMed] [Google Scholar]
8.Kantidakis G., Litière S., Neven A., et al. New benchmarks to design clinical trials with advanced or metastatic liposarcoma or synovial sarcoma patients: an EORTC - Soft Tissue and Bone Sarcoma Group (STBSG) meta-analysis based on a literature review for soft-tissue sarcomas. Eur J Cancer. 2022;174:261–276. doi: 10.1016/j.ejca.2022.07.010. [DOI] [PubMed] [Google Scholar]
9.Kusta O., Rift C.V., Risør T., Santoni-Rugiu E., Brodersen J.B. Lost in digitization - a systematic review about the diagnostic test accuracy of digital pathology solutions. J Pathol Inform. 2022;13 doi: 10.1016/j.jpi.2022.100136. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Mukhopadhyay S., Feldman M.D., Abels E., et al. Whole slide imaging versus microscopy for primary diagnosis in surgical pathology: a multicenter blinded randomized noninferiority study of 1992 cases (pivotal study) Am J Surg Pathol. 2018;42(1):39–52. doi: 10.1097/PAS.0000000000000948. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Saini T., Bansal B., Dey P. Digital cytology: current status and future prospects. Diagn Cytopathol. 2023;51(3):211–218. doi: 10.1002/dc.25099. [DOI] [PubMed] [Google Scholar]
12.Rottmann D., Abdulfatah E., Pantanowitz L. Molecular testing of soft tissue tumors. Diagn Cytopathol. 2023;51(1):12–25. doi: 10.1002/dc.25013. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Foersch S., Eckstein M., Wagner D.C., et al. Deep learning for diagnosis and survival prediction in soft tissue sarcoma. Ann Oncol. 2021;32(9):1178–1187. doi: 10.1016/j.annonc.2021.06.007. [DOI] [PubMed] [Google Scholar]
14.Allan C., Burel J.M., Moore J., et al. OMERO: flexible, model-driven data management for experimental biology. Nat Methods. 2012;9(3):245–253. doi: 10.1038/nmeth.1896. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Bankhead P., Loughrey M.B., Fernández J.A., et al. QuPath: open source software for digital pathology image analysis. Sci Rep. 2017;7(1):16878. doi: 10.1038/s41598-017-17204-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Al-Ibraheemi A., Folpe A.L. Voluntary second opinions in pediatric bone and soft tissue pathology: a retrospective review of 1601 cases from a single mesenchymal tumor consultation service. Int J Surg Pathol. 2016;24(8):685–691. doi: 10.1177/1066896916657591. [DOI] [PubMed] [Google Scholar]
17.Clinical Laboratory Improvement Amendments of 1988 (CLIA) Post-Public Health Emergency (PHE) Guidance CMS, Ed. 2021. [Google Scholar]
18.Rajasekaran R.B., Whitwell D., Cosker T.D.A., Gibbons C.L.M.H., Carr A. Will virtual multidisciplinary team meetings become the norm for musculoskeletal oncology care following the COVID-19 pandemic? - experience from a tertiary sarcoma centre. BMC Musculoskelet Disord. 2021;22(1):18. doi: 10.1186/s12891-020-03925-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Sargen M.R., Luk K.M., Stoff B.K., et al. Diagnostic accuracy of whole slide imaging for cutaneous, soft tissue, and melanoma sentinel lymph node biopsies with and without immunohistochemistry. J Cutan Pathol. 2018;45(8):597–602. doi: 10.1111/cup.13268. [DOI] [PubMed] [Google Scholar]
20.Tsagkaris C., Trygonis N., Spyrou V., Koulouris A. Telemedicine in care of sarcoma patients beyond the COVID-19 pandemic: challenges and opportunities. Cancers (Basel) 2023;15(14) doi: 10.3390/cancers15143700. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Heckenbach I., Mkrtchyan G.V., Ezra M.B., et al. Nuclear morphology is a deep learning biomarker of cellular senescence. Nat Aging. 2022;2(8):742–755. doi: 10.1038/s43587-022-00263-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Abraham A., Pedregosa F., Eickenberg M., et al. Machine learning for neuroimaging with scikit-learn. Front Neuroinform. 2014;8:14. doi: 10.3389/fninf.2014.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0005] 1.Arbiser Z.K., Folpe A.L., Weiss S.W. Consultative (expert) second opinions in soft tissue pathology. Analysis of problem-prone diagnostic situations. Am J Clin Pathol. 2001;116(4):473–476. doi: 10.1309/425H-NW4W-XC9A-005H. [DOI] [PubMed] [Google Scholar]

[bb0010] 2.Rupani A., Hallin M., Jones R.L., Fisher C., Thway K. Diagnostic differences in expert second-opinion consultation cases at a tertiary sarcoma center. Sarcoma. 2020;2020:9810170. doi: 10.1155/2020/9810170. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0015] 3.Mesko N.W., Mesko J.L., Gaffney L.M., Halpern J.L., Schwartz H.S., Holt G.E. Medical malpractice and sarcoma care--a thirty-three year review of case resolutions, inciting factors, and at risk physician specialties surrounding a rare diagnosis. J Surg Oncol. 2014;110(8):919–929. doi: 10.1002/jso.23770. [DOI] [PubMed] [Google Scholar]

[bb0020] 4.Ailia M.J., Thakur N., Abdul-Ghafar J., Jung C.K., Yim K., Chong Y. Current trend of artificial intelligence patents in digital pathology: a systematic evaluation of the patent landscape. Cancers (Basel) 2022;14(10) doi: 10.3390/cancers14102400. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0025] 5.Borowsky A.D., Glassy E.F., Wallace W.D., et al. Digital whole slide imaging compared with light microscopy for primary diagnosis in surgical pathology. Arch Pathol Lab Med. 2020;144(10):1245–1253. doi: 10.5858/arpa.2019-0569-OA. [DOI] [PubMed] [Google Scholar]

[bb0030] 6.Crombé A., Roulleau-Dugage M., Italiano A. The diagnosis, classification, and treatment of sarcoma in this era of artificial intelligence and immunotherapy. Cancer Commun (Lond) 2022;42(12):1288–1313. doi: 10.1002/cac2.12373. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0035] 7.Hanahan D., Weinberg R.A. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–674. doi: 10.1016/j.cell.2011.02.013. [DOI] [PubMed] [Google Scholar]

[bb0040] 8.Kantidakis G., Litière S., Neven A., et al. New benchmarks to design clinical trials with advanced or metastatic liposarcoma or synovial sarcoma patients: an EORTC - Soft Tissue and Bone Sarcoma Group (STBSG) meta-analysis based on a literature review for soft-tissue sarcomas. Eur J Cancer. 2022;174:261–276. doi: 10.1016/j.ejca.2022.07.010. [DOI] [PubMed] [Google Scholar]

[bb0045] 9.Kusta O., Rift C.V., Risør T., Santoni-Rugiu E., Brodersen J.B. Lost in digitization - a systematic review about the diagnostic test accuracy of digital pathology solutions. J Pathol Inform. 2022;13 doi: 10.1016/j.jpi.2022.100136. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0050] 10.Mukhopadhyay S., Feldman M.D., Abels E., et al. Whole slide imaging versus microscopy for primary diagnosis in surgical pathology: a multicenter blinded randomized noninferiority study of 1992 cases (pivotal study) Am J Surg Pathol. 2018;42(1):39–52. doi: 10.1097/PAS.0000000000000948. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0055] 11.Saini T., Bansal B., Dey P. Digital cytology: current status and future prospects. Diagn Cytopathol. 2023;51(3):211–218. doi: 10.1002/dc.25099. [DOI] [PubMed] [Google Scholar]

[bb0060] 12.Rottmann D., Abdulfatah E., Pantanowitz L. Molecular testing of soft tissue tumors. Diagn Cytopathol. 2023;51(1):12–25. doi: 10.1002/dc.25013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0065] 13.Foersch S., Eckstein M., Wagner D.C., et al. Deep learning for diagnosis and survival prediction in soft tissue sarcoma. Ann Oncol. 2021;32(9):1178–1187. doi: 10.1016/j.annonc.2021.06.007. [DOI] [PubMed] [Google Scholar]

[bb0070] 14.Allan C., Burel J.M., Moore J., et al. OMERO: flexible, model-driven data management for experimental biology. Nat Methods. 2012;9(3):245–253. doi: 10.1038/nmeth.1896. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0075] 15.Bankhead P., Loughrey M.B., Fernández J.A., et al. QuPath: open source software for digital pathology image analysis. Sci Rep. 2017;7(1):16878. doi: 10.1038/s41598-017-17204-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0080] 16.Al-Ibraheemi A., Folpe A.L. Voluntary second opinions in pediatric bone and soft tissue pathology: a retrospective review of 1601 cases from a single mesenchymal tumor consultation service. Int J Surg Pathol. 2016;24(8):685–691. doi: 10.1177/1066896916657591. [DOI] [PubMed] [Google Scholar]

[bb0085] 17.Clinical Laboratory Improvement Amendments of 1988 (CLIA) Post-Public Health Emergency (PHE) Guidance CMS, Ed. 2021. [Google Scholar]

[bb0090] 18.Rajasekaran R.B., Whitwell D., Cosker T.D.A., Gibbons C.L.M.H., Carr A. Will virtual multidisciplinary team meetings become the norm for musculoskeletal oncology care following the COVID-19 pandemic? - experience from a tertiary sarcoma centre. BMC Musculoskelet Disord. 2021;22(1):18. doi: 10.1186/s12891-020-03925-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0095] 19.Sargen M.R., Luk K.M., Stoff B.K., et al. Diagnostic accuracy of whole slide imaging for cutaneous, soft tissue, and melanoma sentinel lymph node biopsies with and without immunohistochemistry. J Cutan Pathol. 2018;45(8):597–602. doi: 10.1111/cup.13268. [DOI] [PubMed] [Google Scholar]

[bb0100] 20.Tsagkaris C., Trygonis N., Spyrou V., Koulouris A. Telemedicine in care of sarcoma patients beyond the COVID-19 pandemic: challenges and opportunities. Cancers (Basel) 2023;15(14) doi: 10.3390/cancers15143700. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0105] 21.Heckenbach I., Mkrtchyan G.V., Ezra M.B., et al. Nuclear morphology is a deep learning biomarker of cellular senescence. Nat Aging. 2022;2(8):742–755. doi: 10.1038/s43587-022-00263-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0110] 22.Abraham A., Pedregosa F., Eickenberg M., et al. Machine learning for neuroimaging with scikit-learn. Front Neuroinform. 2014;8:14. doi: 10.3389/fninf.2014.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Utility of artificial intelligence in a binary classification of soft tissue tumors

Jing Di

Caylin Hickey

Cody Bumgardner

Mustafa Yousif

Mauricio Zapata

Therese Bocklage

Bonnie Balzer

Marilyn M Bui

Jerad M Gardner

Liron Pantanowitz

Shadi A Qasem

Abstract

Introduction