Skip to main content
International Journal of Endocrinology logoLink to International Journal of Endocrinology
. 2025 Jun 23;2025:2209918. doi: 10.1155/ije/2209918

Prediction of Patients With Anaplastic Thyroid Carcinoma With Bone Metastasis: A Population-Based Study

Yisong Yao 1,2,3,4, Guibin Zheng 5, Xi Chen 1,2,3,4, Yaqi Wang 1,2,3,4, Congxian Lu 1,2,3,4, Jiaxuan Li 1,2,3,4, Ting Yuan 1,2,3,4, Caiyu Sun 1,2,3,4, Yakui Mou 1,2,3,4, Yumei Li 1,2,3,4,, Xicheng Song 1,2,3,4,
PMCID: PMC12208758  PMID: 40589601

Abstract

Background: Bone metastasis (BM) is a significant risk factor for the survival and prognosis of patients with anaplastic thyroid carcinoma (ATC). The aim of this study was to predict BM in patients with ATC.

Methods: Demographic and clinicopathological data of patients with ATC were extracted from the Surveillance, Epidemiology, and End Results database between 2010 and 2020. Logistic regression (LR) was used to identify the linear influencing factors for BM. We developed prediction models for BM using six machine learning models: support vector machine (SVM), LR, adaptive boosting (AD), decision tree (DT), eXtreme Gradient Boosting (XGB), and random forest (RF). The area under the receiver operating characteristic curve (AUC) values, accuracy, recall rate, precision, F1 scores, calibration curves, and precision-recall curves were used to determine the best model and evaluate its effectiveness. The SHapley Additive exPlanations algorithm was used to reveal the interpretability of the prediction model.

Results: This study included 781 patients with ATC, of whom 78 (9.99%) patients occurred BM and 703 (90.01%) patients were free of BM. The XGB model significantly outperformed the other models, with the highest F1 (0.897), accuracy (0.878), precision (0.924), recall (0.900), and AUC (0.897) values. The results of the LR model showed that age, gender, lung metastasis, and liver metastasis were linear influencing factors. According to XGB model, metropolitan area, median household income, N stage, and race were also strongly associated with BM among patients with ATC.

Conclusion: We explored influencing factors for BM and established a prediction model based on XGB that yielded excellent results in predicting BM in patients with ATC. This study provides a theoretical basis for early decision making in clinical practice.

Keywords: anaplastic thyroid carcinoma, bone metastasis, machine learning, prediction

1. Introduction

The incidence of thyroid cancer (TC) has increased dramatically worldwide in recent decades [1], making it the most common malignant tumor of the endocrine system [2]. Based on differences in tumor origin and differentiation, TC is mainly classified into papillary thyroid carcinoma (PTC), follicular thyroid carcinoma (FTC), medullary thyroid carcinoma (MTC), and anaplastic thyroid carcinoma (ATC) [1, 2]. Among them, ATC is a relatively rare malignant thyroid tumor, accounting for only 1% of all TCs [3]. However, because the majority of ATC is highly malignant and aggressive, surgery often fails to achieve satisfactory results, and chemoresistance frequently develops during clinical treatment. The long-term survival prognosis of patients with ATC is often poor [4, 5], with a median survival of only 5 months and a 1-year overall survival rate of only 20% [6].

Distant metastasis (DM) has been reported to occur in 50% of patients with ATC [7] and is characterized by extrathyroidal growth, vascular lymphatic infiltration, and distant organ invasion [8], while only 10% of patients with ATC have progression limited to the thyroid [9]. DM has long been recognized as the most important independent risk factor affecting the survival prognosis of patients [10], with a median survival of 171 days in ATC patients with DM, a 1-year overall survival rate of 15% [7], and 70% of patients dying within 4 years of diagnosis [11].

Bone is one of the most common sites of DM in patients with TC [8]. The incidence of bone metastasis (BM) is approximately 4% in patients with TC [12] and up to 5%–15% in patients with ATC [6]. Patients with TC with BM usually have a worse prognosis, with a 10-year survival rate as low as 27% [13]. In addition, 78% of patients develop severe clinical bone-related events (SREs), and more than 60% of SREs recur during the long-term course of ATC, often leading to a variety of symptoms, such as bone pain, limb movement limitation, and even pathological fracture, which greatly reduce patients' quality of life [14]. However, most BMs are asymptomatic and can only be detected in a few cases, such as when patients present with SREs or undergo unconventional whole-body bone scans [15], by which time the majority of patients with ATC have already progressed to an advanced stage or missed the optimal time for treatment [6]. Therefore, it is essential to explore the occurrence of BM in patients with ATC in a timely manner.

Most studies based on the Surveillance, Epidemiology, and End Results (SEER) database defined the study population as newly diagnosed TC or specific histological subtypes (e.g., differentiated thyroid cancer (DTC)) [1518]. No study has yet focused on the development of BM in patients with ATC [19]. Therefore, this study focused on the development of BM in patients with ATC using the SEER program.

2. Methods

2.1. Study Design and Patients

This study included samples from the SEER program, which includes demographic, tumor pathology, and clinical treatment information for patients with cancers [20]. After obtaining a license and regulatory approval for the SEER database, we included data on patients with ATC between 2010 and 2020 because information on specific DM (such as BM) was not available until 2010 [21]. Additional information is available on the official SEER database website (https://seer.cancer.gov/about/).

The inclusion criteria were as follows: (1) year of diagnosis: 2010–2020; (2) histologically confirmed diagnosis of ATC according to the International Classification of Diseases of Oncology, 3rd Revision (ICD-O-3) histological codes, which were mainly 8020/3, 8021/3, 8022/2, 8022/3, 8023/3, 8030/3, 8031/3, 8032/3, 8032/3, 8033/3, 8034/3, 8035/3, and 8805/3; (3) patients with ATC with BM requiring confirmation by imaging and/or pathology; and (4) patients with ATC without brain metastasis. The exclusion criterion included lack of specific information. Ethical approval and informed consent were not required for this study because the SEER database does not release personally identifiable information.

Patients' data were downloaded from the “Incidence-SEER Research Data, 17 Registries, Nov 2022 Sub (2000–2020)” database using SEER∗Stat 8.4.2 software (released 8/14/2023). The AJCC 7th edition was released in 2009, so patients diagnosed between 2010 and 2020 were staged using AJCC 7th edition.

2.2. Variables and Endpoints

Variables extracted from the SEER database included age, gender, race, marital status, median household income, metropolitan area, grade, tumor size, T stage, N stage, liver metastasis, lung metastasis, laterality, and the number of tumors. The primary endpoint of this study was BM.

2.3. Data Preprocessing

Categorical variables underwent one-hot coding [22]. It is worth noting that the incidence of BM in patients with ATC was extremely low, indicating a highly imbalanced original dataset. Stratified sampling was used to divide the dataset into a training set (80%) and a test set (20%). To address the imbalance, we used oversampling to preprocess the raw data, analyzing changes in the dataset through a correlation matrix [23]. The Synthetic Minority Over-sampling Technique for Nominal (SMOTEN) used in this study is a standard approach for balancing class distribution in imbalanced datasets.

2.4. Linear Influencing Factor Selection

After univariate analysis, significant variables were subjected to multivariate logistic regression (LR) analyses to confirm linear influencing factors of ATC patients with BM.

2.5. Prediction Model Training and Validation

Considering the complex interactions between variables and the nonlinear relationships that may be overlooked by LR, we conducted ablation studies to determine the factors influencing the occurrence of BM in patients with ATC and the optimal model for predicting it. We conducted two models in turn. For the first model, we integrated four significant influencing factors suggested by the multivariate LR model. To maximize the incremental information included in the prediction model, for the second model, we integrated relevant variables from the SEER database into the model based on previous literature and clinical experience. The dataset was divided into a training set and a test set in a ratio of 8:2. The training set was subjected to analysis using six machine learning (ML) algorithms trained by 10-fold cross-validation to construct BM prediction models. These algorithms included support vector machine (SVM), LR, adaptive boosting (AD), decision tree (DT), eXtreme Gradient Boosting (XGB), and random forest (RF). Our literature review indicated that these ML models have been widely used in the clinical prediction of TC [24, 25]. The test set was used to assess the performance of these ML models. Accuracy, recall rate, area under the receiver operating characteristic (ROC) curve (AUC) value, precision, and the F1 score were used to evaluate these ML models. The methods used to compute these hyperparameters are presented in the following equations:

Accuracy=TP+TNTP+TN+FP+FN,Recall rate=TPTP+FN,AUC=insiεposrankinsiMM+1/2MN,Precision=TPTP+FP,F1 score=2TP2TP+FP+FN. (1)

We also used a calibration curve and a precision-recall (PR) curve to test the reliability of the prediction model. Additionally, we developed an interpretable model for ML based on the SHapley Additive exPlanations (SHAP) algorithm [26]. In this process, we visualized and interpreted 1 randomly selected patient and 20 randomly selected patients, respectively. The waterfall plot reveals how values are driven from the model's expected output values on the dataset to the model's predicted output values. The decision plot shows how SHAP values start at the base value and move to the top of the graph. Finally, partial dependence plots (PDPs) were applied for each feature's interpretability [27].

2.6. Statistical Analysis

Data were compiled and stored using Microsoft Excel spreadsheets (Microsoft Corp., Redmond, WA, USA). Categorical variables were presented as counts and percentages, while continuous variables were expressed as means ± standard deviation. Normal distribution was confirmed using the Shapiro–Wilk normality test. Differences between patients with and without BM were assessed using the t-test for continuous variables, and categorical variables were compared using the chi-squared or Fisher's exact test, as appropriate. The Kruskal–Wallis test was used for multiple categorical variables. Data analysis was conducted using SPSS 26.0 (IBM Corp., Armonk, NY, USA), Python 3.8.0 (https://www.python.org/), and SEER∗Stat (https://seer.cancer.gov/seerstat/) software.

3. Results

3.1. Characteristics of the Patients With ATC

A total of 781 patients with ATC were included in this study. Among them, BM occurred in 78 patients (9.99%) and 703 patients were free of BM at initial diagnosis (90.01%). Age, sex, T stage, N stage, lung metastasis, and liver metastasis were significantly different between the two groups (p < 0.05). The detailed information is shown in Table 1.

Table 1.

Characteristics of patients with ATC.

Factor Bone metastasis X 2 p
No (n = 703) Yes (n = 78)
Age 1.637 < 0.05
 20–44 years 10 (1.4%) 2 (2.6%)
 45–64 years 233 (33.1%) 30 (38.5%)
 ≥ 65 years 460 (65.4%) 46 (59%)
Race 1.971 0.578
 White 556 (79.1%) 62 (79.5%)
 Black 54 (7.7%) 5 (6.4%)
 Other 91 (12.9%) 10 (12.8%)
 Unknown 2 (0.3%) 1 (1.3%)
Sex 8.288 < 0.05
 Male 290 (41.3%) 46 (59%)
 Female 413 (58.7%) 32 (41%)
Marital status 1.641 0.440
 Have no partner 287 (40.8%) 26 (33.3%)
 Have a partner 392 (55.8%) 49 (62.8%)
 Unknown 24 (3.4%) 3 (3.8%)
Median household income 6.880 0.650
 < $35,000 10 (1.4%) 0 (0%)
 $35,000–$39,999 9 (1.3%) 2 (2.6%)
 $40,000–$44,999 14 (2%) 1 (1.3%)
 $45,000–$49,999 30 (4.3%) 7 (9%)
 $50,000–$54,999 41 (5.8%) 4 (5.1%)
 $55,000–$59,999 54 (7.7%) 5 (6.4%)
 $60,000–$64,999 88 (12.5%) 10 (12.8%)
 $65,000–$69,999 92 (13.1%) 13 (16.7%)
 $70,000–$74,999 92 (13.1%) 8 (10.3%)
 $75,000+ 273 (38.8%) 28 (35.9%)
Metropolitan 5.755 0.218
 Nonmetropolitan not adjacent to a metropolitan area 39 (5.5%) 5 (6.4%)
 Nonmetropolitan adjacent to a metropolitan area 45 (6.4%) 10 (12.8%)
 Metropolitan of < 250 K inhabitants 37 (5.3%) 5 (6.4%)
 Metropolitan > 250 K–1 M inhabitants 174 (24.8%) 14 (17.9%)
 Metropolitan > 1 M inhabitants 408 (58%) 44 (56.4%)
Grade 0.452 0.501
 Grade I 0 (0.0%) 0 (0.0%)
 Grade II 0 (0.0%) 0 (0.0%)
 Grade III 7 (1%) 2 (2.6%)
 Grade IV 696 (99%) 76 (97.4%)
Laterality 0.155 0.925
 Left 75 (10.7%) 8 (10.3%)
 Right 6 (0.9%) 1 (1.3%)
 Bilateral/NOS 622 (88.5%) 69 (88.5%)
T Stage 11.227 < 0.05
 T0 0 (0%) 1 (1.3%)
 T1 2 (0.3%) 0 (0%)
 T2 7 (1%) 0 (0%)
 T3 38 (5.4%) 2 (2.6%)
 T4 650 (92.5%) 75 (96.2%)
 Tx 6 (0.9%) 0 (0%)
N stage 9.055 < 0.05
 N0 239 (34%) 15 (19.2%)
 N1 399 (56.8%) 58 (74.4%)
 Nx 65 (9.2%) 5 (6.4%)
Tumor size 9.689 0.085
 < 10 3 (0.4%) 0 (0%)
 10–19 15 (2.1%) 2 (2.6%)
 20–29 21 (3%) 5 (6.4%)
 30–39 53 (7.5%) 3 (3.8%)
 ≥ 40 510 (72.5%) 49 (62.8%)
 Unknown 101 (14.4%) 19 (24.4%)
Liver metastasis 54.875 < 0.05
 No 687 (97.7%) 62 (79.5%)
 Yes 16 (2.3%) 16 (20.5%)
Lung metastasis 30.990 < 0.05
 No 465 (66.1%) 26 (33.3%)
 Yes 238 (33.9%) 52 (66.7%)
Number of tumors 0.478 0.569
 1 666 (94.7%) 74 (94.9%)
 > 1 37 (5.3%) 4 (5.1%)

3.2. Linear Influencing Factor Selection

In the multivariate LR model, age, sex, liver metastasis, and lung metastasis were identified as independent predictors of BM in ATC. Further details can be found in Table S1.

3.3. Prediction Model Performance

We used six ML algorithms, including SVM, XGB, LR, AD, DT, and RF, to develop BM prediction models for patients with ATC. Optimal parameters were determined through a grid search. Firstly, we integrated age, gender, liver metastasis, and lung metastasis in ML models. The performance of the models was relatively average, with AUCs mostly ranging from 0.716 to 0.719 (Table S2).

Then, a total of 14 variables were included in these models, including age, gender, race, marital status, median household income, metropolitan area, grade, tumor size, T stage, N stage, liver metastasis, lung metastasis, laterality, and the number of tumors. As the number of training set samples increased, the F1 scores for all prediction models exceeded 0.7, which indicated robust performance and demonstrated effective predictive capabilities with no apparent overfitting (Figure 1).

Figure 1.

Figure 1

Learning curves for all prediction models with oversampling. Abbreviations: AD, adaptive boosting; DT, decision tree; RF, random forest; LR, logistic regression; SVM, support vector machine; XGB, eXtreme Gradient Boosting.

Ten-fold cross-validation was used to explore the average AUC of each ML model (Figures 2(a) and 2(b)). The XGB prediction model significantly outperformed the other models, achieving the highest average AUC (0.90). Additionally, we performed the t-test to compare the 10-fold cross-validation AUC between different models (Table 2). The XGB model was also better than others. More details about XGB are shown in Table S3.

Figure 2.

Figure 2

Ten-fold cross-validation for average AUC (a), receiver operating characteristic (ROC) curve (b), calibration curves (c), and precision-recall (PR) curve (d) for all prediction models with oversampling. Abbreviations: AD, adaptive boosting; DT, decision tree; RF, random forest; LR, logistic regression; SVM, support vector machine; XGB, eXtreme Gradient Boosting.

Table 2.

The t-test to compare the 10-fold cross-validation AUC between different models.

Models t value p
XGB compared to LR −11.696967197853882 < 0.001
XGB compared to SVM −10.772999784004162 < 0.001
XGB compared to DT −9.86378420619267 < 0.001
XGB compared to RF −9.646534116888832 < 0.001
XGB compared to AD −3.537375803165431 0.006

Abbreviations: AD = adaptive boosting, AUC = area under the curve, DT = decision tree, LR = logistic regression, RF = random forest, SVM = support vector machine, and XGB = eXtreme Gradient Boosting.

A calibration curve was used to visualize the consistency between the predicted and observed variables in the six BM prediction models for patients with ATC (Figure 2(c)). On the curve, the 45-degree blue straight line represents the perfect match between the observed (y-axis) and predicted (x-axis) variables. A smaller distance between actual and gray straight line indicates a better outcome. Figure 2(d) shows the PR curves for six ML models for predicting BM in patients with ATC. The XGB prediction model significantly outperformed the other models, achieving the highest F1 (0.897), accuracy (0.878), precision (0.924), recall (0.900), and AUC (0.897) values among all the models (Figure 2 and Table 3).

Table 3.

Performance comparison between prediction models based on different machine learning algorithms.

Models AUC Accuracy Precision Recall rate F1 score
XGB 0.897 0.878 0.924 0.900 0.897
AD 0.861 0.897 0.817 0.853 0.861
SVM 0.865 0.846 0.889 0.867 0.864
LR 0.735 0.737 0.733 0.733 0.736
DT 0.790 0.789 0.790 0.788 0.790
RF 0.847 0.853 0.839 0.846 0.848

Abbreviations: AD = adaptive boosting, AUC = area under the curve, DT = decision tree, LR = logistic regression, RF = random forest, SVM = support vector machine, and XGB = eXtreme Gradient Boosting.

3.4. Prediction Model Interpretability

Based on the SHAP plot of the XGB model, lung metastasis, liver metastasis, gender, metropolitan area, median household income, N stage, and race emerged as the most important factors influencing BM development in patients with ATC (Figure 3(a)). The plot illustrated the individual characteristics of patients with ATC who are at increased or decreased risk of developing BM. This information can be used for individualized healthcare planning with prospective implications based on the predicted XGB model. Figure 3(b) shows a male ATC patient with a partner who was unlikely to develop BM. Figure 3(c) displays a decision plot where each line represented a participant, and the SHAP value indicated the contribution of each feature to the final category. Finally, features' interpretability was shown as PDP in Figures S1S7.

Figure 3.

Figure 3

(a) Explanatory visualization of individuals' predictions. (b) Explanatory visualization of single individual's predictions. (c) Visual decision plot for multiple sample prediction.

4. Discussion

Patients with ATC with BM usually have a poor prognosis. The insidious nature of early clinical symptoms and the complexity of diagnostic modalities have made recognizing BM at the initial diagnosis a long-standing clinical challenge [28]. This study aims for risk factor investigation and prediction model establishment for BM in patients with ATC.

Since LR can only identify limited linear relationship, this study firstly identified significant linear influencing factors affecting the occurrence of BM in patients with ATC by LR, including age, gender, liver metastasis, and lung metastasis. In addition, we performed ablation studies to explore all potential influencing factors associated with BM and the optimal prediction model. We found that the effect of the prediction model which included 14 variables based on XGB (AUC = 0.897) was significantly better than the effect of the prediction model built from four significant linear influencing factors (AUC = 0.718)). Therefore, we were able to determine that there were still variables whose effects were not observed by the LR model. The most effective algorithm, XGB, was selected to construct a clinically informed model to predict BM in patients with ATC. Through the comparison of SHAP algorithm, we identified significant associations between lung metastasis, liver metastasis, gender, metropolitan area, median household income, N stage, and race with BM in patients with ATC.

In this study, we found that the results of different ML models all showed that DMs in the lung and liver were risk factors for the development of BM in patients with ATC, which might suggest that those patients tend to have multiple DMs. According to Besic and Gazic [29], 84% of ATC cases had two or more metastatic sites. It was also noted that about 27.6% of patients were found to have multiple organ metastasis at the time of initial diagnosis, and solitary BMs were even rarer [19]. This metastasis pattern is not exclusive to BM in ATC, but also occurs in other cancers. For breast cancer, other distant organ metastasis can also indicate the possibility of BM [30]. Wang et al. also noted that both bone and lung metastases can be significant predictors of liver metastasis in patients with colorectal cancer [31]. For tumors that might be highly malignant, there could be a relationship between different distant organ metastases. It has been shown that the accumulation of senescent cells can expedite the development of a fused bone environment, leading to the recurrence and metastasis of dormant ATC, while DMs to other sites are occurring [32, 33]. Additionally, surgically induced physical damage and inflammation-induced chemical damage have been linked to metastatic growth at nonsurgical sites [3436].

In addition to tumor clinical information, sociodemographic characteristics play an important role in predicting BM risk in patients with ATC. Relevant studies have revealed that although the incidence of TC is higher in women than in men [37], the rate of extrathyroidal tissue invasion, distal metastasis, recurrence, and mortality are higher in male patients than in women [38]. Our study also showed that men were a risk factor for BM. Previous studies might have overlooked the influence of metropolitan area on disease outcomes. Medical insurance measures in larger scale cities are more comprehensive, and a complete health insurance system can effectively improve the disease outcomes of patients with cancers [39]. In addition, larger scale cities might have more complete medical facilities with advanced technology, providing patients with more equal access to medical care and more timely medical screening and interventions. Household income may have the same mechanisms with city scale and medical insurance as disposable financial resources. Moreover, ethnicity can be a risk factor. Among patients with DTC, black patients had a greater risk of BM [40]. Similarly, in the population of breast cancer patients, outcomes for black patients were worse [41]. However, according to the results of our study, the predicted risk factor for BM of ATC is white ethnicity. This could be the result of the uneven distribution of respondents among different racial groups. Additionally, we also found that older patients were more likely to have BM. We speculated that it may be caused by physical development. Aging is accompanied by cell aging, including changes of protein, metabolism, and genome instability, which can be involved in the progression of tumors [42].

It is worth noting that lymph node metastasis also has a significant influence on the occurrence of BM in patients with ATC. Previous studies have highlighted the importance of lymph node metastasis in predicting BM in patients with TC with various pathological subtypes such as PTC, MTC, and FTC [4345].

However, this study has certain limitations. First, the SEER database only provides cases with synchronous metastasis, and BM often occurs during the follow-up of patients with TC, which might lead to a large sample loss of patients with BM. This might consequently diminish the precision of this prediction model in real-world settings. Additionally, we did not perform validation against external databases, which might have affected our ability to prove the extensibility of this prediction model to some extent. Therefore, more clinical studies, especially randomized clinical trials, are needed in the future.

5. Conclusions

The present study developed a XGB-based prediction model for BM in patients with ATC, which was capable of outperforming other ML models. The results of the LR model showed that age, gender, lung metastasis, and liver metastasis were linear influencing factors. According to the XGB model, metropolitan area, median household income, N stage, and race were also strongly associated with BM among patients with ATC. This might facilitate personalized diagnosis and refine clinical decision making for BM in patients with ATC.

Acknowledgments

We would like to express our profound gratitude to Yi-Pei Chen of the University of Electronic Science and Technology of China (UESTC) for his unwavering methodological support.

Contributor Information

Yumei Li, Email: myheart1263@163.com.

Xicheng Song, Email: drxchsong@163.com.

Data Availability Statement

The datasets generated and/or analyzed during the current study are available in the SEER database (https://seer.cancer.gov/). Further inquiries can be directed to the corresponding authors.

Ethics Statement

We obtained permission to access the research data file within the SEER program from the National Cancer Institute, US. Approval was waived by the local ethics committee, as SEER data are publicly available.

Consent

The authors have nothing to report.

Conflicts of Interest

The authors declare no conflicts of interest.

Author Contributions

Yisong Yao contributed to conception and design of the study. Jiaxuan Li, Ting Yuan, Caiyu Sun, and Yakui Mou organized the database. Yisong Yao, Xi Chen, and Congxian Lu performed the statistical analysis. Yisong Yao, Guibin Zheng, and Xi Chen wrote the manuscript. Yumei Li and Xicheng Song reviewed the article. All authors participated in the manuscript's revision, review, and approval of the final submitted version. Yisong Yao, Guibin Zheng, and Xi Chen contributed equally to this work.

Funding

This work was supported by the Taishan Scholar Project (No. ts20190991), the Key R&D Project of Shandong Province (2022CXPT023), Shandong Medical Association Clinical Research Fund-Qilu Special Fund YXH2022ZX02185, and Scholar Project of Yantai's “Double Hundreds plan”.

Supporting Information

Supporting Information

Additional supporting information can be found online in the Supporting Information section.

2209918.f1.docx (437.6KB, docx)

Table S1: multivariate logistic regression of factors related to distant metastasis—we identified that age, sex, liver metastasis, and lung metastasis were independent predictors of BM in ATC. Table S2: performance comparison between prediction models based on different machine learning algorithms including four linear influencing factors—we found that the performance of the models built by four linear significant factors was relatively average, with AUCs mostly ranging from 0.716 to 0.719. Table S3: detailed settings for the XGB model—we reported more details about XGB model. Figure S1: partial dependent plots of clinical characteristics “gender”; the shaded portion represents the confidence interval—we found that men were a risk factor for BM. Figure S2: partial dependent plots of clinical characteristics “liver metastasis”; the shaded portion represents the confidence interval—we found that liver metastasis was a risk factor for BM. Figure S3: partial dependent plots of clinical characteristics “lung metastasis”; the shaded portion represents the confidence interval—we found that individuals with lung metastasis had higher risk of BM. Figure S4: partial dependent plots of clinical characteristics “median household income”; the shaded portion represents the confidence interval—we found that patients with higher median household income had lower risk of BM. Figure S5: partial dependent plots of clinical characteristics “metropolitan”; the shaded portion represents the confidence interval—we found that patients from larger metropolitan had lower risk of BM. Figure S6: partial dependent plots of clinical characteristics “N stage”; the shaded portion represents the confidence interval—we found that patients with higher N stage had greater risk of BM. Figure S7: partial dependent plots of clinical characteristics “race”; the shaded portion represents the confidence interval—we found that black patients with ATC had greater risk of BM.

References

  • 1.Benjamin B. Health and Vital Statistics . London: Routledge; 2022. Cancer Statistics; pp. 269–278. [DOI] [Google Scholar]
  • 2.Bray F., Ferlay J., Soerjomataram I., Siegel R. L., Torre L. A., Jemal A. Global Cancer Statistics 2018: Globocan Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA: A Cancer Journal for Clinicians . 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
  • 3.Pereira M., Williams V. L., Hallanger Johnson J., Valderrabano P. Thyroid Cancer Incidence Trends in the United States: Association With Changes in Professional Guideline Recommendations. Thyroid . 2020;30(8):1132–1140. doi: 10.1089/thy.2019.0415. [DOI] [PubMed] [Google Scholar]
  • 4.Lin B., Ma H., Ma M., et al. The Incidence and Survival Analysis for Anaplastic Thyroid Cancer: A SEER Database Analysis. American Journal of Translational Research . 2019;11(9):5888–5896. [PMC free article] [PubMed] [Google Scholar]
  • 5.Neff R. L., Farrar W. B., Kloos R. T., Burman K. D. Anaplastic Thyroid Cancer. Endocrinology and Metabolism Clinics of North America . 2008;37(2):525–538. doi: 10.1016/j.ecl.2008.02.003. [DOI] [PubMed] [Google Scholar]
  • 6.Bible K. C., Kebebew E., Brierley J., et al. 2021 American Thyroid Association Guidelines for Management of Patients With Anaplastic Thyroid Cancer. Thyroid . 2021;31(3):337–386. doi: 10.1089/thy.2020.0944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lee J. S., Lee J. S., Yun H. J., et al. Prognosis of Anaplastic Thyroid Cancer With Distant Metastasis. Cancers . 2022;14(23):p. 5784. doi: 10.3390/cancers14235784. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Hvilsom G. B., Londero S. C., Hahn C. H., et al. Anaplastic Thyroid Carcinoma in Denmark 1996–2012: A National Prospective Study of 219 Patients. Cancer Epidemiology . 2018;53:65–71. doi: 10.1016/j.canep.2018.01.011. [DOI] [PubMed] [Google Scholar]
  • 9.Ranganath R., Shah M. A., Shah A. R. Anaplastic Thyroid Cancer. Current Opinion in Endocrinology Diabetes and Obesity . 2015;22(5):387–391. doi: 10.1097/MED.0000000000000189. [DOI] [PubMed] [Google Scholar]
  • 10.Xu L., Cai L., Zhu Z., Chen G. Comparison of the Cox Regression to Machine Learning in Predicting the Survival of Anaplastic Thyroid Carcinoma. BMC Endocrine Disorders . 2023;23(1):p. 129. doi: 10.1186/s12902-023-01368-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Iñiguez-Ariza N. M., Bible K. C., Clarke B. L. Bone Metastases in Thyroid Cancer. Journal of Bone Oncology . 2020;21:p. 100282. doi: 10.1016/j.jbo.2020.100282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Choksi P., Papaleontiou M., Guo C., Worden F., Banerjee M., Haymart M. Skeletal Complications and Mortality in Thyroid Cancer: A Population-Based Study. The Journal of Clinical Endocrinology & Metabolism . 2017;102(4):1254–1260. doi: 10.1210/jc.2016-3906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wu D., Gomes Lima C. J., Moreau S. L., et al. Improved Survival After Multimodal Approach with 131I Treatment in Patients With Bone Metastases Secondary to Differentiated Thyroid Cancer. Thyroid . 2019;29(7):971–978. doi: 10.1089/thy.2018.0582. [DOI] [PubMed] [Google Scholar]
  • 14.Farooki A., Leung V., Tala H., Tuttle R. M. Skeletal-Related Events due to Bone Metastases From Differentiated Thyroid Cancer. The Journal of Clinical Endocrinology & Metabolism . 2012;97(7):2433–2439. doi: 10.1210/jc.2012-1169. [DOI] [PubMed] [Google Scholar]
  • 15.Tong Y., Hu C., Huang Z., Fan Z., Zhu L., Song Y. Novel Nomogram to Predict Risk of Bone Metastasis in Newly Diagnosed Thyroid Carcinoma: A Population-Based Study. BMC Cancer . 2020;20(1):p. 1055. doi: 10.1186/s12885-020-07554-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Liu W. C., Li Z. Q., Luo Z. W., Liao W. J., Liu Z. L., Liu J. M. Machine Learning for the Prediction of Bone Metastasis in Patients With Newly Diagnosed Thyroid Cancer. Cancer Medicine . 2021;10(8):2802–2811. doi: 10.1002/cam4.3776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Khired Z. A., Hussein M. H., Jishu J. A., et al. Osseous Metastases in Thyroid Cancer: Unveiling Risk Factors, Disease Outcomes, and Treatment Impact. Cancers . 2023;15(14):p. 3557. doi: 10.3390/cancers15143557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kato S., Demura S., Shinmura K., Yokogawa N., Shimizu T., Tsuchiya H. Current Management of Bone Metastases From Differentiated Thyroid Cancer. Cancers . 2021;13(17):p. 4429. doi: 10.3390/cancers13174429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Vuong H. G., Le M. K., Hassell L., Kondo T., Kakudo K. The Differences in Distant Metastatic Patterns and Their Corresponding Survival Between Thyroid Cancer Subtypes. Head & Neck . 2022;44(4):926–932. doi: 10.1002/hed.26987. [DOI] [PubMed] [Google Scholar]
  • 20.Doll K. M., Rademaker A., Sosa J. A. Practical Guide to Surgical Data Sets: Surveillance, Epidemiology, and End Results (SEER) Database. JAMA Surgery . 2018;153(6):p. 588. doi: 10.1001/jamasurg.2018.0501. [DOI] [PubMed] [Google Scholar]
  • 21.Cheng S., Yang L., Dai X., Wang J., Han X. The Risk and Prognostic Factors for Brain Metastases in Esophageal Cancer Patients: An Analysis of the SEER Database. BMC Cancer . 2021;21(1):p. 1057. doi: 10.1186/s12885-021-08802-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tekkeşin A. İ. Artificial Intelligence in Healthcare: Past, Present and Future. The Anatolian Journal of Cardiology . 2019;22(Suppl 2):8–9. doi: 10.14744/anatoljcardiol.2019.28661. [DOI] [PubMed] [Google Scholar]
  • 23.Kim J. Y., Kong H. J., Kim S. H., et al. Machine Learning-Based Preoperative Datamining Can Predict the Therapeutic Outcome of Sleep Surgery in OSA Subjects. Scientific Reports . 2021;11(1):p. 14911. doi: 10.1038/s41598-021-94454-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hou F., Zhu Y., Zhao H., et al. Development and Validation of an Interpretable Machine Learning Model for Predicting the Risk of Distant Metastasis in Papillary Thyroid Cancer: A Multicenter Study. eClinicalMedicine . 2024;77:p. 102913. doi: 10.1016/j.eclinm.2024.102913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Mao Y., Lan H., Lin W., et al. Machine Learning Algorithms Are Comparable to Conventional Regression Models in Predicting Distant Metastasis of Follicular Thyroid Carcinoma. Clinical Endocrinology . 2022;98(1):98–109. doi: 10.1111/cen.14693. [DOI] [PubMed] [Google Scholar]
  • 26.Bernard D., Doumard E., Ader I., et al. Explainable Machine Learning Framework to Predict Personalized Physiological Aging. Aging Cell . 2023;22(8):p. e13872. doi: 10.1111/acel.13872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kerrigan D., Barr B., Bertini E. PDPilot: Exploring Partial Dependence Plots Through Ranking, Filtering, and Clustering. IEEE Transactions on Visualization and Computer Graphics . 2025:1–14. doi: 10.1109/TVCG.2025.3545025. [DOI] [PubMed] [Google Scholar]
  • 28.Maniakas A., Dadu R., Busaidy N. L., et al. Evaluation of Overall Survival in Patients With Anaplastic Thyroid Carcinoma, 2000–2019. JAMA Oncology . 2020;6(9):p. 1397. doi: 10.1001/jamaoncol.2020.3362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Besic N., Gazic B. Sites of Metastases of Anaplastic Thyroid Carcinoma: Autopsy Findings in 45 Cases From a Single Institution. Thyroid . 2013;23(6):709–713. doi: 10.1089/thy.2012.0252. [DOI] [PubMed] [Google Scholar]
  • 30.See A., Iyer N. G., Tan N. C., et al. Distant Metastasis as the Sole Initial Manifestation of Well-Differentiated Thyroid Carcinoma. European Archives of Oto-Rhino-Laryngology . 2017;274(7):2877–2882. doi: 10.1007/s00405-017-4532-9. [DOI] [PubMed] [Google Scholar]
  • 31.Wang Q., Shen K., Fei B., Wei M., Xie Z. Nomogram for Predicting Occurrence and Prognosis of Liver Metastasis in Elderly Colorectal Cancer Patients: a Population-Based Study. Frontiers in Oncology . 2023;13:p. 1295650. doi: 10.3389/fonc.2023.1295650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lawson M. A., McDonald M. M., Kovacic N., et al. Osteoclasts Control Reactivation of Dormant Myeloma Cells by Remodelling the Endosteal Niche. Nature Communications . 2015;6(1):p. 8983. doi: 10.1038/ncomms9983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.LeBleu V. S., O’Connell J. T., Gonzalez Herrera K. N., et al. PGC-1α Mediates Mitochondrial Biogenesis and Oxidative Phosphorylation in Cancer Cells to Promote Metastasis. Nature Cell Biology . 2014;16(10):992–1003. doi: 10.1038/ncb3039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Balkwill F., Mantovani A. Inflammation and Cancer: Back to Virchow? The Lancet . 2001;357(9255):539–545. doi: 10.1016/S0140-6736(00)04046-0. [DOI] [PubMed] [Google Scholar]
  • 35.Lee J. Y., Murphy S. M., Scanlon E. F. Effect of Trauma of Implantation of Metastatic Tumor in Bone in Mice. Journal of Surgical Oncology . 1994;56(3):178–184. doi: 10.1002/jso.2930560311. [DOI] [PubMed] [Google Scholar]
  • 36.Demicheli R., Retsky M. W., Hrushesky W. J., Baum M., Gukas I. D. The Effects of Surgery on Tumor Growth: A Century of Investigations. Annals of Oncology . 2008;19(11):1821–1828. doi: 10.1093/annonc/mdn386. [DOI] [PubMed] [Google Scholar]
  • 37.Bi J., Zhang H. Nomogram Predicts Risk and Prognostic Factors for Lung Metastasis of Anaplastic Thyroid Carcinoma: A Retrospective Study in the Surveillance Epidemiology and End Results (SEER) Database. Translational Cancer Research . 2023;12(12):3547–3564. doi: 10.21037/tcr-23-1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Mazzaferri E. L. The Prognostic Value of Primary Tumor Size in Papillary and Follicular Thyroid Carcinoma: A Comparative Analysis. Year Book of Endocrinology . 2006;2006:272–274. doi: 10.1016/s0084-3741(08)70379-2. [DOI] [Google Scholar]
  • 39.Zhao J., Han X., Nogueira L., et al. Health Insurance Status and Cancer Stage at Diagnosis and Survival in the United States. CA: A Cancer Journal for Clinicians . 2022;72(6):542–560. doi: 10.3322/caac.21732. [DOI] [PubMed] [Google Scholar]
  • 40.Qi L., Zhang W., Ren X., et al. Incidence and Predictors of Synchronous Bone Metastasis in Newly Diagnosed Differentiated Thyroid Cancer: A Real-World Population-Based Study. Frontiers in Surgery . 2022;9:p. 778303. doi: 10.3389/fsurg.2022.778303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Iqbal J., Ginsburg O., Rochon P. A., Sun P., Narod S. A. Differences in Breast Cancer Stage at Diagnosis and Cancer-Specific Survival by Race and Ethnicity in the United States. JAMA . 2015;313(2):165–173. doi: 10.1001/jama.2014.17322. [DOI] [PubMed] [Google Scholar]
  • 42.Chen B., Zeng Y., Liu B., et al. Risk Factors, Prognostic Factors, and Nomograms for Distant Metastasis in Patients With Newly Diagnosed Osteosarcoma: A Population-Based Study. Frontiers in Endocrinology . 2021;12:p. 672024. doi: 10.3389/fendo.2021.672024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Wang W., Shen C., Yang Z. Nomogram Individually Predicts the Risk for Distant Metastasis and Prognosis Value in Female Differentiated Thyroid Cancer Patients: A SEER-Based Study. Frontiers in Oncology . 2022;12:p. 800639. doi: 10.3389/fonc.2022.800639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Qiao L., Li H., Wang Z., Sun H., Feng G., Yin D. Machine Learning Based on SEER Database to Predict Distant Metastasis of Thyroid Cancer. Endocrine . 2023;84(3):1040–1050. doi: 10.1007/s12020-023-03657-4. [DOI] [PubMed] [Google Scholar]
  • 45.Chen Z., Mao Y., You T., Chen G. Establishment and Validation of a Nomogram Model for Predicting Distant Metastasis in Medullary Thyroid Carcinoma: An Analysis of the SEER Database Based on the AJCC 8th TNM Staging System. Frontiers in Endocrinology . 2023;14:p. 1119656. doi: 10.3389/fendo.2023.1119656. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Additional supporting information can be found online in the Supporting Information section.

2209918.f1.docx (437.6KB, docx)

Data Availability Statement

The datasets generated and/or analyzed during the current study are available in the SEER database (https://seer.cancer.gov/). Further inquiries can be directed to the corresponding authors.


Articles from International Journal of Endocrinology are provided here courtesy of Wiley

RESOURCES