Skip to main content
PeerJ logoLink to PeerJ
. 2023 Jul 12;11:e15678. doi: 10.7717/peerj.15678

Establishment of a prognostic risk prediction modelfor non-small cell lung cancer patients with brainmetastases: a retrospective study

Fei Hou 1,#, Yan Hou 2,#, Xiao-Dan Sun 3, Jia lv 1, Hong-Mei Jiang 1, Meng Zhang 1, Chao Liu 1,, Zhi-Yong Deng 1,
Editor: Abhishek Tyagi
PMCID: PMC10349557  PMID: 37456882

Abstract

Background

Patients with non-small cell lung cancer (NSCLC) who develop brain metastases (BM) have a poor prognosis. This study aimed to construct a clinical prediction model to determine the overall survival (OS) of NSCLC patients with BM.

Methods

A total of 300 NSCLC patients with BM at the Yunnan Cancer Centre were retrospectively analysed. The prediction model was constructed using the least absolute shrinkage and selection operator-Cox regression. The bootstrap sampling method was employed for internal validation. The performance of our prediction model was compared using recursive partitioning analysis (RPA), graded prognostic assessment (GPA), the update of the graded prognostic assessment for lung cancer using molecular markers (Lung-molGPA), the basic score for BM (BSBM), and tumour-lymph node-metastasis (TNM) staging.

Results

The prediction models comprising 15 predictors were constructed. The area under the curve (AUC) values for the 1-year, 3-year, and 5-year time-dependent receiver operating characteristic (curves) were 0.746 (0.678–0.814), 0.819 (0.761–0.877), and 0.865 (0.774–0.957), respectively. The bootstrap-corrected AUC values and Brier scores for the prediction model were 0.811 (0.638–0.950) and 0.123 (0.066-0.188), respectively. The time-dependent C-index indicated that our model exhibited significantly greater discrimination compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging. Similarly, the decision curve analysis demonstrated that our model displayed the widest range of thresholds and yielded the highest net benefit. Furthermore, the net reclassification improvement and integrated discrimination improvement analyses confirmed the enhanced predictive power of our prediction model. Finally, the risk subgroups identified by our prognostic model exhibited superior differentiation of patients’ OS.

Conclusion

The clinical prediction model constructed by us shows promise in predicting OS for NSCLC patients with BM. Its predictability is superior compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging.

Keywords: Brain metastasis, Non-small cell lung cancer, Prognostic model, LASSO regression, Predictors

Introduction

Lung cancer ranks among the most prevalent malignancies worldwide, with non-small cell lung cancer (NSCLC) being the predominant pathology (Sung et al., 2020). Interestingly, the brain is the most common site of distant metastasis in NSCLC. Approximately 10% of patients present with brain metastases (BM) at the time of diagnosis, while an additional 40–50% of patients develop BM during their disease (Ouyang et al., 2020). The prognosis of NSCLC with BM is extremely poor, with untreated patients having a median overall survival (OS) of only 1–3 months, and a 1-year survival rate of 10–20% (Schuler et al., 2016). Treatment modalities for NSCLC with BM include radiotherapy, surgery, chemotherapy, molecular targeting, and immunotherapy, among others, which can be categorised as local or systemic interventions. Given the diverse clinicopathological characteristics observed in patients, accurate prognosis prediction for NSCLC patients with BM is important for selecting a more individualised treatment strategy.

While the tumour-lymph node-metastasis (TNM) staging system remains the gold standard for assessing cancer prognosis, it is not without limitations. First, it is primarily based on the anatomical progression of the disease, assuming that more advanced staging corresponds to poorer prognostic outcomes. However, patients with identical anatomical progression might still exhibit varying prognoses. Second, TNM staging does not include the primary tumor size, lymph node metastasis, and distant metastasis as continuous variables, which can create an imprecise staging. Additionally, TMN staging does not take other variables into consideration to predict the patient prognosis, such as age, gender and histology that would contribute to predicting patient prognosis (Balachandran et al., 2015). Consequently, TNM staging proves inadequate in accurately predicting the prognosis of NSCLC patients with BM.

Currently, the most widely used prognostic models for BM are the recursive partitioning analysis (RPA), graded prognostic assessment (GPA), the update of the graded prognostic assessment for lung cancer using molecular markers (Lung-molGPA), and the basic score for BM (BSBM) (Gaspar et al., 2000; Sperduto et al., 2008; Sperduto et al., 2017; Gao et al., 2018). The RPA focuses on factors such as age, the Karnofsky performance score (KPS), control of the primary tumour, and the presence of extracranial metastases. GPA focuses on factors such as age, the KPS, the number of BM, and the presence of extracranial metastases. The Lung-molGPA, in addition to GPA factors, incorporates the mutation status of NSCLC driver genes. Lastly, the BSBM focuses on the KPS, control of the primary tumour, and the presence of extracranial metastases. While these models are simple and convenient to use, they have certain limitations. For instance, RPA and GPA are general models for BMs without a specific focus on lung cancer primaries, and BSBM does not consider brain metastatic lesion characteristics. The Lung-molGPA, being a specific model for lung cancer BM, stands as a comparatively robust model. However, these models employ subjective or challenging to quantify evaluation metrics. Furthermore, only a few researchers have developed prognostic prediction models for BM in patients with NSCLC. For instance, Li et al. (2022) introduced a novel prognostic model based on clinical features and inflammation markers to enhance the prognostic information accuracy for NSCLC patients with BM compared with the adjusted prognostic analysis, RPA, and GPA. Additionally, Zhang et al. (2020) examined the feasibility of employing computed tomography imaging radiomics to predict the survival of NSCLC patients with BM undergoing whole-brain radiotherapy.

However, these studies have certain limitations, including the non-rigorous selection method used for predictor selection and the limited clinical applicability of the developed models. Therefore, it becomes crucial to identify clinically meaningful and cost-effective prognostic factors that are readily available at the time of BM onset, as this would provide more valuable insights. This study aimed to establish a novel prognostic model based on clinicopathological characteristics, serological indicators, and treatment information using least absolute shrinkage and selection operator (LASSO)-Cox regression analysis to bridge this knowledge gap, thereby achieving a more precise reflection of the prognostic information on NSCLC patients diagnosed with BM. Our model could assist clinicians in formulating reasonable treatment plans.

Materials & Methods

This clinical prediction model was constructed according to the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) checklist (Collins et al., 2015).

This research adheres to the Declaration of Helsinki. The Ethics Committee of the Third Affiliated Hospital of Kunming Medical University approved this study (review number: KYLX2022221). Given the retrospective design of the study and the challenges associated with assessing certain patients, the Ethics Committee granted a waiver of informed consent for a subset of the patients.

Study population and follow-up

This retrospective study included 300 patients with NSCLC who were diagnosed with BM between January 2006 and May 2020 at Yunnan Cancer Hospital, Third Affiliated Hospital of Kunming Medical University. The inclusion criteria for this study were as follows: (1) NSCLC confirmed via pathological examination; (2) magnetic resonance imaging-confirmed BM; (3) availability of patient demographic characteristics, clinicopathological features, serological indicators, and treatment information; (4) absence of other concurrent cancers. The survival duration of the patients was determined by reviewing the medical records and conducting telephonic inquiries. The OS was defined as the interval from the initial diagnosis until death from any cause or the date of the last follow-up visit (Pilz, Manegold & Schmid-Bindert, 2012).

Data collection

Baseline clinical data were obtained from medical records at the time of initial diagnosis of BM. The collected data encompassed various aspects, including general conditions (age, sex, body mass index (BMI), smoking history, and KPS), tumour markers (carcinoembryonic antigen (CEA), neuron-specific enolase (NSE), cytokeratins (cytoplasmic protein fragment of cytokeratin 19 (CYFRA21)), and squamous cell carcinoma antigen (SCCA)), serological indicators (albumin (ALB), lactate dehydrogenase, and alkaline phosphatase (ALP)), serum inflammatory indicators (neutrophils, platelets, lymphocytes, monocytes, platelet/lymphocyte ratio (PLR), neutrophil/lymphocyte ratio (NLR), systemic immune-inflammation index (SII) = platelet × neutrophil/lymphocyte, advance lung cancer inflammation index (ALI) = BMI × ALB/NLR), prognostic nutritional index (PNI) = ALB + 5 × lymphocyte), advance distant metastases (number of BM, lung metastasis, intrathoracic metastasis (malignant pleural effusion, pericardial effusion, or pleural metastasis), liver metastasis, bone metastasis, adrenal metastasis, and metastases to other sites), signs and symptoms of BM (intracranial hypertension, focal signs and symptoms, epilepsy, and decreased cognitive function), type of pathology, pathological stage (tumour stage, lymph node stage (N_stage), or metastasis stage (M_stage)/TNM stage), epidermal growth factor receptor (EGFR) gene mutation status, treatment status (surgery for primary lung cancer foci, radiotherapy of primary lung cancer, radiotherapy for BM lesions (whole-brain radiation therapy or stereotactic radiation therapy), surgical treatment of metastatic brain lesions, chemotherapy, or EGFR-tyrosine kinase inhibitor (TKI) treatment), and classification information of RPA, GPA, Lung-molGPA, and BSBM models. The above predictors were complete and comprised objective data. All predictors were assessed independently of each other, without any knowledge of the clinical outcome. All continuity predictors maintained their continuity and were not categorised. The categorised predictors were all predetermined before model construction. The continuous and categorical variables are presented in Table S1. The sample size of this study met the criterion of having events per variable (EPV) of >10 (Peduzzi et al., 1996; Austin, Allignol & Fine, 2017; Moons et al., 2019).

Model construction and evaluation

The final predictors were selected via a 10-fold cross-validation of LASSO-Cox regression, whereby the λ-value associated with the minimum standard error was chosen. Subsequently, the final predictors were incorporated into a multivariate Cox regression analysis, and the risk score for each patient was calculated using the “predict ()” function. Finally, a prognostic model was constructed.

Riskscore=h0t× expβ1X1+β2X2++βnXn

The discriminatory ability of the model was assessed by evaluating the area under the receiver operating characteristic (ROC) curve (AUC). Furthermore, the calibration curve was plotted and the Brier score was calculated to measure the calibration of the model. Internal validation was conducted using the bootstrap method (resampling 1,000 times). The discriminative ability and clinical utility of the novel prognostic models were compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging using time-dependent C-index and decision curve analysis (DCA). A larger AUC value indicated better predictability of the model (Carrington et al., 2023). DCA demonstrated the relationship between benefits and risks across models by examining various cut points (thresholds) in different models (Van Calster et al., 2018). Furthermore, the integrated discrimination improvement (IDI) and net reclassification improvement (NRI) were employed to assess the reclassification performance and discrimination of our novel prediction models compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging. A nomogram was developed based on the selected predictors to facilitate individual survival prediction in NSCLC patients with BM. Subsequently, patients were classified into low-risk, intermediate-risk, and high-risk groups based on the new prediction model RiskScore. The differences in OS among these three subgroups were analysed using the Kaplan–Meier method. All statistical analyses were performed using R software (version 4.2.1; R Core Team, 2022), and statistical significance was set at P ≤ 0.05.

Results

Patient characteristics

A total of 300 NSCLC patients with BM were included in this study, all of whom had complete baseline clinical and laboratory data. The clinicopathological characteristics and laboratory results of these patients are summarised in Table S1. The mean age of the patients was 55.4 years (range 31–83 years). Among the patients, 185 were males and 115 were females. The median follow-up duration of the patients was 13.9 months, with a minimum and maximum follow-up duration of 0.1 months and 173.83 months, respectively. The last follow-up was performed on 16 June 2021. The OS rates for these patients at 1, 3, and 5 years were 75%, 49%, and 40.3%, respectively.

Construction of the prognostic models

First, LASSO-Cox regression analysis was used to identify the optimal predictors and constructed the model. Cross-validation was used to select the λ-value associated with lambda.min (λ = 0.054), which yielded the highest model fit (Fig. 1). This λ-value corresponds to the most significant prognostic factor for OS. The final model comprised 15 predictors, namely age, KPS, NSE, PLR, lymphocyte count, ALP, smoking history, intrathoracic metastasis, metastases to other sites, N_stage, M_stage, surgery for primary lung cancer foci, chemotherapy EGFR mutation, and TKI treatment. The EPV for each variable was 12.4. The regression coefficients of these predictors were used to construct the prognostic model. The risk score of the prognostic model was calculated as follows: risk score = h0(t) exp [(age × 0.0105543) + (kps × −0.0082627) + (NSE × 0.0017274) + (PLR × 0.0007071) + (lymphocyte count ×−0.0174690)+ (ALP ×−0.0001577) + (smoke × 0.1433414) + (intrathoracic_metastasis ×0.4382260) + (metastases_to_other_sites × 0.0330467) + (N_stage × 0.1062274)+(M_stage × 0.0277756) + (surgery × −0.2624173) + (chemotherapy × −0.0027364) + (TKI_therapy × −0.3508606) + (EGFR mutation × −0.0952148)]. The continuous variables in the formula were based on the original numerical levels, while the categorical variables were represented by codes, as presented in Table S1.

Figure 1. The LASSO regression algorithm was used for screening the predictors.

Figure 1

The LASSO regression algorithm was used for screening the predictors. (A) Path diagram of regression coefficients. Each curve represents the trajectory of each independent variable coefficient with log(ë). (B) Cross-validation curves of the LASSO regression analysis. The left dashed line is lambda.min, which is the smallest deviation of ë, while the right dashed line is lambda.1se, which is one standard error to the right of the smallest ë.

We conducted Cox proportional risk regression analysis using the 15 predictors identified through LASSO regression (Fig. 2). The time-dependent ROC curves suggested AUC values of 0.746 (0.678−0.814), 0.819 (0.761−0.877), and 0.865 (0.774−0.957) for our prediction model at 1, 3, and 5 years, respectively. The Brier score was calculated to measure the overall performance of the model, yielding a value of 0.103 (0.072−0.134). The Brier score combines discrimination and calibration measures, with lower values indicating better model performance (0 for perfect overall performance and 0.25 for worthless model) (Schneider et al., 2022; Assel, Sjoberg & Vickers, 2017). Additionally, internal validation using the bootstrap method (resampling of 1,000) demonstrated an AUC of 0.811 (0.638−0.950) and a Brier score of 0.123 (0.066−0.188; based on 5 years), confirming the model’s good discriminative ability and calibration (Fig. 3).

Figure 2. The hazard ratios and 95% confidence intervals for the 15 predictors.

Figure 2

Figure 3. The ROC, calibration curves, and Brier scores for the prediction model and internal validation.

Figure 3

(A) The time-dependent ROC curves, with AUC values and 95% confidence intervals, of the prediction model for 1, 3, and 5 years; (B) calibration curves, with AUC values, Brier scores, and 95% confidence intervals, based on 5 years. The solid gray line represents a perfect prediction of an ideal model, while the solid black line indicates the performance of the constructed model. (C) The internal validation using the bootstrap method (resampling of 1,000).

Evaluation of the performance between our novel prognostic model, RPA, GPA, Lung-molGPA, BSBM, and TNM staging

The time-dependent C-index was used to evaluate the accuracy of our model’s predictions. Our model demonstrated superior discriminatory ability compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging, and these results were consistent with our bootstrap validation (Fig. 4).

Figure 4. Comparison of the time-dependent C-index of the six models.

Figure 4

(A) Comparison of our models with RPA, GPA, Lung-molGPA, BSBM and TNM staging based on the time-dependent C-indices. (B) Internal validation using the bootstrap method.

Furthermore, DCA was employed to evaluate which focus the clinical applicability of our predictive model. Notably, the applicable threshold range varied among the six DCA curves. The widest threshold range was observed for our model, approximately 0.4−0.8, among the six curves. Moreover, within most of the threshold ranges, our model yielded the highest net benefits (Fig. 5). These findings indicate that our model is the optimal choice.

Figure 5. The DCA curves of the six models.

Figure 5

The decision curves show that the threshold probabilities of our models range from 0.4−0.8, which is the widest threshold range of all models. Among most of the threshold ranges, our constructed model has the highest net benefit overall DCA curves compared to the curves for all treatment (“ALL” curve), no treatment (“None” curve), and the other three models.

Lastly, the IDI and NRI metrics were incorporated to assess the performance of our model. The IDI analysis allowed us to measure the extent to which our new model improved its predictive power compared with the existing model, and the NRI analysis was used to determine the extent to which the new model improved the proportion of correct reclassifications compared with the existing model. The IDI analysis revealed that our model demonstrated positive improvements compared with GPA 0.152 (0.063−0.287), RPA 0.209 (0.113−0.347), Lung-molGPA 0.106 (0.030−0.240), BSBM 0.120 (0.044−0.247), and TNM staging 0.218 (0.122−0.354) with IDI >0. Meanwhile, the NRI analysis revealed that our model demonstrated positive improvements compared with GPA 0.537 (0.172−0.676), RPA 0.474 (0.270−0.722), Lung-molGPA 0.525 (0.124−0.641), BSBM 0.457 (0.149−0.644), and TNM staging 0.536 (0.269−0.711) with NRI >0 (Table 1). These results indicate that our new model has effectively improved the outcome events compared with the previous model.

Table 1. IDI and NRI were used to evaluate the improvement in predictive power and proportion of correct reclassifications of our model compared to the older models for RPA, GPA, Lung-molGPA, BSBM and TNM staging.

IDI*(95% CI) P value NRI*(95% CI) P value
our model vs GPA 0.152(0.063∼0.287) 0.002 0.537(0.172∼0.676) 0.002
our model vs RPA 0.209(0.113∼0.347) <0.001 0.474(0.270∼0.722) 0.002
our model vs Lung_molGPA 0.106(0.030∼0.240) 0.014 0.525(0.124∼0.641) 0.01
our model vs BSBM 0.120(0.044∼0.247) 0.002 0.457(0.149∼0.644) 0.004
our model vs TNM 0.218(0.122∼0.354) <0.001 0.536(0.269∼0.711) <0.001

Notes.

IDI
Integrated Discrimination Improvement
NRI
Net Reclassification Index
*

Positive value represents better accuracy, negative velue represents worse accuracy.

Constructing a nomogram for predicting OS

A nomogram was constructed to visualise our model. This provided a convenient, personalised tool to predict the 1-, 3-, and 5-year OS in NSCLC patients with BM. Each predictor is associated with a score, and the scores of all predictors were summed together to obtain a total score, from which the OS at 1, 3, and 5 years could be obtained (Fig. 6).

Figure 6. Constructed nomogram for predicting 1-, 3-, and 5-year OS in NSCLC patients diagnosed with BM based on 15 predictors.

Figure 6

The nomogram is used by summing the points for each prognostic factor. The total score on the bottom scale corresponds to the patient’s probability of survival at 1, 3, and 5 years.

Risk stratification based on our model

The risk score of each patient was calculated, following which they were classified into three groups, namely the low-risk ( n = 75), intermediate-risk (n = 150), and high-risk (n = 75) groups based on the tertiles of their risk score to assess whether our model could accurately assess patient risk. The OS was significantly lower (P < 0.001) in the high-risk group (risk score >2.125) compared with the low-risk (risk score ≤0.789) and intermediate-risk (0.789 < risk score ≤2.125) groups. Additionally, patients in the intermediate-risk group had a lower OS than those in the low-risk group (P < 0.001).

The Lung-molGPA and BSBM models, along with our model, help distinguish the OS of patients. However, when using risk groupings derived from the GPA and RPA models, the differentiation of OS among patients is not fully effective. Specifically, according to the GPA model, there was no statistically significant difference in the OS between patients in the “GPA 1.5−2.5” group and the “GPA 3” group (P = 0.275). Similarly, based on the RPA model, there was no statistically significant difference between the OS of patients in the “class II” group and the “class III” group (P = 0.122).

While the dichotomous risk grouping based on TNM staging can differentiate patients’ OS, it lacks the level of detail provided by our model, which comprises three subgroups. The results demonstrate the superior performance of our prognostic model in accurately distinguishing the prognosis of NSCLC patients with BM (Figs. 7, 8).

Figure 7. Kaplan–Meier survival curve analysis for different models.

Figure 7

Kaplan–Meier plots are shown for RPA (A), GPA (B), Lung-molGPA (C), BSBM (D), our model (E) and TNM staging (F).

Figure 8. Kaplan–Meier survival curve analysis for diûerent models.

Figure 8

Kaplan–Meier plots are shown for RPA (A), GPA (B), Lung-molGPA (C), BSBM (D), our model (E) and TNM staging (F).

Discussion

In this study, a prediction model based on the LASSO-Cox regression algorithm was developed for predicting OS in NSCLC patients with BM. The prediction model comprised 15 variables, including age, KPS, NSE, PLR, lymphocyte count, ALP, smoking history, intrathoracic metastasis, metastases to other sites, N_stage, M_stage, surgery for primary lung cancer foci, chemotherapy, EGFR mutation, and TKI treatment. The prediction model exhibited good discriminative ability, calibration, and clinical utility. In addition, it outperformed the conventional BM models such as GPA, RPA, Lung-molGPA, BSBM, and TNM staging. Furthermore, the patients were categorised based on their risk scores, demonstrating significant differences in the OS of the low-, intermediate-, and high-risk subgroups as classified by our model.

Previous studies have identified that age, surgical treatment of the primary tumour, KPS, extracerebral metastasis, targeted therapy, NSE level, ALP level, and PLR are factors that influence the prognosis of NSCLC patients with BM (Rodrigus, de Brouwer & Raaymakers, 2001; Sanchez de Cos et al., 2009; Fuchs et al., 2021; Junger et al., 2021; Yu et al., 2021; Jacot et al., 2001; Cho et al., 2021). In our model, five individual prognostic factors, namely NSE, PLR, ALP, intrathoracic metastasis, and targeted therapy, were identified. Notably, these factors were not considered in previously published clinical prediction models for NSCLC with BM. For instance, Jacot et al. (2001) demonstrated that high serum NSE levels were associated with a worse prognosis in NSCLC patients with BM, suggesting a correlation between elevated NSE and the extent of tumour-induced damage to normal brain tissue. On the other hand, PLR serves as an index of inflammation, and a study by Cho et al. (2021) reported that an increase of 10 in PLR was associated with a 1.3% increase in the risk of death in NSCLC patients with BM. These findings might be attributed to the association between inflammation and cancer progression, wherein elevated platelet levels result in the production of inflammatory cytokines and chemokines, thereby facilitating tumour progression (Lim et al., 2019). Additionally, lymphocytes play a crucial role in antitumour immunity, and a decrease in lymphocyte count indicates an impaired cell-mediated immune response and compromised antitumour immunity (Jiang et al., 2019). Jacot et al. (2001) observed that NSCLC patients with BM and elevated ALP levels had shorter survival. Moreover, the use of TKIs targeting driver mutations in NSCLC, such as EGFR-TKIs and anaplastic lymphoma receptor tyrosine kinase gene (ALK)-TKIs, has significantly improved the prognosis of NSCLC patients with BM who possess corresponding gene mutations (Rotow & Bivona, 2017; Planchard et al., 2018). Based on the above evidence, the predictors incorporated into our model are valid and plausible. However, it is worth noting that there is a controversial point regarding intrathoracic metastasis. While the study by Hirashima et al. (2014) found intrathoracic metastasis to be a significant favourable prognostic factor for NSCLC patients with distant metastases, our study arrived at the opposite conclusion, suggesting that intrathoracic metastasis is an unfavourable prognostic factor in NSCLC patients with BM. Further research is warranted to validate these findings and address the existing conflicts.

Few studies have developed predictive models for assessing the prognostic risk of NSCLC patients with BM, and these studies have certain limitations. For instance, Wang et al. (2021) constructed clinical prediction models using univariate analysis to screen predictors (Zhang et al., 2020; Wang et al., 2021; Huang et al., 2020). However, according to the prediction model risk of bias assessment tool guidelines, bias can arise when univariate analysis leads to the exclusion of certain variables from the, as some predictors only demonstrate significance when adjusted for other factors simultaneously during analysis (Moons et al., 2019). In contrast, our model is based on LASSO regression, which effectively filters predictors. This method is a robust reduction algorithm that actively selects relevant and interpretable predictors from a large pool of variables, considering possible multicollinearity. Moreover, it helps avoid model overfitting (Balachandran et al., 2015; McEligot et al., 2020). Another study by Zhang et al. (2020) constructed a predictive model based on a predictor known as the computed tomography imaging histology score (Rad-score). However, the clinical applicability of their model is limited since the Rad-score predictor is not readily available during hospitalisation as it is not a routine examination item (Zhang et al., 2020). In contrast, our model incorporates readily available predictors, enhancing its clinical applicability. Similarly, Li et al. (2022) also established a nomogram combining patient clinicopathological factors and serological inflammatory markers (PLR, NLR, SII, PNI, and ALI) to predict survival in NSCLC patients with BM. Our study not only includes these indices but also incorporates a range of lung cancer-related tumour markers, such as CEA, NSE, CYFRA21, and SCCA, which have a potential impact on the prognosis of NSCLC patients with BM.

In summary, our LASSO-Cox regression analysis revealed that our prediction model provides a good fit for predicting OS in NSCLC patients with BM. The calibration plots demonstrated good calibration, while the time-dependent C-index analysis confirmed the model’s strong prognostic accuracy compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging. Additionally, DCA revealed that our model yields the highest overall net benefit. Moreover, the IDI and NRI results showed significant improvements in predictive power and reclassification ratio when compared with RPA, GPA, Lung-molGPA, BSBM, and TNM staging. Notably, our model effectively stratified NSCLC patients with BM into low-, intermediate-, and high-risk subgroups, with the high-risk group exhibiting the worst survival outcomes. In conclusion, our clinical prediction model offers numerous advantages, including cost-effectiveness, broad applicability, simplicity of use, accessibility, and high accuracy. It holds great potential for predicting prognosis and aiding treatment decisions in NSCLC patients with BM.

Our study has certain limitations. First, the retrospective nature of our study introduces the possibility of selection bias, information bias, and confounding bias. Second, the data were obtained from a single hospital, and the sample size was relatively small. Therefore, future studies incorporating larger, multicentre cohorts are warranted to validate our findings. Furthermore, the inclusion of 15 predictive factors in our model might limit its clinical applicability due to the complexity associated with a large number of predictors. Finally, while our model incorporates easily accessible predictors, it is important to recognise that the specificity of the prognostic model could be improved by including NSCLC-related immunohistochemical markers or other relevant genetic mutations, such as programmed death-ligand 1, cytotoxic T-lymphocyte–associated antigen 4, ALK rearrangement, and ROS1 rearrangement (Ahmadzada et al., 2018).

Conclusions

The clinical prediction model we constructed holds the potential for predicting OS in NSCLC patients with BM, outperforming established models such as RPA, GPA, Lung-molGPA, BSBM, and TNM staging.

Supplemental Information

Supplemental Information 1. Original clinical information of 300 patients.

The continuous variables in the formula based on the original numerical levels and the codes representing the categorical variables.

DOI: 10.7717/peerj.15678/supp-1

Acknowledgments

We express our appreciation to Bullet Edits for their expert language services.

Funding Statement

The authors received no funding for this work.

Contributor Information

Chao Liu, Email: liuchao@kmmu.edu.cn.

Zhi-Yong Deng, Email: 13888158986@163.com.

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Fei Hou conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Yan Hou performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Xiao-Dan Sun performed the experiments, prepared figures and/or tables, and approved the final draft.

Jia lv performed the experiments, prepared figures and/or tables, and approved the final draft.

Hong-Mei Jiang analyzed the data, prepared figures and/or tables, and approved the final draft.

Meng Zhang analyzed the data, prepared figures and/or tables, and approved the final draft.

Chao Liu conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Zhi-Yong Deng conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The clinicopathological characteristics and laboratory results of these patients are available in the Supplementary File.

References

  • Ahmadzada et al. (2018).Ahmadzada T, Kao S, Reid G, Boyer M, Mahar A, Cooper WA. An update on predictive biomarkers for treatment selection in non-small cell lung cancer. Journal of Clinical Medicine. 2018;7(6):153. doi: 10.3390/jcm7060153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Assel, Sjoberg & Vickers (2017).Assel M, Sjoberg DD, Vickers AJ. The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models. Diagnostic and Prognostic Research. 2017;1:19. doi: 10.1186/s41512-017-0020-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Austin, Allignol & Fine (2017).Austin PC, Allignol A, Fine JP. The number of primary events per variable affects estimation of the subdistribution hazard competing risks model. Journal of Clinical Epidemiology. 2017;83:75–84. doi: 10.1016/j.jclinepi.2016.11.017. [DOI] [PubMed] [Google Scholar]
  • Balachandran et al. (2015).Balachandran VP, Gonen M, Smith JJ, De Matteo RP. Nomograms in oncology: more than meets the eye. The Lancet Oncology. 2015;16(4):e173-e180. doi: 10.1016/S1470-2045(14)71116-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Carrington et al. (2023).Carrington AM, Manuel DG, Fieguth P, Ramsay TO, Osmani V, Wernly B, Bennett C, Hawken S, Magwood O, Sheikh Y, McInnes M, Holzinger A. Deep ROC analysis and AUC as balanced average accuracy, for improved classifier selection, audit and explanation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023;45(1):329–340. doi: 10.1109/TPAMI.2022.3145392. [DOI] [PubMed] [Google Scholar]
  • Cho et al. (2021).Cho A, Untersteiner H, Hirschmann D, Fitschek F, Dorfer C, Rossler K, Zochbauer-Muller S, Gatterbauer B, Hochmair MJ, Frischer JM. Pre-radiosurgery leucocyte ratios and modified glasgow prognostic score predict survival in non-small cell lung cancer brain metastases patients. Journal of Neuro-Oncology. 2021;151(2):257–265. doi: 10.1007/s11060-020-03660-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Collins et al. (2015).Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. doi: 10.1136/bmj.g7594. [DOI] [PubMed] [Google Scholar]
  • Fuchs et al. (2021).Fuchs J, Fruh M, Papachristofilou A, Bubendorf L, Hauptle P, Jost L, Zippelius A, Rothschild SI. Resection of isolated brain metastases in non-small cell lung cancer (NSCLC) patients - evaluation of outcome and prognostic factors: a retrospective multicenter study. PLOS ONE. 2021;16(6):e0253601. doi: 10.1371/journal.pone.0253601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gao et al. (2018).Gao HX, Huang SG, Du JF, Zhang XC, Jiang N, Kang WX, Mao J, Zhao Q. Comparison of prognostic indices in NSCLC patients with brain metastases after radiosurgery. International Journal of Biological Sciences. 2018;14(14):2065–2072. doi: 10.7150/ijbs.28608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gaspar et al. (2000).Gaspar LE, Scott C, Murray K, Curran W. Validation of the RTOG recursive partitioning analysis (RPA) classification for brain metastases. International Journal of Radiation Oncology*Biology*Physics. 2000;47(4):1001–1006. doi: 10.1016/s0360-3016(00)00547-2. [DOI] [PubMed] [Google Scholar]
  • Hirashima et al. (2014).Hirashima T, Suzuki H, Okamoto N, Morishita N, Yamadori T, Tamiya M, Shiroyama T, Kurata K, Kawase I. Important factors for achieving survival of five years or more in non-small cell lung cancer patients with distant metastasis. Oncology Letters. 2014;8(1):327–334. doi: 10.3892/ol.2014.2107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Huang et al. (2020).Huang Z, Hu C, Tong Y, Fan Z, Zhao C. Construction of a nomogram to predict the prognosis of non-small-cell lung cancer with brain metastases. Medicine. 2020;99(31):e21339. doi: 10.1097/MD.0000000000021339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Jacot et al. (2001).Jacot W, Quantin X, Boher JM, Andre F, Moreau L, Gainet M, Depierre A, Quoix E, Chevalier TL, Pujol JL. Association d’Enseignement et de Recherche des Internes en O. Brain metastases at the time of presentation of non-small cell lung cancer: a multi-centric AERIO analysis of prognostic factors. British Journal of Cancer. 2001;84(7):903–909. doi: 10.1054/bjoc.2000.1706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Jiang et al. (2019).Jiang T, Bai Y, Zhou F, Li W, Gao G, Su C, Ren S, Chen X, Zhou C. Clinical value of neutrophil-to-lymphocyte ratio in patients with non-small-cell lung cancer treated with PD-1/PD-L1 inhibitors. Lung Cancer. 2019;130:76–83. doi: 10.1016/j.lungcan.2019.02.009. [DOI] [PubMed] [Google Scholar]
  • Junger et al. (2021).Junger ST, Reinecke D, Meissner AK, Goldbrunner R, Grau S. Resection of symptomatic non-small cell lung cancer brain metastasis in the setting of multiple brain metastases. Journal of Neurosurgery. 2021;136:1–7. doi: 10.3171/2021.7.JNS211172. [DOI] [PubMed] [Google Scholar]
  • Li et al. (2022).Li X, Gu W, Liu Y, Wen X, Tian L, Yan S, Chen S. A novel quantitative prognostic model for initially diagnosed non-small cell lung cancer with brain metastases. Cancer Cell International. 2022;22(1):251–261. doi: 10.1186/s12935-022-02671-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lim et al. (2019).Lim JU, Yeo CD, Kang HS, Park CK, Kim JS, Kim JW, Kim SJ, Lee SH. Elevated pretreatment platelet-to-lymphocyte ratio is associated with poor survival in stage IV non-small cell lung cancer with malignant pleural effusion. Scientific Reports. 2019;9(1):4721. doi: 10.1038/s41598-019-41289-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • McEligot et al. (2020).McEligot AJ, Poynor V, Sharma R, Panangadan A. Logistic LASSO regression for dietary intakes and breast cancer. Nutrients. 2020;12(9):2652–2665. doi: 10.3390/nu12092652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Moons et al. (2019).Moons KGM, Wolff RF, Riley RD, Whiting PF, Westwood M, Collins GS, Reitsma JB, Kleijnen J, Mallett S. PROBAST: a tool to assess risk of bias and applicability of prediction model studies: explanation and elaboration. Annals of Internal Medicine. 2019;170(1):W1–W33. doi: 10.7326/M18-1377. [DOI] [PubMed] [Google Scholar]
  • Ouyang et al. (2020).Ouyang W, Yu J, Zhou Y, Xu Y, Li J, Gong J, Zhang J, Xie C. Metachronous brain metastasis in patients with EGFR-mutant NSCLC indicates a worse prognosis. Journal of Cancer. 2020;11(24):7283–7290. doi: 10.7150/jca.46462. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Peduzzi et al. (1996).Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. Journal of Clinical Epidemiology. 1996;49(12):1373–1379. doi: 10.1016/s0895-4356(96)00236-3. [DOI] [PubMed] [Google Scholar]
  • Pilz, Manegold & Schmid-Bindert (2012).Pilz LR, Manegold C, Schmid-Bindert G. Statistical considerations and endpoints for clinical lung cancer studies: Can progression free survival (PFS) substitute overall survival (OS) as a valid endpoint in clinical trials for advanced non-small-cell lung cancer? Translational Lung Cancer Research. 2012;1(1):26–35. doi: 10.3978/j.issn.2218-6751.2011.12.08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Planchard et al. (2018).Planchard D, Popat S, Kerr K, Novello S, Smit EF, Faivre-Finn C, Mok TS, Reck M, Van Schil PE, Hellmann MD, Peters S, Committee EG. Metastatic non-small cell lung cancer: ESMO clinical practice guidelines for diagnosis, treatment and follow-up. Annals of Oncology. 2018;29(Suppl 4):iv192–iv237. doi: 10.1093/annonc/mdy275. [DOI] [PubMed] [Google Scholar]
  • R Core Team (2022).R Core Team . Vienna: R Foundation for Statistical Computing; 2022. [Google Scholar]
  • Rodrigus, de Brouwer & Raaymakers (2001).Rodrigus P, de Brouwer P, Raaymakers E. Brain metastases and non-small cell lung cancer. Prognostic factors and correlation with survival after irradiation. Lung Cancer. 2001;32(2):129–136. doi: 10.1016/s0169-5002(00)00227-0. [DOI] [PubMed] [Google Scholar]
  • Rotow & Bivona (2017).Rotow J, Bivona TG. Understanding and targeting resistance mechanisms in NSCLC. Nature Reviews Cancer. 2017;17(11):637–658. doi: 10.1038/nrc.2017.84. [DOI] [PubMed] [Google Scholar]
  • Sanchez de Cos et al. (2009).Sanchez de Cos J, Sojo Gonzalez MA, Montero MV, Perez Calvo MC, Vicente MJ, Valle MH. Non-small cell lung cancer and silent brain metastasis. Survival and prognostic factors. Lung Cancer. 2009;63(1):140–145. doi: 10.1016/j.lungcan.2008.04.013. [DOI] [PubMed] [Google Scholar]
  • Schneider et al. (2022).Schneider C, Aubert CE, Del Giovane C, Donze JD, Gastens V, Bauer DC, Blum MR, Dalleur O, Henrard S, Knol W, O’Mahony D, Curtin D, Lee SJ, Aujesky D, Rodondi N, Feller M. Comparison of 6 mortality risk scores for prediction of 1-year mortality risk in older adults with multimorbidity. JAMA Network Open. 2022;5(7):e2223911. doi: 10.1001/jamanetworkopen.2022.23911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Schuler et al. (2016).Schuler M, Wu YL, Hirsh V, O’Byrne K, Yamamoto N, Mok T, Popat S, Sequist LV, Massey D, Zazulina V, Yang JC. First-line afatinib versus chemotherapy in patients with non-small cell lung cancer and common epidermal growth factor receptor gene mutations and brain metastases. Journal of Thoracic Oncology. 2016;11(3):380–390. doi: 10.1016/j.jtho.2015.11.014. [DOI] [PubMed] [Google Scholar]
  • Sperduto et al. (2008).Sperduto PW, Berkey B, Gaspar LE, Mehta M, Curran W. A new prognostic index and comparison to three other indices for patients with brain metastases: an analysis of 1,960 patients in the RTOG database. International Journal of Radiation Oncology*Biology*Physics. 2008;70(2):510–514. doi: 10.1016/j.ijrobp.2007.06.074. [DOI] [PubMed] [Google Scholar]
  • Sperduto et al. (2017).Sperduto PW, Yang TJ, Beal K, Pan H, Brown PD, Bangdiwala A, Shanley R, Yeh N, Gaspar LE, Braunstein S, Sneed P, Boyle J, Kirkpatrick JP, Mak KS, Shih HA, Engelman A, Roberge D, Arvold ND, Alexander B, Awad MM, Contessa J, Chiang V, Hardie J, Ma D, Lou E, Sperduto W, Mehta MP. Estimating survival in patients with lung cancer and brain metastases: an update of the graded prognostic assessment for lung cancer using molecular markers (Lung-molGPA) JAMA Oncology. 2017;3(6):827–831. doi: 10.1001/jamaoncol.2016.3834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sung et al. (2020).Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. GLOBOCAN estimates of incidence and mortality worldwide for 36 Cancers in 185 countries. CA A Cancer Journal for Clinicians 2021. 2020;71(3):209–249. doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
  • Van Calster et al. (2018).Van Calster B, Wynants L, Verbeek JFM, Verbakel JY, Christodoulou E, Vickers AJ, Roobol MJ, Steyerberg EW. Reporting and interpreting decision curve analysis: a guide for investigators. European Urology. 2018;74(6):796–804. doi: 10.1016/j.eururo.2018.08.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wang et al. (2021).Wang J, Zhang B, Pang Q, Zhang T, Chen X, Er P, Wang Y, You J, Wang P. A nomogram for predicting brain metastases of EGFR-mutated lung adenocarcinoma patients and estimating the efficacy of therapeutic strategies. Journal of Thoracic Disease. 2021;13(2):883–882. doi: 10.21037/jtd-20-1587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yu et al. (2021).Yu X, Sheng J, Pan G, Fan Y. Real-world utilization of EGFR TKIs and prognostic factors for survival in EGFR-mutated non-small cell lung cancer patients with brain metastases. International Journal of Cancer. 2021;149(5):1121–1128. doi: 10.1002/ijc.33677. [DOI] [PubMed] [Google Scholar]
  • Zhang et al. (2020).Zhang J, Jin J, Ai Y, Zhu K, Xiao C, Xie C, Jin X. Computer tomography radiomics-based nomogram in the survival prediction for brain metastases from non-small cell lung cancer underwent whole brain radiotherapy. Frontiers in Oncology. 2020;10:610691. doi: 10.3389/fonc.2020.610691. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Information 1. Original clinical information of 300 patients.

The continuous variables in the formula based on the original numerical levels and the codes representing the categorical variables.

DOI: 10.7717/peerj.15678/supp-1

Data Availability Statement

The following information was supplied regarding data availability:

The clinicopathological characteristics and laboratory results of these patients are available in the Supplementary File.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES