Summary
Prostate cancer is the most common cancer in men in the Western world. One-third of the patients with prostate cancer will develop resistance to hormonal therapy and progress into metastatic castration-resistant prostate cancer (mCRPC). Currently, docetaxel is a preferred treatment for mCRPC. However, about 20% of the patients will undergo early therapeutic failure owing to adverse events induced by docetaxel-based chemotherapy. There is an emergent need for a computational model that can accurately stratify patients into docetaxel-tolerable and docetaxel-intolerable groups. Here we present the best-performing algorithm in the Prostate Cancer DREAM Challenge for predicting adverse events caused by docetaxel treatment. We integrated the survival status and severity of adverse events into our model, which is an innovative way to complement and stratify the treatment discontinuation information. Critical stratification biomarkers were further identified in determining the treatment discontinuation. Our model has the potential to improve future personalized treatment in mCRPC.
Subject Areas: Bioinformatics, Cancer, Algorithms, Artificial Intelligence
Graphical Abstract

Highlights
-
•
Predicting the docetaxel treatment discontinuation in prostate cancer
-
•
The winning solution in the DREAM Prostate Cancer Challenge
-
•
Integrating survival status and adverse events to stratify treatment discontinuation
Bioinformatics; Cancer; Algorithms; Artificial Intelligence
Introduction
Prostate cancer is the most common cancer in men and the second leading cause of cancer-related mortality in the Western world (Gupta et al., 2014). Androgen deprivation therapy (ADT) is widely used to treat prostate cancer, but one-third of the patients will develop resistance to ADT, which is called castration-resistant prostate cancer (CRPC) (Kristiyanto et al., 2016). In general, patients with prostate cancer progress into CRPC in 18–48 months, and most deaths result from metastatic CRPC (mCRPC) with a median survival time of fewer than 2 years (Cookson et al., 2015). There are many other therapeutic options available for patients with progressive cancer. Docetaxel is one of the first-line treatments for mCRPC (Heidenreich et al., 2014). It has been shown that combined with prednisone, 75 mg/m2 docetaxel treatment for 3 weeks significantly reduced pain and improved overall survival time and quality of life in phase III trials (Tannock et al., 2004).
Although many studies have proved the positive effect of docetaxel on population-level survival (Berthold et al., 2008, Machiels et al., 2008), some patients become resistant to the therapy and stop the treatment (Petrylak et al., 2004). Approximately 10%–20% of the patients who were initially under docetaxel treatment discontinued the therapy owing to toxicity-induced adverse events (AEs) (Templeton et al., 2013). Since docetaxel-based chemotherapy is still an essential treatment for mCRPC and hormone-sensitive metastatic prostate cancer (Sweeney et al., 2015), it is important to identify whether a patient is able to tolerate the docetaxel therapy. However, whether the early discontinuation is predictable based on the clinical characteristics of a patient remains unclear (Seyednasrollah et al., 2017).
Recognizing the critical clinical need for identifying patients who are expected to suffer from immediate adverse events upon docetaxel treatment, the international Dialogue for Reverse Engineering Assessment and Methodology (DREAM) committee organized the Prostate Cancer Challenge, which called for the research community to develop algorithms for predicting AE (Seyednasrollah et al., 2017). In this challenge, a benchmark dataset was held out to systematically evaluate different computational methods. Here we report a model based on our winning solution to this challenge, which ranked first among 34 participating teams. In this model, we use 78 features associated with the laboratory test, metastases data, patient clinical features, and medical history. We find that the survival status is highly informative, complementing the prediction of discontinuation. We further reveal important predictive features that can be directly used to guide treatment in clinical settings. The model presented in this study not only sets the state of the field for personalized modeling of mCRPC treatment but also provides potential biomarkers and risk factors in the future prostate cancer treatment.
Results
Overview of the Experimental Design for Predicting Treatment Discontinuation in Prostate Cancer
The schematic illustration of our experimental design is shown in Figure 1. In this study, we used the data collected at three different cohorts from a total of 1,600 patients in phase III prostate cancer clinical trials Table 1. To address the cohort and batch effects, the original data were processed through missing value imputation and normalization (see details in Methods). We tested and compared different types of machine learning models via the standard 5-fold cross-validation experiments. The prediction performance was evaluated by area under the receiver operating characteristic curve (AUC) (Bradley, 1997) and area under the precision-recall curve (AUPRC). To understand the crucial features in predicting treatment discontinuation, we further performed feature importance analysis and identified top contributing features.
Figure 1.
The Workflow of the Algorithm Design for Predicting Early Discontinuation of Treatment in Prostate Cancer
The overall workflow consists of three main parts: (1) Data Collection; (2) Data Processing; (3) Model Construction & Inspection. Data were collected from three cohorts. In data processing, 78 features were used and normalized. Missing values were imputed. We generated two sets of gold standards for the subsequent model fitting, in which “New” (right in the “Data Processing” part) represents the gold standard assembled by the death, discontinuation, and treatment statuses. The “Model Construction & Inspection” part has three steps: (step 1) choose the best base learner; (step 2) construct a combinatory gold standard; (step 3) calculate the feature importance. All the performances were evaluated by randomly partitioning the data 10 times into 5-fold cross-validation, either on the full dataset or on an individual cohort. The final model submitted during the challenge was trained on the full dataset, which combined three cohorts.
Table 1.
Patient Characteristics of the Three Cohorts
| Cohorts | ASCENT2 (ASC) | CELGENE (CEL) | EFC6546 (VEN) |
|---|---|---|---|
| Basic Information | |||
| # Sample | 476 | 526 | 598 |
| Median age (years)a | 71 | 68 | 68 |
| Discontinuation Status | |||
| % Discontinuation | 22.05 | 7.79 | 8.52 |
| Median time to discontinuation (days) | 153.0 | 211.0 | 202.5 |
| % Discontinuation records missing | 0.00 | 18.06 | 0.00 |
| Death Status | |||
| % Death | 28.99 | 17.49 | 72.41 |
| Median time to death (days) | 357.0 | 279.0 | 642.5 |
| Treatment Status | |||
| % AE | 9.03 | 13.31 | 21.07 |
| % Possible AE | 46.85 | 45.24 | 22.24 |
| % Progression | 19.96 | 21.29 | 55.68 |
| % Complete | 24.16 | 0.00 | 0.17 |
| % Treatment status records missing | 0.00 | 20.15 | 0.84 |
“≥85” is converted to 85.
We first tested the prediction performance of five base learners: (1) linear regression, (2) logistic regression, (3) Cox regression (using AE as an endpoint), (4) bootstrap aggregation classification and regression trees (BAG-CART) (Sutton, 2005), and (5) random forest (RF) (Chen et al., 2004). The first three methods are linear or generalized linear statistical models, and the last two methods are nonlinear tree-based models. For this task of predicting treatment discontinuation primarily based on laboratory test and patient clinical features, we anticipated that the nonlinear interactions between input features would be crucial. We, therefore, investigated two tree-based methods, which have slightly different implementations—when building trees, BAG-CART considers all features for each node for a split and RF considers a random subset of features for each node for a split. We performed 5-fold cross-validation for ten times in two scenarios: (1) using the combined dataset of three cohorts (hereafter referred to as “the full dataset”; Figures 2A and 2B) and (2) using each cohort individually (Figures 2C and 2D).
Figure 2.
Evaluation of Different Base Learning Using AUC and AUPRC
The (A) AUCs and (B) AUPRCs of five base learners trained on the full dataset are shown in different colors. The (C) AUCs and (D) AUPRCs of these base learners trained on each cohort individually were also calculated. The colored boxes or violins represent the values of 10 times 5-fold cross-validation of five models, and the white ones represent the baselines. Of note, the AUC baseline of random prediction is always 0.5 in different cohorts. In contrast, the AUPRC baseline will change, which equals the percentage of early discontinuation events in a particular cohort.
In general, the discontinuation status can be predicted by the models trained from clinical features with those widely used base learners. When trained and evaluated on the full dataset, all the five base learners achieved acceptable performances, and the tree-based methods, including BAG-CART and RF, performed relatively better than the other three (Figures 2A and 2B). Similarly, when trained and evaluated on each cohort individually, the tree-based methods consistently achieved good performance, whereas the linear, logistic regression and Cox regression model performed even worse than the random prediction baseline in EFC6546(VEN) (Figure 2C; Table S1). The highest AUC scores were achieved by RF, RF, BAG-CART, RF, respectively in the full dataset and each of the three cohorts, where the median AUCs were 0.6269, 0.5701, 0.6562, and 0.5378 (Figure 2C). They also outperformed other base learners in terms of AUPRC, where the medians were 0.1999, 0.3080, 0.1792, and 0.0959 (Figure 2D). Of note, the average baseline AUPRCs of random prediction were 0.1308, 0.2146, 0.0952, and 0.0885, reflecting the overall ratio of positive cases. Since BAG-CART and RF are similar tree-based models and RF achieved the highest performance in two of the three cohorts and the full dataset, we chose RF as our base learner in the following analysis.
Cross-Reference Multiple Labels Improves Prediction Performance
To improve the performance of our model, we carefully inspected the dataset and found that multiple labels provided non-redundant information. The original definition of discontinuation was whether an AE occurred within 92 days. According to this definition, patients with AE at 93 days (high risk to discontinue) or 393 days (low risk to discontinue) were indistinguishable and their labels were identical. A major challenge was how to integrate this hidden information into a model. In fact, the death and treatment statuses were highly associated with the discontinuation. And a patient who died early (within 3 months) could be, although not definitely, associated with an adverse event. Therefore, we treated early death as another type of “adverse event” and assumed that patients labeled by “AE” and “possible AE” in treatment status should have a higher risk to discontinue than those labeled by “progress” and “complete.” These labels are intermediate variables reflecting diverse categories of the discontinuation reason, where “progress” means the treatment is in progress without AE and “complete” means the treatment has finished without AE.
Therefore, we integrated death status, discontinuation status. and treatment status (AE, possible AE, progression and complete) and created a new gold standard, retaining the original labels for a fair comparison. Specifically, we re-labeled the original discontinuation status: if a patient had AE or possible AE, or died early during the study, we treated this patient as a “discontinuation” case (see Methods). The distributions of the original and new labels are shown in Figures 3A and 3B, respectively. Notably, the high-risk patients (red) to discontinue treatment are separated from the low-risk ones (yellow and orange). Then we re-trained our model using this new gold standard and compared it with the model trained on the original discontinuation label. In all three cohorts, the performance was improved using the new gold standard (Table S2, Figures 3C and 3D). The AUC scores were increased to 0.6356, 0.5726, 0.6420, and 0.547, and the AUPRC scores were increased to 0.2001, 0.3089, 0.1598, and 0.1006, in the full dataset, ASCENT(ASC), CELGENE(CEL), and EFC6546(VEN) cohorts, respectively. This result indicates that the discontinuation label alone cannot fully reflect the status of a patient; the death and treatment status provides important and complementary information for more accurate predictions.
Figure 3.
Comparison of Models Using the Original Discontinuation Labels and New Labels
We evaluated the performance of our model using two types of gold standards: (A) the original discontinuation status and (B) our new gold standard integrating both treatment and death status. The colors represent the treatment status of each sample, and the shapes represent the death status. The vertical dash line is placed at 93 treatment days, which is the original time threshold of early discontinuation. The early death cases are re-labeled as positive, which are more likely caused by the treatment adverse events. Models using labels defined by different gold standards are evaluated by (C) AUC and (D) AUPRC. “Discontinuation” represents the model using labels of the discontinuation status, and the “New standard” represents the model using labels of the new gold standard. Notably, the AUC baseline is a constant 0.5 (so that it is a horizontal line instead of a box), whereas the AUPRC baseline changes along with the partition of the dataset.
Inspection of Important Features in Predicting Treatment Discontinuation
To find the key determinants of treatment discontinuation, we investigated the feature importance by calculating the delta-error of each feature in our random forest model (see Methods), on both the full dataset and subsets (ASC + CEL; CEL + VEN; VEN + ASC). We also colored the features according to their categories defined in the previous study (Seyednasrollah et al., 2017).
The top contributing features of models trained the full dataset and three individual cohorts are shown in Figure 4A and Figure S1, respectively. In general, laboratory test features play more important roles in this model, and the top 10 important features are consistent among different datasets (Figures 4B–4D, starred features). However, we also notice that, when trained on the combined dataset of CEL and VEN, the situation is different (Figure 4C). This may result from the cohort effects, which we will discuss in detail later. To investigate the impact of these features on patients with mCRPC, we first tested the linear relationship (Pearson's correlations) between the top 10 features and the risk of discontinuation on patients with mCRPC (Table S3). We observed that the impact of an individual feature on the discontinuation risk was weak because these clinical features had complex nonlinear interactions, which cannot be reflected by linear Pearson's correlation. This is also why the random forest model outperformed other base learners for this prediction task since the random forest was able to learn the non-linear interactions between features and robust to outliers or noises when the dataset is relatively small. We further calculated the correlations between the prediction of the random forest model using only the top 10 features and the treatment discontinuation (the last row in Table S3). This prediction can be regarded as a new non-linearly “combined” feature based on the top 10 features, and it has a much higher and significant correlation with the discontinuation risk on mCRPC patients.
Figure 4.
The Feature Importance Map of All 78 Features used in Our Model
(A) The importance of all features (features with negative importance have been removed). The top 10 important features are ALB, NA., TPRO, MG, TESTO, WBC, PHOS, MHVASC, NEU, and MHSOCIAL, which represent albumin, sodium, total protein, magnesium, testosterone, white blood cell, phosphorus, medical history of vascular disorders, neutrophils, and medical history of social circumstances, respectively. The colors of the bars represent the categories that the features belong to, and the sub-boxplot summaries the feature importance in each category. Note that there is only one feature that belongs to metastases.
(B–D) Top 10 important features in different subsets of data (B: ASC + CEL, C: CEL + VEN, D: VEN + ASC) are also presented here, where the stars represent the overlaps between them and the overall top 10 in panel A.
The top 10 important features are ALB (Albumin), NA. (Sodium), TPRO (Total Protein), MG (Magnesium), TESTO (Testosterone), WBC (White Blood Cells), PHOS (Phosphorus), MHVASC (Medical History: Vascular Disorders), NEU (Neutrophils), and MHSOCIAL (Medical History: Social Circumstance). Most of them play important roles in cancer-related diagnosis, prognosis, and therapy in previous studies, especially the biological features including serum albumin, sodium, magnesium, testosterone, phosphorus, and neutrophils (Halabi et al., 2014). Our findings are not incidental; in fact, many previous studies have reported these features for predicting the survival of patients with mCRPC. Serum albumin can be used to predict prognosis and recurrence for patients with prostate cancer (Sejima et al., 2013, Wang et al., 2017); neutrophils is also regarded as a prognostic factor (Langsenlehner et al., 2015, Sonpavde et al., 2014, Templeton et al., 2014), especially for the patients with mCRPC with docetaxel chemotherapy (Yao et al., 2015). Sodium (Na) and magnesium (Mg) regulate and influence cancer development as ions and compounds: sodium compounds can affect prostate cancer cell activities and the function of related proteins (Berggren et al., 2009, Kim et al., 2014), whereas the rate of Ca2+/Mg2+ is significantly associated with the risk and proliferation (Dai et al., 2011, Sun et al., 2013). Phosphorus, as another important chemical element for humans, has also been proven to be associated with the risk of high-grade prostate cancer (Wilson et al., 2014). And testosterone is also used as a therapeutic strategy and a predictive feature (Baillargeon et al., 2015, Mearini et al., 2013, Pastuszak et al., 2015), although its relationship with prostate cancer is controversial (Klap et al., 2015). Of note, these top features are highly associated with predicting treatment discontinuation, yet they do not imply causation. In this study, we focused on developing accurate machine learning models based on large datasets from three independent cohorts and identified the cross-cohort putative risk factors associated with (instead of causing) treatment discontinuation. Identifying etiological features requires further data collection and experimental validation, which is beyond the scope of this paper. Nevertheless, we here summarize literature evidence about these potential causal factors, which could provide guidance for future experimental design.
Validation on an External Cohort by Blind Test via the DREAM Platform
The validation of prediction was carried out through an objective evaluation venue. The international DREAM consortium had created a framework to unbiasedly evaluate the accuracy of different models (Guan, 2019). The participants were provided with the training data and the gold standard, and the test data, whereas the validation gold standard was hidden from participants. Then, through an online platform, the participants submitted a one-shot prediction file, which was evaluated automatically on the server.
Different gold standard creations and machine learning methods were explored by 34 teams during the challenge (Figure 5A). The basic gold standard labels are the discontinuation status (DISCONT) and censoring date (ENTRT_PC) of a patient, which were prevalently used by most teams (blue tiles in Figure 5A left). However, the death status (DEATH), date (LKADT_P), and the adverse effect (ENDTRS_C) of a patient were only used in our method, except for the death status was used once by another team. We integrated all these features and reconstructed the gold standard, which achieved the highest AUPRC for predicting discontinuation (the red model in Figure 5A). This comparison together with our cross-validation result suggests that gold standard reconstruction by capturing multiple sources of information was an important component of our high performance. Moreover, the base learners used by the participants were summarized and RF was used most by 14 of 34 teams. This suggests that the curation of the gold standard, rather than the exact machine learning base learner, distinguished our method from others.
Figure 5.
A Summary of the Challenge Results
All data were the final round results and retrieved from the challenge official website. Considering the potential privacy problems, all the team names except ours are hidden in this figure.
(A) Information of gold standard constructions, model categories, and learning strategies of the 34 teams (the left heatmap), as well as the final AUPRC estimation results (the right barplot). In summary, there were five variables used in the gold standard construction and 13 types of base learners. The participants mainly used ensemble study (Ensembl in heatmap x axis) to improve their models, and some of them reconstructed the gold standard (Reconstruction in heatmap x axis) for better performance. The blue tiles in the heatmap represent the variables or base learners used by the teams, whereas the white ones represent variables/learners that were not used. The gray tiles represent the missing information, and the red ones highlight our method. The barplot on the right shows the final AUPRC scores, where the dashed line represents the baseline of the validation data: 0.104.
(B) The p value of how significant a model is better than the score of 5,000 random predictions.
(C) The Bayes factor for the top performers. The Bayes factor is calculated by comparing each team with the best performer.
The validation cohort of this challenge comes from the comparative arm (docetaxel-placebo) of ENTHUSE33 (Fizazi et al., 2013), a phase III trial previously designed for studying the effect of Docetaxel in Combination With Zibotentan in Patients With mCRPC, including 470 patients. Although the failure of the trial results was previously published, the data were not publicly available, allowing it to be used as an independent validation cohort in this study. In this validation cohort, our algorithm achieved the top performance among the competing teams with an AUPRC of 0.190, compared with the random baseline of 0.104 (p = 0.003, Figure 5). It means that if today's therapy would have mistakenly put an expected 104 out of 1,000 patients to docetaxel treatment, with our algorithm deployed in place, we will be able to move (0.190–0.104)/(1–0.104)∗104, approximately 10 patients out of a wrong therapy. As with any personalized medicine algorithm using machine learning, a perfect prediction is our DREAM to pursue, but it is still a dream. However, translating this 10 of 104 patients into a type of cancer that affects the largest number of men in this world, it is expected to have a significant impact on the care to our patients.
Discussion
In this work, we report a machine learning model for predicting the docetaxel treatment discontinuation in mCRPCs. We find that tree-based methods are suited to this problem and outperform the Cox regression, one of the most popular methods in survival analysis. This phenomenon originated from the nature of the clinical dataset, where the features had complicated high-level interactions with each other in determining the treatment discontinuation. Tree-based methods are good at constructing models on a complex or non-linear dataset, and random forest applies the strategy of training on different parts of the dataset and averaging multiple decision trees to reduce the variance and avoid overfitting (Breiman, 2001, Li and Guan, 2019); thus, tree-based methods, especially random forest, are more suitable for such problems.
Apart from the best base learner selection, we also observed that the assembled gold standard would generate better predictions than using discontinuation status alone. This result reveals that the mCRPC clinical features have potential relationships with the risk of discontinuation of patients, which was reflected by the treatment status and the discontinuation time in this dataset. A better method to evaluate the risk should give even better performance than ours because the binary combination cannot reflect the risk order entirely. Our work provides an attempt to integrate multiple information to address this problem. A dataset with continuous confidence labels instead of binary labels would facilitate method development in the future.
One limitation of this work results from the data availability. Most of the features in this dataset have more than 25% of their records missed (Figure S2). This situation can be even more severe when considering the cohort individually, such as albumin (ALB), which is totally missing in ASC. To impute the missing values, we just simply replaced them with the average of the corresponding feature, which may introduce biases, and intensify the batch effects among cohorts and complexity among features. This should be another reason why linear, logistic regression and Cox regression failed to generate acceptable models. A more intact dataset should generate more precision models.
The clinical data itself is largely biased by cohort effects. The inconsistent results show not only in the feature importance calculation but also in our previous attempts to evaluate models cross cohorts even after normalizing. So we used t-distributed stochastic neighbor embedding (t-SNE) (Van Der Maaten and Hinton, 2008) to visualize the differences among cohorts (Figure S3). This figure shows the separation of these three cohorts, in which CEL and VEN have a closer relationship than to ASC. This explains why the results are inconsistent between the datasets with and without ASC.
Overall, our research provides a positive answer to the question of whether the early discontinuation can be predicted from basic clinical information and points out that random forest is the most suitable base learner. Our assembled gold standard performance also suggests that considering the risk of discontinuation for each patient rather than the binary label of early discontinuation can further improve the accuracy of prediction.
Limitation of the Study
-
1.
Feature availability. Most of the features have more than 25% missing values. And some of them are even completely missing in one cohort, like albumin (ALB) in the ASC cohort. Our imputation using the average of non-missing values may introduce biases into their original distributions.
-
2.
Cohort effects. Data in the competition were provided by three providers, which could include cohort effects. Although we have normalized the whole data, our results indicate the existence of these effects. The performances vary when we built and tested models with and without the ASC cohort.
-
3.
Base model. In recent years, releases of Xgboost (Chen and Guestrin, 2016) and LightGBM (Ke et al., 2017) provide us more efficient and powerful tools for constructing and training tree-based models. They may achieve better performances than the random forest model.
Methods
All methods can be found in the accompanying Transparent Methods supplemental file.
Acknowledgments
This work is supported by NSF-US14-PAF07599 (#1452656; CAREER: On-line Service for Predicting Protein Phosphorylation Dynamics Under Unseen Perturbations NSF), R35-GM133346-01 (Machine Learning for Drug Response Prediction), AWD007950 (Digital Biomarkers in Voices for Parkinson's Disease American Parkinson's Disease Association), University of Michigan O'Brien Kidney Translational Core Center, and Michael J. Fox Foundation #17373 to Y.G. This work is also supported by 19AMTG34850176 (American Heart Association and Amazon Web Services 3.0 Data Grant Portfolio: Artificial Intelligence and Machine Learning Training Grants) to H.L. We thank the GPU donation from Nvidia and the AWS donation from Amazon.
We thank the Prostate Cancer DREAM Challenge organizers:
Justin Guinney (Sage Bionetworks); Tao Wang (UT Southwestern); Brian Bot (Sage Bionetworks); Christopher Bare (Sage Bionetworks); Mike Kellen (Sage Bionetworks); Stephen Jones (Covance); Elise Blaese (IBM); Joseph Morrell (Project Data Sphere, LLC); Thea Norman (Sage Bionetworks); Kald Abdallah (Project Data Sphere, LLC); Stephen Friend (Sage Bionetworks); Yang Xie (UT Southwestern); James Costello (University of Colorado Anschutz Medical Campus); Gustavo Stolovitzky (IBM Research & Icahn School of Medicine at Mt. Sinai); Liz Zhou (Project Data Sphere, LLC)
We thank the Prostate Cancer DREAM Challenge data providers:
AstraZeneca; Celgene; Sanofi; Memorial Sloan Kettering Cancer Center.
Author Contributions
Y.G. designed and implemented the best-performing algorithm in the Prostate Cancer DREAM Challenge. Post-challenge, K.D. and H.L. performed computational analysis of the observed and predicted data and analyzed model performance and feature importance. K.D. prepared figures. K.D., H.L., and Y.G. wrote this manuscript. All authors read and approved the final manuscript.
Declaration of Interests
Y.G. receives personal payment from Eli Lilly and Company, Genentech Inc, F. Hoffmann-La Roche AG, and Cleerly Inc; holds equity shares at Cleerly Inc and Ann Arbor Algorithms Inc; receives research support from Merck KGaA as research contracts and Ryss Tech as unrestricted donations.
Published: February 21, 2020
Footnotes
Supplemental Information can be found online at https://doi.org/10.1016/j.isci.2019.100804.
Supplemental Information
References
- Baillargeon J., Kuo Y.-F., Fang X., Shahinian V.B. Long-term exposure to testosterone therapy and the risk of high grade prostate cancer. J. Urol. 2015;194:1612–1616. doi: 10.1016/j.juro.2015.05.099. [DOI] [PubMed] [Google Scholar]
- Berggren M., Sittadjody S., Song Z., Samira J.-L., Burd R., Meuillet E.J. Sodium selenite increases the activity of the tumor suppressor protein, PTEN, in DU-145 prostate cancer cells. Nutr. Cancer. 2009;61:322–331. doi: 10.1080/01635580802521338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berthold D.R., Pond G.R., Soban F., de Wit R., Eisenberger M., Tannock I.F. Docetaxel plus prednisone or mitoxantrone plus prednisone for advanced prostate cancer: updated survival in the TAX 327 study. J. Clin. Oncol. 2008;26:242–245. doi: 10.1200/JCO.2007.12.4008. [DOI] [PubMed] [Google Scholar]
- Bradley A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn. 1997;30:1145–1159. [Google Scholar]
- Breiman L. Random forests. Mach. Learn. 2001;45:5–32. [Google Scholar]
- Chen C., Liaw A., Breiman L. vol. 110. University of California, Berkeley; 2004. pp. 1–12. (Using random forest to learn imbalanced data). [Google Scholar]
- Chen T., Guestrin C. In: Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16. ACM; New York, NY, USA: 2016. XGBoost: A Scalable Tree Boosting System; pp. 785–794. [Google Scholar]
- Cookson M.S., Lowrance W.T., Murad M.H., Kibel A.S., American Urological Association Castration-resistant prostate cancer: AUA guideline amendment. J. Urol. 2015;193:491–499. doi: 10.1016/j.juro.2014.10.104. [DOI] [PubMed] [Google Scholar]
- Dai Q., Motley S.S., Smith J.A., Jr., Concepcion R., Barocas D., Byerly S., Fowke J.H. Blood magnesium, and the interaction with calcium, on the risk of high-grade prostate cancer. PLoS One. 2011;6:e18237. doi: 10.1371/journal.pone.0018237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fizazi K., Fizazi K.S., Higano C.S., Nelson J.B., Gleave M., Miller K., Morris T., Nathan F.E., McIntosh S., Pemberton K., Moul J.W. Phase III, randomized, placebo-controlled study of docetaxel in combination with zibotentan in patients with metastatic castration-resistant prostate cancer. J. Clin. Oncol. 2013;31:1740–1747. doi: 10.1200/JCO.2012.46.4149. [DOI] [PubMed] [Google Scholar]
- Guan Y. Waking up to data challenges. Nat. Mach. Intell. 2019;1:67. doi: 10.1038/s42256-018-0011-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gupta E., Guthrie T., Tan W. Changing paradigms in management of metastatic castration resistant prostate cancer (mCRPC) BMC Urol. 2014;14:55. doi: 10.1186/1471-2490-14-55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Halabi S., Lin C.-Y., Kelly W.K., Fizazi K.S., Moul J.W., Kaplan E.B., Morris M.J., Small E.J. Updated prognostic model for predicting overall survival in first-line chemotherapy for patients with metastatic castration-resistant prostate cancer. J. Clin. Oncol. 2014;32:671–677. doi: 10.1200/JCO.2013.52.3696. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heidenreich A., Bastian P.J., Bellmunt J., Bolla M., Joniau S., van der Kwast T., Mason M., Matveev V., Wiegel T., Zattoni F., Mottet N., European Association of Urology EAU guidelines on prostate cancer. Part II: treatment of advanced, relapsing, and castration-resistant prostate cancer. Eur. Urol. 2014;65:467–479. doi: 10.1016/j.eururo.2013.11.002. [DOI] [PubMed] [Google Scholar]
- Ke G., Meng Q., Finley T., Wang T., Chen W., Ma W., Ye Q., Liu T.-Y. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In: Guyon I., Luxburg U.V., Bengio S., Wallach H., Fergus R., Vishwanathan S., Garnett R., editors. Advances in Neural Information Processing Systems 30. Curran Associates, Inc.; 2017. pp. 3146–3154. [Google Scholar]
- Kim Y., Jeong I.G., You D., Song S.H., Suh N., Jang S.-W., Kim S., Hwang J.J., Kim C.-S. Sodium meta-arsenite induces reactive oxygen species-dependent apoptosis, necrosis, and autophagy in both androgen-sensitive and androgen-insensitive prostate cancer cells. Anticancer Drugs. 2014;25:53–62. doi: 10.1097/CAD.0000000000000013. [DOI] [PubMed] [Google Scholar]
- Klap J., Schmid M., Loughlin K.R. The relationship between total testosterone levels and prostate cancer: a review of the continuing controversy. J. Urol. 2015;193:403–413. doi: 10.1016/j.juro.2014.07.123. [DOI] [PubMed] [Google Scholar]
- Kristiyanto D., Anderson K.E., Hung L.-H., Yeung K.Y. Predicting discontinuation of docetaxel treatment for metastatic castration-resistant prostate cancer (mCRPC) with random forest. F1000Res. 2016;5 [Google Scholar]
- Langsenlehner T., Thurner E.-M., Krenn-Pilko S., Langsenlehner U., Stojakovic T., Gerger A., Pichler M. Validation of the neutrophil-to-lymphocyte ratio as a prognostic factor in a cohort of European prostate cancer patients. World J. Urol. 2015;33:1661–1667. doi: 10.1007/s00345-015-1494-7. [DOI] [PubMed] [Google Scholar]
- Li H., Guan Y. Machine learning empowers phosphoproteome prediction in cancers. Bioinformatics. 2019 doi: 10.1093/bioinformatics/btz639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Machiels J.-P., Mazzeo F., Clausse M., Filleul B., Marcelis L., Honhon B., D'Hondt L., Dopchie C., Verschaeve V., Duck L. Prospective randomized study comparing docetaxel, estramustine, and prednisone with docetaxel and prednisone in metastatic hormone-refractory prostate cancer. J. Clin. Oncol. 2008;26:5261–5268. doi: 10.1200/JCO.2008.16.9524. [DOI] [PubMed] [Google Scholar]
- Mearini L., Zucchi A., Nunzi E., Villirillo T., Bini V., Porena M. Low serum testosterone levels are predictive of prostate cancer. World J. Urol. 2013;31:247–252. doi: 10.1007/s00345-011-0793-x. [DOI] [PubMed] [Google Scholar]
- Pastuszak A.W., Khanna A., Badhiwala N., Morgentaler A., Hult M., Conners W.P., Sarosdy M.F., Yang C., Carrion R., Lipshultz L.I., Khera M. Testosterone therapy after radiation therapy for low, intermediate and high risk prostate cancer. J. Urol. 2015;194:1271–1276. doi: 10.1016/j.juro.2015.05.084. [DOI] [PubMed] [Google Scholar]
- Petrylak D.P., Tangen C.M., Hussain M.H.A., Lara P.N., Jr., Jones J.A., Taplin M.E., Burch P.A., Berry D., Moinpour C., Kohli M. Docetaxel and estramustine compared with mitoxantrone and prednisone for advanced refractory prostate cancer. N. Engl. J. Med. 2004;351:1513–1520. doi: 10.1056/NEJMoa041318. [DOI] [PubMed] [Google Scholar]
- Sejima T., Iwamoto H., Masago T., Morizane S., Yao A., Isoyama T., Kadowaki H., Takenaka A. Low pre-operative levels of serum albumin predict lymph node metastases and ultimately correlate with a biochemical recurrence of prostate cancer in radical prostatectomy patients. Cent. European J. Urol. 2013;66:126–132. doi: 10.5173/ceju.2013.02.art3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seyednasrollah F., Koestler D.C., Wang T., Piccolo S.R., Vega R., Greiner R., Fuchs C., Gofer E., Kumar L., Wolfinger R.D. A DREAM challenge to build prediction models for short-term discontinuation of docetaxel in metastatic castration-resistant prostate cancer. JCO Clin. Cancer Inform. 2017;1:1–15. doi: 10.1200/CCI.17.00018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sonpavde G., Pond G.R., Armstrong A.J., Clarke S.J., Vardy J.L., Templeton A.J., Wang S.L., Paolini J., Chen I., Chow-Maneval E. Prognostic impact of the neutrophil-to-lymphocyte ratio in men with metastatic castration-resistant prostate cancer. Clin. Genitourin. Cancer. 2014;12:317–324. doi: 10.1016/j.clgc.2014.03.005. [DOI] [PubMed] [Google Scholar]
- Sun Y., Selvaraj S., Varma A., Derry S., Sahmoun A.E., Singh B.B. Increase in serum Ca2+/Mg2+ ratio promotes proliferation of prostate cancer cells by activating TRPM7 channels. J. Biol. Chem. 2013;288:255–263. doi: 10.1074/jbc.M112.393918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sutton C.D. 11-Classification and regression trees, bagging, and boosting. In: Rao C.R., Wegman E.J., Solka J.L., editors. vol. 24. Elsevier; 2005. pp. 303–329. (Handbook of Statistics). [Google Scholar]
- Sweeney C.J., Chen Y.-H., Carducci M., Liu G., Jarrard D.F., Eisenberger M., Wong Y.N., Hahn N., Kohli M., Cooney M.M. Chemohormonal therapy in metastatic hormone-sensitive prostate cancer. N. Engl. J. Med. 2015;373:737–746. doi: 10.1056/NEJMoa1503747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tannock I.F., de Wit R., Berry W.R., Horti J., Pluzanska A., Chi K.N., Oudard S., Théodore C., James N.D., Turesson I., TAX 327 Investigators Docetaxel plus prednisone or mitoxantrone plus prednisone for advanced prostate cancer. N. Engl. J. Med. 2004;351:1502–1512. doi: 10.1056/NEJMoa040720. [DOI] [PubMed] [Google Scholar]
- Templeton A.J., Pezaro C., Omlin A., McNamara M.G., Leibowitz-Amit R., Vera-Badillo F.E., Attard G., de Bono J.S., Tannock I.F., Amir E. Simple prognostic score for metastatic castration-resistant prostate cancer with incorporation of neutrophil-to-lymphocyte ratio. Cancer. 2014;120:3346–3352. doi: 10.1002/cncr.28890. [DOI] [PubMed] [Google Scholar]
- Templeton A.J., Vera-Badillo F.E., Wang L., Attalla M., De Gouveia P., Leibowitz-Amit R., Knox J.J., Moore M., Sridhar S.S., Joshua A.M. Translating clinical trials to clinical practice: outcomes of men with metastatic castration resistant prostate cancer treated with docetaxel and prednisone in and out of clinical trials. Ann. Oncol. 2013;24:2972–2977. doi: 10.1093/annonc/mdt397. [DOI] [PubMed] [Google Scholar]
- Van Der Maaten L., Hinton G. Visualizing high-dimensional data using t-sne. J. Machine Learn. Res. 2008;9:26. [Google Scholar]
- Wang Y., Chen W., Hu C., Wen X., Pan J., Xu F., Zhu Y., Shao X., Shangguan X., Fan L. Albumin and fibrinogen combined prognostic grade predicts prognosis of patients with prostate cancer. J. Cancer. 2017;8:3992–4001. doi: 10.7150/jca.21061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson K.M., Shui I.M., Mucci L.A., Giovannucci E. Calcium and phosphorus intake and prostate cancer risk: a 24-y follow-up study–. Am. J. Clin. Nutr. 2014;101:173–183. doi: 10.3945/ajcn.114.088716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yao A., Sejima T., Iwamoto H., Masago T., Morizane S., Honda M., Takenaka A. High neutrophil-to-lymphocyte ratio predicts poor clinical outcome in patients with castration-resistant prostate cancer treated with docetaxel chemotherapy. Int. J. Urol. 2015;22:827–833. doi: 10.1111/iju.12839. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





