Skip to main content
Frontiers in Oncology logoLink to Frontiers in Oncology
. 2021 Mar 10;11:604584. doi: 10.3389/fonc.2021.604584

Analysis of Gut Microbiome Using Explainable Machine Learning Predicts Risk of Diarrhea Associated With Tyrosine Kinase Inhibitor Neratinib: A Pilot Study

Chi Wah Wong 1, Susan E Yost 2, Jin Sun Lee 2, John D Gillece 3, Megan Folkerts 3, Lauren Reining 3, Sarah K Highlander 3, Zahra Eftekhari 1, Joanne Mortimer 2, Yuan Yuan 2,*
PMCID: PMC8008168  PMID: 33796451

Abstract

Neratinib has great efficacy in treating HER2+ breast cancer but is associated with significant gastrointestinal toxicity. The objective of this pilot study was to understand the association of gut microbiome and neratinib-induced diarrhea. Twenty-five patients (age ≥ 60) were enrolled in a phase II trial evaluating safety and tolerability of neratinib in older adults with HER2+ breast cancer (NCT02673398). Fifty stool samples were collected from 11 patients at baseline and during treatment. 16S rRNA analysis was performed and relative abundance data were generated. Shannon’s diversity was calculated to examine gut microbiome dysbiosis. An explainable tree-based approach was utilized to classify patients who might experience neratinib-related diarrhea (grade ≥ 1) based on pre-treatment baseline microbial relative abundance data. The hold-out Area Under Receiver Operating Characteristic and Area Under Precision-Recall Curves of the model were 0.88 and 0.95, respectively. Model explanations showed that patients with a larger relative abundance of Ruminiclostridium 9 and Bacteroides sp. HPS0048 may have reduced risk of neratinib-related diarrhea and was confirmed by Kruskal-Wallis test (p ≤ 0.05, uncorrected). Our machine learning model identified microbiota associated with reduced risk of neratinib-induced diarrhea and the result from this pilot study will be further verified in a larger study.

Clinical Trial Registration

ClinicalTrials.gov, identifier NCT02673398.

Keywords: gut microbiota, breast cancer, neratinib, diarrhea, artificial intelligence, explainable machine learning

Introduction

Neratinib is a potent small molecule tyrosine kinase inhibitor (TKI) that inhibits human epidermal growth factor receptors (HER1, HER2, and HER4). Neratinib has recently been granted FDA approval as extended therapy for early stage HER2+ breast cancer and in combination with capecitabine for treatment of HER2+ metastatic breast cancer (13). Despite excellent efficacy data, neratinib is associated with significant gastrointestinal (GI) toxicity, with grades 1–4 diarrhea observed in 95% of patients and grade 3–4 diarrhea in over 40% of patients in earlier trials (35). In nine trials of neratinib alone or in combination with other therapy, dose reductions due to diarrhea ranged from 20%–53% (1, 3, 612). Analysis of the ExteNET trial showed neratinib-associated diarrhea had a distinct and predictable clinical course, with 28.6% of patients having grade 3 events during the first month, then decreasing to ≤ 6% after month 3. Grade 3 events are generally short-lived and occur within the first month of treatment, allowing targeted preventive management with antidiarrheal prophylaxis early in the treatment course (13).

Older adults with breast cancer undergoing therapy with neratinib are particularly vulnerable to severe diarrhea due to potential changes in absorption, drug metabolism and distribution with increased age. An investigator-initiated clinical trial was designed to evaluate the safety and tolerability of neratinib in adults 60 and older with metastatic HER2+ breast cancer (NCT02673398). A total of 25 patients were enrolled and here we report gut microbiome analysis using 16S rRNA gene sequencing of longitudinally collected stool specimen in 11 patients.

The human gut contains a dense microbiome ecosystem that is essential in maintaining a healthy host physiology, and disruption of this ecosystem has been linked with increased risk of toxicities from systemic cancer therapy (1419). The bacteriomic profile of the gut microbiome can be an indicator of general health and disease such as inflammation, digestive inefficiencies, and the presence of pathogens. The advent of next-generation sequencing technologies such as 16S rRNA gene or metagenome sequencing have enabled characterization of the gut microbiome architecture in an affordable and culture-free approach (20).

The objective of this study was to understand the association of host gut microbiome and neratinib-induced diarrhea. Utilizing 16S rRNA gene sequencing data, a machine learning model was built for prediction of diarrhea from baseline intestinal microbiota data in breast cancer patients.

Materials and Methods

Patients

A phase II, single arm, open label study was conducted between 09/2015 and 12/2019 (NCT02673398). Eligibility criteria were histologically proven metastatic breast cancer; HER2+ defined by ASCO/CAP guideline; age ≥ 60; Eastern Cooperative Oncology Group (ECOG) performance status (PS) 0–2. Patients were started with a neratinib dose of 240 mg oral daily in a 28-day cycle. Diarrhea prophylaxis with loperamide was mandatory during the first cycle of treatment and was used as needed beyond the first cycle. Adverse events (AEs) were assessed by NCI Common Terminology Criteria for Adverse Events (CTCAE) 4.0. This study was approved by City of Hope’s regulatory and ethics committees and was conducted in accordance with the Declaration of Helsinki and the principles of Good Clinical Practice. All participants provided written informed consent.

Sample Collection

Stool samples were collected at baseline, Cycle 1 Day 15, Cycle 2 Day 1, Cycle 2 Day 15, and Cycle 3 Day 1. Fifty samples from 11 patients were collected using Zymo DNA/RNA Shield Fecal Collection tubes. For the machine learning model, stool samples and patient adverse event of diarrhea at any time over the course of treatment were analyzed.

Microbiome Analysis

DNA was extracted using the MagMax Microbiome Ultra Extraction Kit (AA2358, Thermo-Fisher, Waltham, MA), with prior bead beating on a TissueLyser (Qiagen), using the KingFisher Magnetic Extraction Instrument (ThermoFisher). DNAs were quantitated by the BactQuant assay (21). 16S rRNA libraries were created as described by Kozich et al. (22), using modified primers described by Walters et al. (23), that amplify the 16s rRNA variable region 4 (V4) of the rRNA gene of bacteria and archaea. More than 16,000 reads per sample were produced. Reads were quality filtered, trimmed and chimeras removed then taxonomically classified using the Silva database using QIIME2 (24, 25). The relative abundance of 26 taxa present at ≥ 5% in any one sample within this cohort was reported to the genus level or above. Shannon’s diversity index was also calculated to examine gut microbiome dysbiosis.

Explainable Tree-Based Predictive Modeling

To classify which patients would have neratinib-related diarrhea, microbial relative abundance data from the 11 patients (50 specimens) were utilized for predictive modeling. Input features were fed into machine learning models to classify which patients would have diarrhea. A non-parametric and non-linear gradient-boosted tree approach [xgboost package (26)] was utilized for classification. For model performance assessment, a leave-one-patient-out approach was chosen. In each iteration, input data from one patient was held out while data from the rest of the patients were used for training a model. Default set of regularization and hyperparameters were used to fit the model. The fitted model was then applied only on the pre-treatment baseline data from the hold-out patient. Receiver Operating Characteristic (ROC) and Precision-Recall Curves (PRC) were used for model assessment. Subsequently, the entire dataset was used to fit a final model. A tree-explainer was used to compute local explanations [probabilistic SHAP value, shap package (27, 28)] based on the associated exact shapley values generated for each feature from individual patient’s data. The overall feature importance was obtained by calculating the mean absolute SHAP values of individual features. Post-hoc analyses were then performed on the most important features using Kruskal-Wallis tests to assess the differences of the most important microbiota relative abundance between patients without and without neratinib-induced diarrhea. Python 3.7.6, scikit-learn 0.22.1, xgboost 1.0.1, shap 0.35.0, and scipy 1.4.1 were used for machine learning modeling and statistical analysis.

Results

Patients

A total of 11/25 patients who were accrued between December 2016 and March 2019 provided 50 longitudinal stool samples. Patient and disease characteristics are summarized in Supplementary Table 1 . The median age was 66 years old (60–78). Sixty-four percent were Caucasian and 36% were Asian, with 27% of Hispanic ethnicity. Eighty-two percent were hormone receptor positive HER2+ metastatic breast cancer. Seventy-three percent developed grade ≥ 1 diarrhea attributed to neratinib over the course of treatment.

Microbiome Analysis

16S rRNA gene analysis was performed and the relative abundance of 26 taxa present at ≥ 5% in any one sample within this cohort was reported to the genus level or higher as classified by QIIME2 (25) ( Figure 1A ). Bacteroides (green) is a common anaerobic gut inhabitant that comprises a large proportion of the human gut microbiome, is responsible for fermentation of long chain carbohydrates, produces butyrate, and dominates in populations who eat a Western diet. Bacteroides sp. were present in most patients in this study (9/11, 82%), and were predominant at baseline and throughout treatment. Prevotella 9 (pink/violet) is a member of the same family as Bacteroides, but is more prevalent in non-Westernized populations. Two out of 11 (18%) patients had Prevotella during treatment. Akkermansia (orange) is associated with a healthy gut, and has anti-inflammatory effects. Five out of 11 (45%) patients had Akkermansia during treatment.

Figure 1.

Figure 1

16S rRNA gene sequencing analysis. (A) Relative abundance of top 26 taxa by patient and cycle; (B) Shannon’s alpha diversity by patient and cycle.

Shannon’s alpha diversity (29) measures the evenness in the fecal communities. Shannon diversity was calculated to examine gut microbiome dysbiosis ( Figure 1B ). A normal healthy gut microbiome usually has a Shannon diversity index of 3.5 or greater. Several samples fell below this threshold; however, no clear association of Shannon alpha diversity index and diarrhea was identified.

Explainable Prediction Model for Diarrhea Due to Neratinib

The cohort for machine learning modeling included patients 1, 2, 5, 7–11 with at least one occurrence of diarrhea (n = 8), and patients 3, 4, 6 with no diarrhea (n = 3). The Area Under ROC (AUROC) and Area Under PRC (AUPRC) of the classification model were 0.88 and 0.95, respectively ( Figures 2A, B ).

Figure 2.

Figure 2

Model assessment. (A) Area Under Receiver Operating Characteristic (ROC) Curve; (B) Area Under Precision-Recall Curve (PRC).

The bar plot of mean absolute SHAP values of individual features associated with the final optimal model suggests Ruminiclostridium 9 and Bacteroides sp. HPS0048 are the two most impactful features overall for predicting risk for treatment-related diarrhea ( Figure 3A ). A beeswarm plot (27) shows each row corresponding to one feature and each dot corresponding to one patient in the full dataset ( Figure 3B ). The color of each dot corresponds to normalized feature value (qualitative: blue for low values, red for high values) whereas its position along the x-axis depicts the SHAP value describing the impact on the model prediction. Using the beeswarm plot, the directionality of the relation between individual features and outcome can be observed, along with the magnitude of the relation. Specifically, patients with larger relative abundance of Ruminiclostridium 9 and Bacteroides sp. HPS0048 may have reduced risk of treatment-related diarrhea. A heatmap displays the relative abundance of Ruminiclostridium 9 and Bacteroides sp. HPS0048 in log10 scale ( Figure 4 ). Using Kruskal-Wallis tests, statistically significant differences between patients with and without diarrhea for Ruminiclostridium 9 (p = 0.04, uncorrected) and Bacteroides sp. HPS0048 (p = 0.05, uncorrected) were identified. No statistically significant difference was observed for the next three important features (D_5__Subdoligranulum: p = 0.21; D_4__Lachnospiraceae_d: p = 0.21, D_5__Tyzzerella 4: p = 0.24, uncorrected).

Figure 3.

Figure 3

Feature importance and local explanation of final model. (A) Bar plot of mean absolute SHAP values of individual features; and (B) Beeswarm plot showing feature values and impact on the model prediction.

Figure 4.

Figure 4

Heatmap showing differences in microbiota relative abundance between patients with and without neratinib-induced diarrhea (in log10 scale). *Kruskal-Wallis test with p ≤ 0.05 uncorrected.

Discussion

Using the relative abundance of microbiome taxa data, we have developed an explainable tree-based predictive model to estimate the risk of diarrhea associated with neratinib treatment in patients ≥ 60 with HER2+ metastatic breast cancer. Utilizing a nested cross-validation approach, we achieved a promising 0.88 AUROC and 0.95 AUPRC for model performance. We also identified the most important features that could predict diarrhea and the risk with the associated directionality: patients with higher relative abundance of Ruminiclostridium 9 and Bacteroides sp. HPS0048 may have a reduced risk of treatment-related diarrhea. The findings were confirmed by Kruskal-Wallis tests (p ≤ 0.05, uncorrected). Both species produce short chain fatty acids (SCFAs) including acetate, propionate, and butyrate, which are important metabolites for maintaining intestinal homeostasis. In addition, butyrate also has important immunomodulatory functions and may activate signaling cascades that control immune functions (30).

Multiple studies have shown that the gut microbiota has the potential to influence the efficacy of cancer therapy (31, 32). There are several potential mechanisms of action for TKI induced diarrhea, including direct target inhibition, chloride secretion, intestinal inflammation, and a dysbiotic microbiome (31). Both preclinical and clinical studies have demonstrated decreased total bacterial abundance and diversity, as well as decreases in commensals such as Lactobacillus and Bifidobacteria, with increases in Bacteroidetes and Escherichia coli contributing to TKI-induced diarrhea (32). The translocation, immunomodulation, metabolism, enzymatic degradation, reduced diversity (TIMER) model proposed by Alexander and colleagues has outlined how the functions of the microbiome may have a central role in determining the extent and intensity of diarrhea induced by systemic therapy (33). Heshiki et al. investigated the contribution of the intestinal microbiome on treatment outcomes in a heterogeneous cohort that included multiple cancer types to identify microbes with a global impact on immune response (34). This human gut metagenomic analysis revealed that responders had significantly higher microbial diversity and different microbiota compositions compared to non-responders, and species such as Bacteroides ovatus and Bacteroides xylanisolvens were positively correlated with treatment outcomes. In the current study, no clear association of Shannon alpha diversity index and diarrhea was identified. This likely reflects the relatively small sample size.

Machine learning and deep learning models are being widely used for precision oncology research (35). Although the predictive models often have impressive predictive performance, they are typically difficult to explain. Explaining predictive models is one of the key factors driving their use in a clinical setting (36, 37). To correlate model prediction accuracy and explainability, various approaches have been proposed to generate intuitive interpretations on predictive models (27, 28, 38, 39). Using a concept in game theory, Lungberg et al. proposed a unified framework called SHAP (SHapley Additive exPlanations) that works for essentially all predictive models including tree-based and deep learning models (38). In SHAP, each feature is assigned an importance value (SHAP value) and the addition of all SHAP values leads to the actual prediction. A model prediction can be decomposed into a unique set of SHAP values associated with the features, allowing clinicians and scientists to utilize contributions of individual feature SHAP values and their interactions to draw insight from the dataset.

The current pilot study is limited by its small sample size. Longitudinal samples were collected from 11 patients with 50 specimens. To reduce potential bias due to training a predictive model with limited sample size, we have adopted a robust train-test split approach where the test dataset in each leave-one-patient-out loop was kept entirely independent of the train dataset (40). Increased cohort size and an external dataset would boost the utility of this model. Nevertheless, the high model performance (AUROC and AUPRC) with meaningful feature explanation offers a foundation for future studies. Our study also contributes to the new field of gut microbiome in oncology by providing a novel predictive model associating host gut microbiota with treatment-induced diarrhea. TKIs have been broadly used in current oncology practice and diarrhea is a common and challenging side effect. Modulation of the microbiota may support cancer therapy and improve patient’s quality of life in the future (34).

Conclusion

Our machine learning model identified microbiota associated with reduced risk of neratinib-induced diarrhea and the result from this pilot study will be further verified in a larger study.

Data Availability Statement

The original contributions presented in the study are publicly available. This data can be found here: https://github.com/cwwong-alec/Analysis-of-Gut-Microbiome-Using-xML-Predicts-Risk-of-Diarrhea-Associated-With-TKI-Neratinib

Ethics Statement

The studies involving human participants were reviewed and approved by City of Hope’s IRB-approved protocol. All procedures were carried out in accordance with The Code of Ethics of the World Medical Association (Declaration of Helsinki) and Recommendations for the Conduct, Reporting, Editing, and Publication of Scholarly Work in Medical Journals. Privacy rights were observed and written informed consent was obtained from all participants of this study under IRB15342 and ClinicalTrials.gov NCT02673398. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

CW and YY made substantial contributions to conception and design, analysis, and interpretation of data, drafting and revising the manuscript, and final approval for publication. JL and SY contributed to database management, chart review, data analysis, manuscript preparation, and revision. JG, MF, LR, and SH performed the 16S rRNA gene sequencing and analysis. ZE, JM, and SH revised the manuscript and provided final approval for publication. YY is the guarantor and agrees to be accountable for all aspects of the work. All authors contributed to the article and approved the submitted version.

Funding

Puma sponsored the trial and provided neratinib. This study was supported by COH Pathology Research Services Core and Biostatistics and Mathematical Modeling Core (National Cancer Institute of the National Institutes of Health under award number P30CA033572). The authors also thank the NIH R03 AG050931-02(PI YY) STOP Cancer Foundation (PI YY), NIH K-12 Career Development Award (K12CA001727, PI JM), and NIH 1R01CA206911-01A1 (PI Emily Wang). Puma was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Conflict of Interest

YY has contracted research sponsored by Merck, Eisai, Novartis, Puma, Genentech, Celgene, and Pfizer; is a consultant for Puma, Pfizer, Immunomedics, and is on the Speakers Bureau for Eisai, Novartis, Genentech, AstraZeneca, Daiichi Sankyo, Pfizer, Merck and Immunomedics.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors gratefully acknowledge Puma Biotechnology Inc. for sponsoring the clinical trial. Research reported in this publication includes work supported by the National Cancer Institute (NCI). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2021.604584/full#supplementary-material

References

  • 1. Saura C, Oliveira M, Feng YH, Dai MS, Chen SW, Hurvitz SA, et al. Neratinib Plus Capecitabine Versus Lapatinib Plus Capecitabine in HER2-Positive Metastatic Breast Cancer Previously Treated With >/= 2 HER2-Directed Regimens: Phase III NALA Trial. J Clin Oncol (2020) 38(27):3138–49.   10.1200/JCO.20.00147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Singh H, Walker AJ, Amiri-Kordestani L, Cheng J, Tang SH, Balcazar P, et al. US Food and Drug Administration Approval: Neratinib for the Extended Adjuvant Treatment of Early-Stage HER2-Positive Breast Cancer. Clin Cancer Res (2018) 24(15):3486–91.   10.1158/1078-0432.Ccr-17-3628 [DOI] [PubMed] [Google Scholar]
  • 3. Chan A, Delaloge S, Holmes FA, Moy B, Iwata H, Harvey VJ, et al. Neratinib after trastuzumab-based adjuvant therapy in patients with HER2-positive breast cancer (ExteNET): a multicentre, randomised, double-blind, placebo- controlled, phase 3 trial. Lancet Oncol (2016) 17(3):367–77.   10.1016/s1470-2045(15)00551-3 [DOI] [PubMed] [Google Scholar]
  • 4. Rugo HS, Di Palma JA, Tripathy D, Bryce R, Moran S, Olek E, et al. The characterization, management, and future considerations for ErbB-family TKI-associated diarrhea. Breast Cancer Res Treat (2019) 175(1):5–15.   10.1007/s10549-018-05102-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Barcenas CH, Hurvitz SA, Di Palma JA, Bose R, Chien AJ, Iannotti N, et al. Improved tolerability of neratinib in patients with HER2-positive early-stage breast cancer: the CONTROL trial. Ann Oncol (2020) 31(9):1223–30.   10.1016/j.annonc.2020.05.012 [DOI] [PubMed] [Google Scholar]
  • 6. Meric-Bernstam F, Johnson AM, Dumbrava EEI, Raghav K, Balaji K, Bhatt M, et al. Advances in HER2-Targeted Therapy: Novel Agents and Opportunities Beyond Breast and Gastric Cancer. Clin Cancer Res (2019) 25(7):2033–41.   10.1158/1078-0432.Ccr-18-2275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Freedman RA, Gelman RS, Anders CK, Melisko ME, Parsons HA, Cropp AM, et al. TBCRC 022: A Phase II Trial of Neratinib and Capecitabine for Patients With Human Epidermal Growth Factor Receptor 2-Positive Breast Cancer and Brain Metastases. J Clin Oncol (2019) 37(13):1081–9.   10.1200/jco.18.01511 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Murthy RK, Loi S, Okines A, Paplomata E, Hamilton E, Hurvitz SA, et al. Tucatinib, Trastuzumab, and Capecitabine for HER2-Positive Metastatic Breast Cancer. N Engl J Med (2020) 382(7):597–609.   10.1056/NEJMoa1914609 [DOI] [PubMed] [Google Scholar]
  • 9. Rabindran SK, Discafani CM, Rosfjord EC, Baxter M, Floyd MB, Golas J, et al. Antitumor activity of HKI-272, an orally active, irreversible inhibitor of the HER-2 tyrosine kinase. Cancer Res (2004) 64(11):3958–65.   10.1158/0008-5472.Can-03-2868 [DOI] [PubMed] [Google Scholar]
  • 10. Tiwari SR, Mishra P, Abraham J. Neratinib, A Novel HER2-Targeted Tyrosine Kinase Inhibitor. Clin Breast Cancer (2016) 16(5):344–8.   10.1016/j.clbc.2016.05.016 [DOI] [PubMed] [Google Scholar]
  • 11. Connell CM, Doherty GJ. Activating HER2 mutations as emerging targets in multiple solid cancers 2017. ESMO Open 2(5):e000279.   10.1136/esmoopen-2017-000279 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Ben-Baruch NE, Bose R, Kavuri SM, Ma CX, Ellis MJ. HER2-Mutated Breast Cancer Responds to Treatment With Single-Agent Neratinib, a Second-Generation HER2/EGFR Tyrosine Kinase Inhibitor. J Natl Compr Cancer Network (2015) 13(9):1061–4.   10.6004/jnccn.2015.0131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Mortimer J, Di Palma J, Schmid K, Ye Y, Jahanzeb M. Patterns of occurrence and implications of neratinib-associated diarrhea in patients with HER2-positive breast cancer: analyses from the randomized phase III ExteNET trial. Breast Cancer Res (2019) 21(1):32.   10.1186/s13058-019-1112-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Aarnoutse R, Ziemons J, Penders J, Rensen SS, de Vos-Geelen J, Smidt ML. The Clinical Link between Human Intestinal Microbiota and Systemic Cancer Therapy. Int J Mol Sci (2019) 20(17).   10.3390/ijms20174145 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Khan MAW, Ologun G, Arora R, McQuade JL, Wargo JA. Gut Microbiome Modulates Response to Cancer Immunotherapy. Dig Dis Sci (2020) 65(3):885–96.   10.1007/s10620-020-06111-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Cheng Y, Ling Z, Li L. The Intestinal Microbiota and Colorectal Cancer. Front Immunol (2020) 11:615056:615056.   10.3389/fimmu.2020.615056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Helmink BA, Khan MAW, Hermann A, Gopalakrishnan V, Wargo JA. The microbiome, cancer, and cancer therapy. Nat Med (2019) 25(3):377–88.   10.1038/s41591-019-0377-7 [DOI] [PubMed] [Google Scholar]
  • 18. Elinav E, Garrett WS, Trinchieri G, Wargo J. The cancer microbiome. Nat Rev Cancer (2019) 19(7):371–6.   10.1038/s41568-019-0155-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Schwabe RF, Jobin C. The microbiome and cancer. Nat Rev Cancer (2013) 13(11):800–12.   10.1038/nrc3610 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Osman MA, Neoh HM, Ab Mutalib NS, Chin SF, Jamal R. 16S rRNA Gene Sequencing for Deciphering the Colorectal Cancer Gut Microbiome: Current Protocols and Workflows. Front Microbiol (2018) 9:767.   10.3389/fmicb.2018.00767 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Liu CM, Aziz M, Kachur S, Hsueh PR, Huang YT, Keim P, et al. BactQuant: an enhanced broad-coverage bacterial quantitative real-time PCR assay. BMC Microbiol (2012) 12:56.   10.1186/1471-2180-12-56 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl Environ Microbiol (2013) 79(17):5112–20.   10.1128/AEM.01043-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Walters W, Hyde ER, Berg-Lyons D, Ackermann G, Humphrey G, Parada A, et al. Improved bacterial 16S rRNA gene (V4 and V4-5) and fungal internal transcribed spacer marker gene primers for microbial community surveys. mSystems (2016) 1(1).   10.1128/mSystems.00009-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods (2010) 7(5):335–6.   10.1038/nmeth.f.303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol (2019) 37(8):852–7.   10.1038/s41587-019-0209-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. (2016). pp. 785–94.   10.1145/2939672.2939785 [DOI] [Google Scholar]
  • 27. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell (2020) 2(1):56–67.   10.1038/s42256-019-0138-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat BioMed Eng (2018) 2(10):749–60.   10.1038/s41551-018-0304-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Shannon CE. The mathematical theory of communication. 1963. MD Comput (1997) 14(4):306–17. [PubMed] [Google Scholar]
  • 30. Parada Venegas D, De la Fuente MK, Landskron G, González MJ, Quera R, Dijkstra G, et al. Short Chain Fatty Acids (SCFAs)-Mediated Gut Epithelial and Immune Regulation and Its Relevance for Inflammatory Bowel Diseases. Front Immunol (2019) 10:277.   10.3389/fimmu.2019.00277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Secombe KR, Van Sebille YZA, Mayo BJ, Coller JK, Gibson RJ, Bowen JM. Diarrhea Induced by Small Molecule Tyrosine Kinase Inhibitors Compared With Chemotherapy: Potential Role of the Microbiome. Integr Cancer Ther (2020) 19:1534735420928493.   10.1177/1534735420928493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Ma W, Mao Q, Xia W, Dong G, Yu C, Jiang F. Gut Microbiota Shapes the Efficiency of Cancer Therapy. Front Microbiol (2019) 10:1050.   10.3389/fmicb.2019.01050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Alexander JL, Wilson ID, Teare J, Marchesi JR, Nicholson JK, Kinross JM. Gut microbiota modulation of chemotherapy efficacy and toxicity. Nat Rev Gastroenterol Hepatol (2017) 14(6):356–65.   10.1038/nrgastro.2017.20 [DOI] [PubMed] [Google Scholar]
  • 34. Heshiki Y, Vazquez-Uribe R, Li J, Ni Y, Quainoo S, Imamovic L, et al. Predictable modulation of cancer treatment outcomes by the gut microbiota. Microbiome (2020) 8(1):28.   10.1186/s40168-020-00811-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Azuaje F. Artificial intelligence for precision oncology: beyond patient stratification. NPJ Precis Oncol (2019) 3:6.   10.1038/s41698-019-0078-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Char DS, Shah NH, Magnus D. Implementing Machine Learning in Health Care - Addressing Ethical Challenges. N Engl J Med (2018) 378(11):981–3.   10.1056/NEJMp1714229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell (2019) 1(5):206–15.   10.1038/s42256-019-0048-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. In: . Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, California, USA: Curran Associates Inc. (2017). [Google Scholar]
  • 39. Bach S, Binder A, Montavon G, Klauschen F, Muller KR, Samek W. On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation. PloS One (2015) 10(7):e0130140.   10.1371/journal.pone.0130140 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Vabalas A, Gowen E, Poliakoff E, Casson AJ. Machine learning algorithm validation with a limited sample size. PloS One (2019) 14(11):e0224365.   10.1371/journal.pone.0224365 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The original contributions presented in the study are publicly available. This data can be found here: https://github.com/cwwong-alec/Analysis-of-Gut-Microbiome-Using-xML-Predicts-Risk-of-Diarrhea-Associated-With-TKI-Neratinib


Articles from Frontiers in Oncology are provided here courtesy of Frontiers Media SA

RESOURCES