Machine learning models to predict success of endoscopic sleeve gastroplasty using total and excess weight loss percent achievement: a multicentre study

Maria Vannucci; Patrick Niyishaka; Toby Collins; María Rita Rodríguez-Luna; Pietro Mascagni; Alexandre Hostettler; Jacques Marescaux; Silvana Perretta

doi:10.1007/s00464-023-10520-0

. 2023 Nov 16;38(1):229–239. doi: 10.1007/s00464-023-10520-0

Machine learning models to predict success of endoscopic sleeve gastroplasty using total and excess weight loss percent achievement: a multicentre study

Maria Vannucci ^1,^2,^9,^✉, Patrick Niyishaka ³, Toby Collins ^2,³, María Rita Rodríguez-Luna ^2,⁴, Pietro Mascagni ^5,⁶, Alexandre Hostettler ^2,³, Jacques Marescaux ^2,³, Silvana Perretta ^2,^7,⁸

PMCID: PMC10776503 PMID: 37973639

Abstract

Background

The large amount of heterogeneous data collected in surgical/endoscopic practice calls for data-driven approaches as machine learning (ML) models. The aim of this study was to develop ML models to predict endoscopic sleeve gastroplasty (ESG) efficacy at 12 months defined by total weight loss (TWL) % and excess weight loss (EWL) % achievement. Multicentre data were used to enhance generalizability: evaluate consistency among different center of ESG practice and assess reproducibility of the models and possible clinical application. Models were designed to be dynamic and integrate follow-up clinical data into more accurate predictions, possibly assisting management and decision-making.

Methods

ML models were developed using data of 404 ESG procedures performed at 12 centers across Europe. Collected data included clinical and demographic variables at the time of ESG and at follow-up. Multicentre/external and single center/internal and temporal validation were performed. Training and evaluation of the models were performed on Python’s scikit-learn library. Performance of models was quantified as receiver operator curve (ROC-AUC), sensitivity, specificity, and calibration plots.

Results

Multicenter external validation: ML models using preoperative data show poor performance. Best performances were reached by linear regression (LR) and support vector machine models for TWL% and EWL%, respectively, (ROC-AUC: TWL% 0.87, EWL% 0.86) with the addition of 6-month follow-up data.

Single-center internal validation: Preoperative data only ML models show suboptimal performance. Early, i.e., 3-month follow-up data addition lead to ROC-AUC of 0.79 (random forest classifiers model) and 0.81 (LR models) for TWL% and EWL% achievement prediction, respectively. Single-center temporal validation shows similar results.

Conclusions

Although preoperative data only may not be sufficient for accurate postoperative predictions, the ability of ML models to adapt and evolve with the patients changes could assist in providing an effective and personalized postoperative care. ML models predictive capacity improvement with follow-up data is encouraging and may become a valuable support in patient management and decision-making.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00464-023-10520-0.

Keywords: Bariatric endoscopy, Machine learning, Predictive model, Interventional endoscopy

Endoscopic sleeve gastroplasty (ESG) is a relatively new bariatric endoscopic procedure, which has proven to be safe and effective to reduce weight and obesity-related comorbidities [1–4]. ESG is scarless, repeatable, and reversible [5]. Thus, ESG has the potential to reach a larger proportion of obese patients compared to bariatric surgery [1], including those unfit for surgery, and those not undergoing surgery out of fear of invasive procedures [6]. Short- to mid-term results are promising [1, 7–9]. However, current ESG indications, as well as the endoscopic technique, have just started to be standardized and defined [10]. Although ESG predictors of success have been investigated in multiple studies [11, 12], the heterogeneity of data may represent an obstacle for thorough analysis with classical statistical methods. The increasing amount of diverse data gathered daily in endoscopic practice offers greater potential for standardized, data-driven approaches as machine learning (ML) [13]. ML models have been developed previously to predict complications and outcomes in other more invasive bariatric procedures [14], but never before for ESG. This study’s primary objective is to develop and evaluate ML models to predict ESG outcomes. The study is multicentric, featuring both temporal and external validation [15] to investigate generalizability of the models. Additionally, the proposed ML models can automatically adapt and evolve becoming more accurate with additional data during patient follow-up.

Methods

Dataset overview and outcome variables

Between 2016 and 2021, a database of 938 ESG procedures performed at 12 centers across Europe was established. Data were collected using a secure decentralized online registry by Medrio (Medrio inc. San Francisco, CA, USA). Longitudinal data at ESG follow-ups (months 1, 6, and 12) were collected retrospectively for procedures between 2016 and 2018, and prospectively after 2018. This study included all patients aged older than 18 undergoing ESG as the primary bariatric endoscopic/surgical procedure. Institutional Review Board (IRB) approval was obtained in each center and provided prior to accessing the online registry. Collected data shown in Table 1 of Supplements included demographic and clinical preoperative data such as blood tests and the presence of weight-related diseases. The outcome variables predicted by the ML models were total body weight loss (TWL) > 10% and excess body weight loss (EWL) > 25% at 12 months after ESG [16].

Table 1.

Cohort description

Validation (no of cases)	Dataset (no of cases)	Follow-up month	BMI	TWL%	EWL%	DIABETES	HYPERTENSION
Multicenter external validation (270)	Training (136)	M0	38.1 (STD 4.9)	NA	NA	NA	NA
		M6	32.8 (STD 4.4)	14.5 (STD 6.6)	45.6 (STD 24.5)	NA	NA
		M12	32.3 (STD 5.4)	15.0 (STD 9.5)	48 (STD 14.7)	NA	NA
	Testing also Single center internal validation dataset (134)	ESG	40.6 (STD 7.4)	NA	NA	34/134	49/134
		M6	36.2 (STD 7.3)	14.5 (STD 8.1)	38 (STD 22.3)	NA	NA
		M12	36.7 (STD 8.3)	13.8 (STD 9.8)	35.8 (STD 26.3)	24/134	39/134
Single-center temporal validation (134)	Training (81)	M0	41.5 (STD 8.4)	NA	NA	26/134	39/134
		M6	37.2 (STD 8.2)	14.3 (STD 7.5)	36.5 (STD 19.9)	NA	NA
		M12	38.1 (STD 9.3)	12.9 (STD 8.1)	32.9 (STD 21.2)	19/134	30/134
	Testing (53)	ESG	39.3 (STD 5.4)	NA	NA	8/134	17/134
		M6	34.5 (STD 5.2)	14.7 (STD 9.0)	40.4 (STD 25.5)	NA	NA
		M12	34.6 (STD 6.2)	15.0 (STD 12.0)	40.3 (STD 32.2)	5/134	9/134

Open in a new tab

Multicenter external validation ML model training and testing dataset including 270 cases of ESG performed at 12 centers across Europe. Single-center internal validation ML models including 134 cases of ESG performed at a single center (IHU-NHC Strasbourg, France)

Model training

ML models were trained to predict outcomes using multicenter validation i.e., external validation, and also single-center validation i.e., internal and temporal validation.

Four ML models were compared in this study: linear regression (LR), K-nearest neighbors (KNN), support vector machine (SVM), and random forest classifiers (RFC). They were selected based on their successful application in related tasks such as abdominal surgery outcome prediction [17]. All models were trained first using only preoperative clinical features, and subsequently trained with increasing amounts of follow-up variables (at 3-, 6-, and 9-month follow-up time points). This was performed to test our hypothesis that additional follow-up data, available during the routine clinical course, provided greater accuracy in ML prediction.

Models were trained and evaluated using Python’s scikit-learn library (version 1.0.2). Default model parameters were used for KNN (K = 5, weights = uniform, metric = minkowski), SVM (kernel = rbf, degree = 3), LR (solver = lbfgs), and RFC (n_estimators = 100, criterion = gini). Each model was trained to predict whether the procedure would be successful at 12-month follow-up (a binary output), including confidence estimates where applicable (SVM and LR). Success was defined with respect to TWL (≥ 10%) and EWL (≥ 25%), since both TWL and EWL were relevant clinical outcomes in ESG [18, 19]. Consequently, each model was trained twice; first to predict TWL success, and secondly to predict EWL success.

Model performance was quantified using the following established metrics: sensitivity, specificity, area under the receiver operator curve (ROC-AUC), and calibration plots.

Model validation

Multicenter validation

In the multicenter validation, 136 of 638 cases from centers 2 to 12 were used to train ML models. The models were then tested on 134 of 255 cases from center 1. No patient information from the center 1 was used to train the ML model in this validation. Consequently, this validation measured the performance of the ML models using patient data from a center completely unknown to the model during its development (also known as an external validation).

Single-center validation

In addition to the external validation, ML models were developed and tested using just the data collected in the 1st center. The motivation was twofold: first, to investigate performance when the ML was specifically adapted to data from a single center, which may result in a more accurate model, albeit one that could only be used at a specific center, and secondly, to investigate if the additional variables available in the 1st center resulted in better performance. Two forms of validation were performed in the single-center validation. Validation was conducted using RFE and tenfold cross-validation (known as an internal validation). To preserve the same class ratio in all folds, stratified tenfold cross-validation was used in the following manner. First, patients from center 1 were randomly assigned to 10 disjoint sets (‘folds’). The first fold was then taken, and data from patients in all other folds were used to train the ML models. The trained models were then tested in patients in the first fold. The process was repeated 10 times until all folds were used to test the models. Finally, model performance across all 10-folds was averaged and reported. A temporal validation was performed as follows. First, the procedures were sorted in time, and then data from the 81 earliest procedures (60% of patients) were used to train the ML models. Finally, data from the remaining 53 procedures were used to test model performance. Consequently, the temporal validation assessed the ability of the ML models to predict outcomes of future patients from the same center.

Data preparation

Clinical variables (‘model features’) that were not relevant for outcome prediction were automatically detected and removed using Recursive Feature Elimination (RFE) [20]. This established technique initially considered all features as independent variables, then features with the least impact on model predictions were subsequently detected and removed (pruned) until a target number of features remained. The target number was determined automatically using cross-validation, implemented in software using the sklearn.feature_selection.RFE (version 1.0.2) Python package. For multicentre model development, only 136 of 683 cases collected in center 2 to 12 had outcome variables, i.e., 12-month follow-up data, and could be included in training and testing of the models. Single center models as well as temporal validation models were trained and tested on 134 of 255 cases collected in center 1 as a result of missing outcome data. Missing feature values, e.g., unreported weight at follow-up, were handled using a combination of feature rejection and imputation. Specifically, all features with fewer than 50% recorded values, representing highly unreported features, were automatically removed. Of the remaining features, missing records were automatically imputed [21] using the variable’s mean (continuous variables) and mode (categorical variables).

Results

Cohort statistics

Descriptive statistics for the study sample are presented in Table 1. Variables collected in the 1st center and the remaining centers are listed in Table 1 of Supplements.

Multicenter external validation

In the training dataset (136 patients from centers 2 to 12), mean age was 48.9 (SD 9.8) years, and 77.9% of patients were women. Total weight loss > 10% and EWL > 25% at 12 months were achieved in 68.4% and 76.5% of cases, respectively. In the test dataset (134 patients from center 1), mean age was 45.5 (SD 12.7) years, and 62.7% were women. Total weight loss > 10% and EWL > 25% at 12 months were achieved in 62.7% and 60.4% of patients, respectively. Clinical features that were selected automatically with RFE for predicting TWL and EWL success are shown in Fig. 1a.

Fig. 1 — Recursive feature elimination (RFE) [32] was used combined with cross-validation (CV) for automatic relevant feature selection. To reduce overfitting, CV splits the dataset in subsets called k-folds, then machine learning models are iteratively trained on k-1 folds with the remaining fold serving as the test set. A Variables selected using recursive feature elimination (RFE) for prediction of TWL% and EWL% achievement in external validation models dataset. B Variables selected using recursive feature elimination (RFE) for prediction of TWL% and EWL% achievement in internal and temporal validation models dataset. *AHT* arterial hypertension, *BMI* body mass index, *choles* cholesterol, *ESG* endoscopic sleeve gastroplasty, *EWL* excess weight loss, FG fasting glucose, *GIQLI* Gastrointestinal Quality of Life Index, *LDL* low-density lipoprotein, *HDL* high-density lipoprotein, *trigly* triglycerides, *TWL* total weight loss

Predictive performances for the multicenter external validation are shown in Fig. 2a for TWL% and Fig. 2b for EWL% prediction, expressed as ROC-AUC curves, sensitivity, specificity, and calibration plots. Models are designed to perform an initial prediction based on preoperative data. The models were then updated during follow-up.

Single-center internal validation

Descriptive statistics are provided in Table 1. Clinical features that were selected automatically with RFE for predicting TWL and EWL success are shown in Fig. 1b. The ML model performances are shown in Fig. 3a using ROC-AUC for TWL% and EWL% and 3b showing models sensitivity, and specificity for TWL% and EWL% models.

Fig. 3 — Internal validation results. Performances of single-center internal validation ML models expressed as ROC-AUC, sensitivity and specificity, and the calibration plot. The bar charts show the mean performance of the ML models, grouped into three time points: ‘M0’ represents performance of each ML model using only preoperative variables as inputs, ‘M0 M3’ represent performance using preoperative and 3-month follow-up variables. ‘M0 M3 M6’ represent performance using preoperative, 3-month, and 6-month follow-up variables. Error bars represent 1 standard deviation/standard error. A ROC-AUC for TWL > %10 and EWL > 25% at 12 months ML models. B Sensitivity and specificity of TWL > %10 and EWL > 25% at 12 months ML models. *EWL* excess weight loss, *KNN* K-nearest neighbour, LR linear regression, M months, *RFC* random forest classifier, *SVM* support vector machine, *TWL* total weight loss

Single-center temporal validation

In the training dataset, 61.7% and 59.3% of cases reached TWL > 10% and EWL > 25% at 12 months, respectively. In the testing dataset, 64.2% and 62.3% of cases reached TWL > 10% and EWL > 25% at 12 months, respectively. Mean age was 47.8 (SD 11.8) and 42.1(SD 13.3) years in the training dataset (81 of 134 patients) and test dataset (53 of 134 patients), respectively. In the training dataset, 70.4% were women and 29.6% were men. In the test dataset, 75.5% were women and 24.5% were men.

In this validation, the ML models were trained only once as described above (model validation—single-center validation). The results are shown in Fig. 4a using ROC-AUC for TWL% and EWL% models, and 4b showing sensitivity, and specificity metrics for TWL% and EWL% models, respectively.

Fig. 4 — Temporal validation results. Performances of single-center temporal validation ML models expressed as ROC-AUC, sensitivity and specificity, and the calibration plot. The bar charts show the mean performance of the ML models, grouped into three time points: ‘M0’ represents performance of each ML model using only preoperative variables as inputs, ‘M0 M3’ represent performance using preoperative and 3-month follow-up variables. ‘M0 M3 M6’ represent performance using preoperative, 3-month, and 6-month follow-up variables. Error bars represent 1 standard deviation/standard error. A ROC-AUC for TWL > %10 and EWL > 25% at 12 months ML models. B Sensitivity and specificity of TWL > %10 and EWL > 25% at 12 months ML models. *EWL* excess weight loss, *KNN* K-nearest neighbor; LR linear regression, M months, *RFC* random forest classifier, *SVM* support vector machine; *TWL* total weight loss

Overall results

Multicenter external validation—predictive performances of the ML models using only preoperative data were poor, with the highest predictive capacity at 0.6 ROC-AUC in LR models. Predictive capacity increased when follow-up data were added, but better performance was achieved only when month 6 data were added to the ML models. This highlights the importance of short-term follow-up, which should guide clinical decision-making.

Single-center internal validation—ML models using preoperative data only show suboptimal ROC-AUC for both TWL% and EWL% achievement prediction. The addition of early, i.e., 3-month follow-up data, however, enables better predictive performance with ROC-AUC of 0.79 and 0.8 for TWL% and EWL% prediction, respectively. Adding data from 6-month follow-up further increases predictive capacity to ROC-AUC of 0.85 and 0.88 for TWL% and EWL% outcome prediction.

Single-center temporal validation—the temporally validated ML models show similar results to those internally validated, with suboptimal predictive capacity using only preoperative data, and better performance (ROC-AUC up to 0.8) with the addition of early, i.e., 3-month follow-up data.

Discussion

This study aimed at developing and validating ML models to predict successful ESG outcomes across different centers. The exact place of ESG in the management of obesity remains somewhat uncertain. While various studies have been undertaken to examine patient selection for ESG [22] and its postoperative management, there is currently no standardized framework for defining success in terms of weight loss or assessing ESG impact on comorbidities and quality of life. In addition, there is still a lack of standardization of the ESG technique which results in a considerable variability in weight loss outcomes as indicated by a substantial standard deviation.

Patient data in this context are heterogeneous and continually evolving, emphasizing the need for data-driven approaches such as machine learning (ML) models to address these complexities.

ML predictive models are increasingly being developed and used in the medical field [23], showing good overall predictive outcomes [24], thanks to their ability to handle a large number of heterogeneous data and to identify patterns that would be missed by classic statistical models. Some studies have already presented models able to update predictions [25, 26], which is particularly crucial as patient characteristics regularly change and evolve. Models able to integrate preoperative, intraoperative, and postoperative information that accumulates throughout the patient pathway can provide objective data-driven support for clinical decision-making, as well as standardize and improve patient management. Some studies focused on identifying predictive factors of weight loss and comorbidity resolution after bariatric procedures [27–30], highlighting the interest in better patient stratification and care planning. Consistent follow-up by a multidisciplinary team and the patient’s compliance with therapeutic indications [11, 12, 31, 31, 32] have shown to be success predictors of ESG. Indeed, variables such as compliance with surgical, nutritional, and psychological follow-up visits and performance of physical therapy were collected in a subset of cases in center 1, and models developed on these data show good predictive performance with preoperative data (LR models with ROC AUC of 0.77 and 0.76 for TWL% and EWL% achievement at 12 months, respectively). However, these predictors are not easily measured quantitatively, making their role in clinical modeling impractical with standard approaches. In this study, preoperative and follow-up clinical variables easily collected in routine practice have been exploited using ML. The external validation of ML models exhibited poor results overall. That is, ML models trained on multicenter datasets and tested on data from an independent center could not be used to reliably predict outcomes at that center. This could be due to missing data and variations in procedure technique. Other possible influencing factors include population differences, ranging from cultural-behavioral to biological-genetic differences. Performances of ML models using temporal validation reached acceptable performances, but only when postoperative follow-up data were introduced. These findings indicated that ML models using only preoperative data may not be viably deployed in routine practice. The results are in line with prior studies addressing different surgical procedures, where the predictive performance of ML models, trained using only preoperative data, has been poor in general [23]. Preoperative data only may not be sufficient for accurate postoperative predictions. This could be understood by the fact that technical aspects of the intervention itself may influence the outcome, and that early to mid-term postoperative physiological changes in patients can lead to different outcomes. This study shows that ML predictive capacity improves by the integration of follow-up data, with a potential role in early postoperative patient management. ML models using 3-month follow-up showed strong results (AUC-ROC > 0.79 for TWL achievement at 12 months in the RFC models internal validation dataset), which is considered outstanding performance [33]. The internally and temporally validated ML models had higher predictive capacity than externally validated models, as was expected due to the higher similarity between training and testing samples. Internal validation normally represents an upper performance bound, so it clearly showed that accurate prediction of ESG outcome success using only preoperative data is likely impossible with ML. Among the developed ML models (external validation dataset), LR models show the best predictive performance, showing no advantage in more complex ML modeling processes, possibly facilitating their application. ESG indications and patient postoperative management include potential for resuturing or bridge to surgery. Although the safety and feasibility of performing laparoscopic sleeve gastrectomy or gastric bypass after ESG have been described [34, 35], the best timing as well as setting specific weight loss targets to transition to more extensive bariatric surgery have not yet been established. Predictive models could then help evaluate the best time to perform bariatric surgery after bridge ESG. Although primary ESG is effective, some patients may require revision procedures to augment weight loss [5]. Interestingly, information on resuturing after primary ESG may be integrated in the ML models to identify the best timing to perform the revision procedure. In addition, these cutting-edge ML predictive models could also be expanded to bariatric surgeries in which two-step procedures are proposed, such as single anastomosis duodenal switch (SADI-S) [36]. Our goal is to further enhance the ML model by integrating intraoperative video data and harnessing deep learning techniques. This will enable us to leverage the information embedded in videos, which can provide valuable insights into patients specific anatomy and variations in operator technique and skills.

Study limitations

This study has some limitations: the relatively small size of the dataset compared to similar ML models development studies [17] may limit model performance. In addition, the relatively small dataset associated with a high rate of success in terms of TWL and EWL achieved leads to a very low sensitivity of ML models. Missing data, as well as lack of univocal definitions of some variables, e.g., the presence of comorbidities, also contribute to this study’s limitations. Indeed, models to predict obesity-related comorbidities were attempted, but lack of data prevented from developing reliable models. Many of the listed limitations highlight the importance of multicenter active collaboration and follow-up process standardization.

Conclusion

To the best of our knowledge, this is the first study to develop ML models that predict the success of ESG with routinely collected clinical information. The ML model predictions are updated by the addition of follow-up data, adapting to changes in patients after ESG, improving outcome prediction and therapeutic strategies. In clinical practice, the use of common and easily collected variables should favor ML predictive models spread. Finally, the clinical usefulness of applying the ML models developed in this study should be investigated in impact phase trials.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 13 kb)^{(13.9KB, docx)}

Acknowledgements

We would like to thank the Medrio group for the technical support on the online registry, as well as doctors Heyman, Lopez-Nava, Formiga, Bulajic, Zorron, D’Alessandro, Manos, Esteban, Singhal, Graus, and Bansi for their contribution to the multicenter data registry. We would also like to thank Sarah Mitchel and Guy Temporal for their assistance in proofreading this manuscript.

Funding

Open access funding provided by Università degli Studi di Torino within the CRUI-CARE Agreement.

Declarations

Disclosures

M.D. Drs. Maria Vannucci, María Rita Rodríguez-Luna, Pietro Mascagni, Jacques Marescaux, and Silvana Perretta have no conflicts of interest or financial ties to disclose. P.h.D. Drs. Patrick Niyishaka, Toby Collins, and Alexandre Hostettler have no conflicts of interest or financial ties to disclose.

Footnotes

Success of ESG depends on various factors. This study aimed at identifying patients clinical and demographic factors influencing weight loss to develop ML models to predict ESG success and assist clinical decision making.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Lopez-Nava G, Galvão MP, Bautista-Castaño I, Fernandez-Corbelle JP, Trell M, Lopez N. Endoscopic sleeve gastroplasty for obesity treatment: two years of experience. ABCD Arq Bras Cir Dig São Paulo. 2017;30(1):18–20. doi: 10.1590/0102-6720201700010006. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Singh S, Hourneaux de Moura DT, Khan A, Bilal M, Ryan MB, Thompson CC. Safety and efficacy of endoscopic sleeve gastroplasty worldwide for treatment of obesity: a systematic review and meta-analysis. Surg Obes Relat Dis. 2020;16(2):340–351. doi: 10.1016/j.soard.2019.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Lopez-Nava G, Laster J, Negi A, Fook-Chong S, Bautista-Castaño I, Asokkumar R. Endoscopic sleeve gastroplasty (ESG) for morbid obesity: how effective is it? Surg Endosc. 2022;36(1):352–360. doi: 10.1007/s00464-021-08289-1. [DOI] [PubMed] [Google Scholar]
4.Dayyeh BKA, Bazerbachi F, Vargas EJ, et al. Endoscopic sleeve gastroplasty for treatment of class 1 and 2 obesity (MERIT): a prospective, multicentre, randomised trial. The Lancet. 2022;400(10350):441–451. doi: 10.1016/S0140-6736(22)01280-6. [DOI] [PubMed] [Google Scholar]
5.Boškoski I, Pontecorvi V, Gallo C, Bove V, Laterza L, Costamagna G. Redo endoscopic sleeve gastroplasty: technical aspects and short-term outcomes. Ther Adv Gastroenterol. 2020;2:89. doi: 10.1177/1756284819896179. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Chang SH, Stoll CRT, Song J, Varela JE, Eagon CJ, Colditz GA. The effectiveness and risks of bariatric surgery: an updated systematic review and meta-analysis, 2003–2012. JAMA Surg. 2014;149(3):275–287. doi: 10.1001/jamasurg.2013.3654. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Li P, Ma B, Gong S, Zhang X, Li W. Efficacy and safety of endoscopic sleeve gastroplasty for obesity patients: a meta-analysis. Surg Endosc. 2020;34(3):1253–1260. doi: 10.1007/s00464-019-06889-6. [DOI] [PubMed] [Google Scholar]
8.Abu Dayyeh BK, Acosta A, Camilleri M, et al. Endoscopic sleeve gastroplasty alters gastric physiology and induces loss of body weight in obese individuals. Clin Gastroenterol Hepatol. 2017;15(1):37–43.e1. doi: 10.1016/j.cgh.2015.12.030. [DOI] [PubMed] [Google Scholar]
9.Alqahtani A, Al-Darwish A, Mahmoud AE, Alqahtani YA, Elahmedi M. Short-term outcomes of endoscopic sleeve gastroplasty in 1000 consecutive patients. Gastrointest Endosc. 2019;89(6):1132–1138. doi: 10.1016/j.gie.2018.12.012. [DOI] [PubMed] [Google Scholar]
10.Neto MG, Silva LB, de Quadros LG, et al. Brazilian consensus on endoscopic sleeve gastroplasty. Obes Surg. 2021;31(1):70–78. doi: 10.1007/s11695-020-04915-4. [DOI] [PubMed] [Google Scholar]
11.Lopez-Nava G, Asokkumar R, Rull A, Corbelle F, Beltran L, Bautista I. Bariatric endoscopy procedure type or follow-up: what predicted success at 1 year in 962 obese patients? Endosc Int Open. 2019;7(12):E1691. doi: 10.1055/a-1007-1769. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Pizzicannella M, Lapergola A, Fiorillo C, et al. Does endoscopic sleeve gastroplasty stand the test of time? Objective assessment of endoscopic ESG appearance and its relation to weight loss in a large group of consecutive patients. Surg Endosc. 2020;34(8):3696–3705. doi: 10.1007/s00464-019-07329-1. [DOI] [PubMed] [Google Scholar]
13.Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019;19(1):64. doi: 10.1186/s12874-019-0681-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Bektaş M, Reiber BMM, Pereira JC, Burchell GL, van der Peet DL. Artificial intelligence in bariatric surgery: current status and future perspectives. Obes Surg. 2022;32(8):2772–2783. doi: 10.1007/s11695-022-06146-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Ramspek CL, Jager KJ, Dekker FW, Zoccali C, van Diepen M. External validation of prognostic models: what, why, how, when and where? Clin Kidney J. 2021;14(1):49–58. doi: 10.1093/ckj/sfaa188. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Abu Dayyeh BK, Bazerbachi F, Vargas EJ, et al. Endoscopic sleeve gastroplasty for treatment of class 1 and 2 obesity (MERIT): a prospective, multicentre, randomised trial. Lancet Lond Engl. 2022;400(10350):441–451. doi: 10.1016/S0140-6736(22)01280-6. [DOI] [PubMed] [Google Scholar]
17.Stam WT, Goedknegt LK, Ingwersen EW, Schoonmade LJ, Bruns ERJ, Daams F. The prediction of surgical complications using artificial intelligence in patients undergoing major abdominal surgery: a systematic review. Surgery. 2022;171(4):1014–1021. doi: 10.1016/j.surg.2021.10.002. [DOI] [PubMed] [Google Scholar]
18.van de Laar AW, van Rijswijk AS, Kakar H, Bruin SC. Sensitivity and specificity of 50% excess weight loss (50%EWL) and twelve other bariatric criteria for weight loss success. Obes Surg. 2018;28(8):2297–2304. doi: 10.1007/s11695-018-3173-4. [DOI] [PubMed] [Google Scholar]
19.Guimarães M, Osório C, Silva D, et al. How sustained is Roux-en-Y gastric bypass long-term efficacy?: Roux-en-Y gastric bypass efficacy. Obes Surg. 2021;31(8):3623–3629. doi: 10.1007/s11695-021-05458-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002;46(1):389–422. doi: 10.1023/A:1012487302797. [DOI] [Google Scholar]
21.Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–592. doi: 10.2307/2335739. [DOI] [Google Scholar]
22.Currie AC, Glaysher MA, Blencowe NS, Kelly J. Systematic review of innovation reporting in endoscopic sleeve gastroplasty. Obes Surg. 2021;31(7):2962–2978. doi: 10.1007/s11695-021-05355-4. [DOI] [PubMed] [Google Scholar]
23.Xue B, Li D, Lu C, et al. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw Open. 2021;4(3):e212240. doi: 10.1001/jamanetworkopen.2021.2240. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Elfanagely O, Toyoda Y, Othman S, et al. Machine learning and surgical outcomes prediction: a systematic review. J Surg Res. 2021;264:346–361. doi: 10.1016/j.jss.2021.02.045. [DOI] [PubMed] [Google Scholar]
25.Goto T, Camargo CA, Faridi MK, Freishtat RJ, Hasegawa K. Machine learning-based prediction of clinical outcomes for children during emergency department triage. JAMA Netw Open. 2019;2(1):e186937. doi: 10.1001/jamanetworkopen.2018.6937. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Kate RJ, Pearce N, Mazumdar D, Nilakantan V. A continual prediction model for inpatient acute kidney injury. Comput Biol Med. 2020;116:103580. doi: 10.1016/j.compbiomed.2019.103580. [DOI] [PubMed] [Google Scholar]
27.Kitamura R, Chen R, Trickey A, Eisenberg D. Positive and negative independent predictive factors of weight loss after bariatric surgery in a veteran population. Obes Surg. 2020;30(6):2124–2130. doi: 10.1007/s11695-020-04428-0. [DOI] [PubMed] [Google Scholar]
28.Blume CA, Brust-Renck PG, Rocha MK, et al. Development and validation of a predictive model of success in bariatric surgery. Obes Surg. 2021;31(3):1030–1037. doi: 10.1007/s11695-020-05103-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Karpińska IA, Kulawik J, Pisarska-Adamczyk M, Wysocki M, Pędziwiatr M, Major P. Is It possible to predict weight loss after bariatric surgery?—external validation of predictive models. Obes Surg. 2021;31(7):2994–3004. doi: 10.1007/s11695-021-05341-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Nielsen MS, Christensen BJ, Schmidt JB, et al. Predictors of weight loss after bariatric surgery-a cross-disciplinary approach combining physiological, social, and psychological measures. Int J Obes. 2020;44(11):2291–2302. doi: 10.1038/s41366-020-0576-9. [DOI] [PubMed] [Google Scholar]
31.Lopez-Nava G, Asokkumar R, Rull A, Corbelle F, Beltran L, Bautista I. Bariatric endoscopy procedure type or follow-up: what predicted success at 1 year in 962 obese patients? Endosc Int Open. 2019;07(12):E1691–E1698. doi: 10.1055/a-1007-1769. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Lopez-Nava G, Galvao M, Bautista-Castaño I, Fernandez-Corbelle J, Trell M. Endoscopic sleeve gastroplasty with 1-year follow-up: factors predictive of success. Endosc Int Open. 2016;04(02):E222–E227. doi: 10.1055/s-0041-110771. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Andersen PK. 3. Applied logistic regression. 2nd edn. David W. Hosmer and Stanley Lemeshow. Wiley, New York, 2000. No. of pages: xii+373. Price: £60.95. ISBN 0-471-35632-8. Stat Med. 2002;21(13):1963–1964. doi: 10.1002/sim.1236. [DOI] [Google Scholar]
34.Maselli DB, Alqahtani AR, Abu Dayyeh BK, et al. Revisional endoscopic sleeve gastroplasty of laparoscopic sleeve gastrectomy: an international, multicenter study. Gastrointest Endosc. 2021;93(1):122–130. doi: 10.1016/j.gie.2020.05.028. [DOI] [PubMed] [Google Scholar]
35.Bulajic M, Vadalà di Prampero SF, Boškoski I, Costamagna G. Endoscopic therapy of weight regain after bariatric surgery. World J Gastrointest Surg. 2021;13(12):1584–1596. doi: 10.4240/wjgs.v13.i12.1584. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Admella V, Osorio J, Sorribas M, Sobrino L, Casajoana A, Pujol-Gebellí J. Direct and two-step single anastomosis duodenal switch (SADI-S): unicentric comparative analysis of 232 cases. Cirugia Espanola. 2021;99(7):514–520. doi: 10.1016/j.cireng.2021.06.017. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary file1 (DOCX 13 kb)^{(13.9KB, docx)}

[CR1] 1.Lopez-Nava G, Galvão MP, Bautista-Castaño I, Fernandez-Corbelle JP, Trell M, Lopez N. Endoscopic sleeve gastroplasty for obesity treatment: two years of experience. ABCD Arq Bras Cir Dig São Paulo. 2017;30(1):18–20. doi: 10.1590/0102-6720201700010006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Singh S, Hourneaux de Moura DT, Khan A, Bilal M, Ryan MB, Thompson CC. Safety and efficacy of endoscopic sleeve gastroplasty worldwide for treatment of obesity: a systematic review and meta-analysis. Surg Obes Relat Dis. 2020;16(2):340–351. doi: 10.1016/j.soard.2019.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Lopez-Nava G, Laster J, Negi A, Fook-Chong S, Bautista-Castaño I, Asokkumar R. Endoscopic sleeve gastroplasty (ESG) for morbid obesity: how effective is it? Surg Endosc. 2022;36(1):352–360. doi: 10.1007/s00464-021-08289-1. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Dayyeh BKA, Bazerbachi F, Vargas EJ, et al. Endoscopic sleeve gastroplasty for treatment of class 1 and 2 obesity (MERIT): a prospective, multicentre, randomised trial. The Lancet. 2022;400(10350):441–451. doi: 10.1016/S0140-6736(22)01280-6. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Boškoski I, Pontecorvi V, Gallo C, Bove V, Laterza L, Costamagna G. Redo endoscopic sleeve gastroplasty: technical aspects and short-term outcomes. Ther Adv Gastroenterol. 2020;2:89. doi: 10.1177/1756284819896179. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Chang SH, Stoll CRT, Song J, Varela JE, Eagon CJ, Colditz GA. The effectiveness and risks of bariatric surgery: an updated systematic review and meta-analysis, 2003–2012. JAMA Surg. 2014;149(3):275–287. doi: 10.1001/jamasurg.2013.3654. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Li P, Ma B, Gong S, Zhang X, Li W. Efficacy and safety of endoscopic sleeve gastroplasty for obesity patients: a meta-analysis. Surg Endosc. 2020;34(3):1253–1260. doi: 10.1007/s00464-019-06889-6. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Abu Dayyeh BK, Acosta A, Camilleri M, et al. Endoscopic sleeve gastroplasty alters gastric physiology and induces loss of body weight in obese individuals. Clin Gastroenterol Hepatol. 2017;15(1):37–43.e1. doi: 10.1016/j.cgh.2015.12.030. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Alqahtani A, Al-Darwish A, Mahmoud AE, Alqahtani YA, Elahmedi M. Short-term outcomes of endoscopic sleeve gastroplasty in 1000 consecutive patients. Gastrointest Endosc. 2019;89(6):1132–1138. doi: 10.1016/j.gie.2018.12.012. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Neto MG, Silva LB, de Quadros LG, et al. Brazilian consensus on endoscopic sleeve gastroplasty. Obes Surg. 2021;31(1):70–78. doi: 10.1007/s11695-020-04915-4. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Lopez-Nava G, Asokkumar R, Rull A, Corbelle F, Beltran L, Bautista I. Bariatric endoscopy procedure type or follow-up: what predicted success at 1 year in 962 obese patients? Endosc Int Open. 2019;7(12):E1691. doi: 10.1055/a-1007-1769. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Pizzicannella M, Lapergola A, Fiorillo C, et al. Does endoscopic sleeve gastroplasty stand the test of time? Objective assessment of endoscopic ESG appearance and its relation to weight loss in a large group of consecutive patients. Surg Endosc. 2020;34(8):3696–3705. doi: 10.1007/s00464-019-07329-1. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019;19(1):64. doi: 10.1186/s12874-019-0681-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Bektaş M, Reiber BMM, Pereira JC, Burchell GL, van der Peet DL. Artificial intelligence in bariatric surgery: current status and future perspectives. Obes Surg. 2022;32(8):2772–2783. doi: 10.1007/s11695-022-06146-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Ramspek CL, Jager KJ, Dekker FW, Zoccali C, van Diepen M. External validation of prognostic models: what, why, how, when and where? Clin Kidney J. 2021;14(1):49–58. doi: 10.1093/ckj/sfaa188. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Abu Dayyeh BK, Bazerbachi F, Vargas EJ, et al. Endoscopic sleeve gastroplasty for treatment of class 1 and 2 obesity (MERIT): a prospective, multicentre, randomised trial. Lancet Lond Engl. 2022;400(10350):441–451. doi: 10.1016/S0140-6736(22)01280-6. [DOI] [PubMed] [Google Scholar]

[CR17] 17.Stam WT, Goedknegt LK, Ingwersen EW, Schoonmade LJ, Bruns ERJ, Daams F. The prediction of surgical complications using artificial intelligence in patients undergoing major abdominal surgery: a systematic review. Surgery. 2022;171(4):1014–1021. doi: 10.1016/j.surg.2021.10.002. [DOI] [PubMed] [Google Scholar]

[CR18] 18.van de Laar AW, van Rijswijk AS, Kakar H, Bruin SC. Sensitivity and specificity of 50% excess weight loss (50%EWL) and twelve other bariatric criteria for weight loss success. Obes Surg. 2018;28(8):2297–2304. doi: 10.1007/s11695-018-3173-4. [DOI] [PubMed] [Google Scholar]

[CR19] 19.Guimarães M, Osório C, Silva D, et al. How sustained is Roux-en-Y gastric bypass long-term efficacy?: Roux-en-Y gastric bypass efficacy. Obes Surg. 2021;31(8):3623–3629. doi: 10.1007/s11695-021-05458-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002;46(1):389–422. doi: 10.1023/A:1012487302797. [DOI] [Google Scholar]

[CR21] 21.Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–592. doi: 10.2307/2335739. [DOI] [Google Scholar]

[CR22] 22.Currie AC, Glaysher MA, Blencowe NS, Kelly J. Systematic review of innovation reporting in endoscopic sleeve gastroplasty. Obes Surg. 2021;31(7):2962–2978. doi: 10.1007/s11695-021-05355-4. [DOI] [PubMed] [Google Scholar]

[CR23] 23.Xue B, Li D, Lu C, et al. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw Open. 2021;4(3):e212240. doi: 10.1001/jamanetworkopen.2021.2240. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Elfanagely O, Toyoda Y, Othman S, et al. Machine learning and surgical outcomes prediction: a systematic review. J Surg Res. 2021;264:346–361. doi: 10.1016/j.jss.2021.02.045. [DOI] [PubMed] [Google Scholar]

[CR25] 25.Goto T, Camargo CA, Faridi MK, Freishtat RJ, Hasegawa K. Machine learning-based prediction of clinical outcomes for children during emergency department triage. JAMA Netw Open. 2019;2(1):e186937. doi: 10.1001/jamanetworkopen.2018.6937. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Kate RJ, Pearce N, Mazumdar D, Nilakantan V. A continual prediction model for inpatient acute kidney injury. Comput Biol Med. 2020;116:103580. doi: 10.1016/j.compbiomed.2019.103580. [DOI] [PubMed] [Google Scholar]

[CR27] 27.Kitamura R, Chen R, Trickey A, Eisenberg D. Positive and negative independent predictive factors of weight loss after bariatric surgery in a veteran population. Obes Surg. 2020;30(6):2124–2130. doi: 10.1007/s11695-020-04428-0. [DOI] [PubMed] [Google Scholar]

[CR28] 28.Blume CA, Brust-Renck PG, Rocha MK, et al. Development and validation of a predictive model of success in bariatric surgery. Obes Surg. 2021;31(3):1030–1037. doi: 10.1007/s11695-020-05103-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Karpińska IA, Kulawik J, Pisarska-Adamczyk M, Wysocki M, Pędziwiatr M, Major P. Is It possible to predict weight loss after bariatric surgery?—external validation of predictive models. Obes Surg. 2021;31(7):2994–3004. doi: 10.1007/s11695-021-05341-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Nielsen MS, Christensen BJ, Schmidt JB, et al. Predictors of weight loss after bariatric surgery-a cross-disciplinary approach combining physiological, social, and psychological measures. Int J Obes. 2020;44(11):2291–2302. doi: 10.1038/s41366-020-0576-9. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Lopez-Nava G, Asokkumar R, Rull A, Corbelle F, Beltran L, Bautista I. Bariatric endoscopy procedure type or follow-up: what predicted success at 1 year in 962 obese patients? Endosc Int Open. 2019;07(12):E1691–E1698. doi: 10.1055/a-1007-1769. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Lopez-Nava G, Galvao M, Bautista-Castaño I, Fernandez-Corbelle J, Trell M. Endoscopic sleeve gastroplasty with 1-year follow-up: factors predictive of success. Endosc Int Open. 2016;04(02):E222–E227. doi: 10.1055/s-0041-110771. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR33] 33.Andersen PK. 3. Applied logistic regression. 2nd edn. David W. Hosmer and Stanley Lemeshow. Wiley, New York, 2000. No. of pages: xii+373. Price: £60.95. ISBN 0-471-35632-8. Stat Med. 2002;21(13):1963–1964. doi: 10.1002/sim.1236. [DOI] [Google Scholar]

[CR34] 34.Maselli DB, Alqahtani AR, Abu Dayyeh BK, et al. Revisional endoscopic sleeve gastroplasty of laparoscopic sleeve gastrectomy: an international, multicenter study. Gastrointest Endosc. 2021;93(1):122–130. doi: 10.1016/j.gie.2020.05.028. [DOI] [PubMed] [Google Scholar]

[CR35] 35.Bulajic M, Vadalà di Prampero SF, Boškoski I, Costamagna G. Endoscopic therapy of weight regain after bariatric surgery. World J Gastrointest Surg. 2021;13(12):1584–1596. doi: 10.4240/wjgs.v13.i12.1584. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Admella V, Osorio J, Sorribas M, Sobrino L, Casajoana A, Pujol-Gebellí J. Direct and two-step single anastomosis duodenal switch (SADI-S): unicentric comparative analysis of 232 cases. Cirugia Espanola. 2021;99(7):514–520. doi: 10.1016/j.cireng.2021.06.017. [DOI] [PubMed] [Google Scholar]

PERMALINK

Machine learning models to predict success of endoscopic sleeve gastroplasty using total and excess weight loss percent achievement: a multicentre study

Maria Vannucci

Patrick Niyishaka

Toby Collins

María Rita Rodríguez-Luna

Pietro Mascagni

Alexandre Hostettler

Jacques Marescaux

Silvana Perretta

Abstract

Background

Methods

Results

Conclusions

Supplementary Information

Methods

Dataset overview and outcome variables

Table 1.

Model training

Model validation

Multicenter validation

Single-center validation

Data preparation

Results

Cohort statistics

Multicenter external validation

Fig. 1.

Fig. 2.

Single-center internal validation

Fig. 3.

Single-center temporal validation

Fig. 4.

Overall results

Discussion

Study limitations

Conclusion

Supplementary Information

Acknowledgements

Funding

Declarations

Disclosures

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases