Abstract
Purpose
We developed Machine learning (ML) algorithms to predict ureteroscopy (URS) outcomes, offering insights into diagnosis and treatment planning, personalised care and improved clinical decision-making.
Methods
FLEXOR is a large international multicentric database including 6669 patients treated with URS for urolithiasis from 2015 to 2023. Preoperative and postoperative(PO) correlations were investigated through 15 ML-trained algorithms. Outcomes included stone free status (SFS, at 3-month imaging follow up), intraoperative (PCS bleeding, ureteric/PCS injury, need for postoperative drainage) and PO complications (fever, sepsis, need for reintervention). ML was applied for the prediction, correlation and logistic regression analysis. Explainable AI emphasizes key features and their contributions to the output.
Results
Extra Tree Classifier achieved the best accuracy (81%) in predicting SFS. PCS bleed was negatively linked with ‘positive urine culture‘(-0.08), ‘tamsulosin‘(-0.08), ‘stone location‘(-0.10), ‘fibre optic scope‘(-0.19), ‘Moses Fibre‘(-0.09), and ‘TFL‘(-0.09), and positively with ‘elevated creatine‘(0.25), ‘fever‘(0.11), and ‘stone diameter‘(0.21). ‘PCS injury’ and ‘ureteric injury’ both showed moderate correlation with ‘elevated creatinine‘(0.11), ‘fever‘(0.10), and ‘lower pole stone‘(0.09). ‘Tamsulosin‘(0.23) use, presence of ‘multiple‘(0.25) or ‘lower pole‘(0.25) stones, ‘reusable scope‘(0.17) and ‘Moses Fibre’(0.2546) increased the risk for PO stent, while ‘digital scope’(-0.13) or ‘TFL‘(-0.29) reduced it. ‘Preoperative fever‘(0.10), ‘positive urine culture‘(0.16), and ‘stone diameter‘(0.10) may play a role in ‘PO fever’ and ‘sepsis’. SFS was mainly influenced by ‘age‘(0.12), ‘preoperative fever‘(0.09), ‘multiple stones‘(0.15), ‘stone diameter‘(0.17), ‘Moses Fibre“(0.15) and ‘TFL‘(-0.28).
Conclusion
ML is valuable tool for accurately predicting outcomes by analysing pre-existing datasets. Our model demonstrated strong performance in outcomes and risks prediction, laying the groundwork for development of accessible predictive models.
Supplementary information
The online version contains supplementary material available at 10.1007/s00345-025-05551-2.
Keywords: Ureteroscopy, Urolithiasis, Machine learning, Artificial intelligence, Explainable AI, Outcomes prediction
Introduction
Urolithiasis represents a significant burden on healthcare systems worldwide, affecting millions of individuals annually, and leading to severe pain, potential renal damage, and frequent hospitalizations [1]. Ureteroscopy and laser lithotripsy (fURSL) are among the most effective minimally invasive surgical options for treating urolithiasis [2]. Despite its efficacy, the outcomes can vary significantly among patients due to a multitude of factors, including stone burden, location, patient anatomy, and comorbidities [3]. Moreover, the risk of complications, such as ureteral injury, infection, and incomplete stone clearance, remains a critical concern for clinicians [4].
Traditionally, the prediction of success and risks have relied on clinician expertise, patient history, and basic imaging techniques [5, 6]. However, these methods often lack the necessary precision to consistently predict outcomes, leading to variability in treatment effectiveness and patient safety [7]. As healthcare continues to embrace technological advancements, there is a growing interest in the application of Machine Learning (ML) to improve clinical decision-making [8–12]. ML, a subset of artificial intelligence (AI), involves the development of algorithms that can learn from and make predictions based on data. In recent years, ML has demonstrated remarkable potential in various medical fields, offering enhanced diagnostic accuracy, personalized treatment plans, and predictive analytics that surpass traditional methods [13–16].
In the context of urolithiasis, ML offers predictive capabilities: analysing large datasets of preoperative, intraoperative, and postoperative variables, complex patterns and interactions - often imperceptible to the human eye - can be identified [17, 18]. ML models can be trained to predict procedural outcomes, such as stone-free stetus (SFS), and to detect early signs of potential complications, thereby enabling proactive interventions that improve patient outcomes [19–21].
This paper proposes a novel predictive method that integrates ML into fURSL practice. We explore the development and validation of ML models designed to enhance outcome prediction and facilitate the early detection of complications. Our approach leverages a combination of patient-specific factors, procedural data, and advanced computational techniques to drive towards more accurate, individualized care.
Materials and methods
Patients selection, data collection and procedures
Data were extracted from an international multicentric database, the FLEXOR registry, analysing the practices and outcomes of 6,669 patients who underwent fURSL for urinary stones between 2015 and 2023. This dataset involved 20 centers across 15 countries, focusing on adult patients (≥ 18 years) treated for urolithiasis via fURSL, including both recurrences and first presentations, naïve or pre-stented patients, emergency and elective procedures, unilateral or bilateral urolithiasis, stones in both ureter and kidney, all stone sizes. Exclusion criteria included pediatric patients, those undergoing ureteroscopy for non-urolithiasis conditions (e.g., upper tract tumors), and individuals who declined consent for data collection. In all cases, patients received specialized preoperative counseling, and informed consent for the procedure and data usage was obtained. Suspension of oral anticoagulant therapies was evaluated for each patient and tailored according to comorbidities and local protocols. Antibiotic prophylaxis protocols were applied based on local guidelines and varied across centers; patients with a preoperative positive urine sample (MSU) were treated according to antibiograms and presented a second negative MSU before surgery.
Data collected included demographic and preoperative details such as age, sex, presentation type (acute, during follow-up, incidental), symptoms, comorbidities, stone size, total stone burden (cumulative diameter), stone count and location (upper pole (UP), middle pole (MP), lower pole (LP), renal pelvis (RP)), MSU findings, and the presence of preoperative urinary drainage. Additionally, factors like preoperative hematuria, fever, positive MSU, and use of alpha-blocker medications (e.g., Tamsulosin) were documented. Imaging prior to surgery involved plain or contrast-enhanced CT scans to assess anatomy and stone characteristics.
Surgical details recorded included anesthesia type (spinal, epidural, general), operative duration (from instrument insertion to catheter placement), tools and devices used (e.g., ureteral access sheaths, single-use/reusable scopes, digital/fiberoptic scopes, baskets), and laser-related parameters (type, MOSES technology, dusting/fragmentation mode, lasering time). The use of postoperative stents and any intraoperative complications, such as injuries to the pelvicalyceal system (PCS) or ureter, bleeding, and transfusion requirements, were also documented. Early postoperative complications, including fever, sepsis, hematuria, and pain, were tracked and classified using the Clavien-Dindo grading system. Data on hospitalization duration and readmissions were also collected.
Follow-up assessments determined SFS through local protocols (using KUB X-rays, ultrasound, or non-contrast CT scans). SFS was defined as the absence of fragments larger than 2 mm in the urinary tract.
ML algorithms training and data analysis
The workflow was organized into multiple stages, beginning with data cleaning and preprocessing. Initially, unnecessary spaces and irrelevant characters were removed to clean the data. Subsequently, preprocessing steps were undertaken, including imputing mode values for categorical preoperative characteristics. Statistical analyses were then carried out, which involved examining correlations, calculating variance inflation factors (VIF), and performing logistic regression for all four tasks. Following this, individual ML algorithms were trained for each task, while a multitask artificial neural network (ANN) was developed to address all tasks simultaneously.
In total, sixteen ML algorithms were selected and trained on the preprocessed data for each of the four tasks. These included logistic regression, quadratic discriminant analysis, Extra Trees Classifier, AdaBoost, CatBoost Classifier, Naïve Bayes, Bagging Classifier, Gradient Boost, Extreme Gradient Boost (XGBoost), Decision Tree, K-Nearest Neighbors (KNN), Random Forest, Linear Discriminant Analysis, and Support Vector Machines (SVM) using linear, polynomial, and radial basis function (RBF) kernels. Each algorithm was trained independently to predict outcomes based on the input features associated with each task.
To address multitasking, we implemented algorithms designed to generate multiple predictions simultaneously. A multitask ANN was specifically constructed to predict all postoperative outcomes concurrently, using shared layers for initial feature extraction, followed by task-specific layers tailored to each prediction. The architecture comprised an input layer with eight neurons, a shared hidden layer containing 128 neurons activated by a rectified linear unit (ReLU) function, and four distinct output layers with one neuron each, employing sigmoid activation functions to predict individual postoperative outcomes. The performance of ML models was assessed using confusion matrix and classification report. The confusion matrix provided counts of true positives, true negatives, false positives, and false negatives, which were used to derive metrics such as accuracy, precision, recall, and the F1 score. The classification report further detailed precision and recall metrics for each class within the tasks.
Recently, emphasis on model interpretability and explainability has increased, particularly in sensitive domains like healthcare, as complex models such as ANNs, often regarded as “black boxes,” require strategies to improve transparency. To enhance interpretability, explainable AI (XAI) techniques were employed to shed light on the predictions made by the models. Explainable Tree, Shapley additive explanations (SHAP) plot, SHAP beeswarm chart and SHAP bar chart were created for all selected outcomes. These methods highlighted key features and their contributions, offering clear insights into the factors influencing the outcomes.
The following features were selected for ML training and testing:
Inputs: Age; Sex; Preoperative (PreOp) Haematuria; PreOp Pain; PreOp elevated creatinine (S-crea); PreOp Fever; PreOp MSU; PreOp Stent; Use of Tamsulosin; Normal/abnormal anatomy of urinary tract; Number of stones; Stone diameter; Location of stone; Suction UAS; Reusable/disposable scope; Digital/fibre optic scope; Use of Moses; Use of TFL;
Outputs: Need for postoperative (PostOp) drainage; Residual fragments (RFs); Intraoperative complications: PCS bleeding, PCS injury, Ureteric injury; PostOp complications: Fever, Sepsis; Same-day discharge; Need for reintervention.
Results
The findings from the FLEXOR registry, conducted by the TOWER research group, have already been published [22]. Thus, only a brief overview is provided here and a complete presentation of results is provided in the supplementary materials. The complete results from ML analysis and XAI are also available in the supplementary materials.
Outcomes from the FLEXOR registry
The study enrolled 6,669 patients internationally. Mean patients age was 49.3 years, with a M: F ratio 2:1. Most patients presented with pain and a single stone, with a mean diameter of 10.04 mm. fURSL was predominantly performed using reusable flexible ureteroscopes in 72.0% cases, and a UAS was employed in 93.2%. The Holmium: YAG laser was used in 73.1%, with the combined dusting/fragmentation technique being the most common lithotripsy method. The average operation time was 62.40 min, and the mean hospital stay 3.62 days. Postoperative complications occurred in 535 patients, including 84 cases of sepsis. At follow-up, 78.3% of patients achieved SFS. Among those with RFs, 51.5% required further intervention.
Prediction of need for PostOp drainage
For the prediction of PostOp drainage, the best performing algorithm was the Extra Tree Classifier, with an accuracy of 80.97% and a precision of 77.70%. At correlation analysis, factors related to the need for PostOp drainage were: use of Tamsulosin, Number of stones, Stone location and diameter, PreOp elevated S-crea, PreOp pain, Use of Moses, abnormal anatomy, use of disposable or fibreoptic scope, of Moses technology and TFL. Logistic regression analysis indicated the same variables as associated with the likelihood of Need for PostOp drainage.
Prediction of FRs
For the automated prediction of RFs, Extra Tree Classifier performed as the best algorithm, with an accuracy of 81.10% and a precision of 75.78%. Cat Boost Classifier and Random Forest followed with both an accuracy of 79.26%, and a precision of 73.59% and 73.42% respectively. Correlation analysis suggests that factors such as Age, PreOp Haematuria, PreOp elevated S-crea, PreOp Fever, Positive MSU, Number of stones, Stone diameter, Stone in LP, Use of Moses, and Use of TFL may play a role in the occurrence of FRs. Similarly, logistic regression demonstrated a significant association with Age, PreOp elevated S-crea, Number of stones, Stone diameter, Stone in RP/MP, Reusable and Fibreoptic scope, Use of Moses and TFL, Use of Tamsulosin, abnormal anatomy. XAI results (Fig. 1) demonstrated strong correlation with the use TFL and digital scopes, that seems to play a predictive role in the absence of fragments. These results were consistent in the Explainable tree, Plot Chart and SHAP Bar Chart.
Fig. 1.
Explainable AI for RFs. Clockwise from the upper left figure: explainable tree, feature importance bar chart, shap summary plot, shap beeswarm chart
Intraoperative complications: PCS and ureteric injury
Random Forest algorithm was the best predictor of PCS injury, with 97.73% accuracy and 63.50% precision, while XGBoost achieved 96.88% accuracy and 59.27% precision in predicting ureteric injury. Explainable AI emphasized the importance of factors such as digital scope usage, TFL, stone location and higher stone burden in the risk of both PCS and ureteric injury.
Correlation analysis revealed a moderate positive correlation was observed between PCS injury and factors like elevated creatinine, fever, abnormal anatomy, multiple and larger stones, and stones in the LP/RP, with the strongest correlation seen for PCS bleed. For ureteric injury, association was noted with age and sex, preoperative haematuria/pain, positive MSU, stone diameter, number and location, elevated creatinine, fever, pre-stented patients, tamsulosin, suction UAS, reusable or fibreoptic scope, use of TFL, and Moses Fibre. Logistic regression identified key predictors of PCS injury (fever, stones in the LP/MP, and use of fibreoptic scope or TFL) and ureteric injury (elevated creatinine, fever, LP stones, and Moses Fibre).
PostOp complications: fever and sepsis
The Extra Tree Classifier was the best ML model for predicting postoperative fever, with 91.34% accuracy and 58.20% precision; XGBoost performed slightly worse, with similar accuracy but lower precision. CatBoost Classifier was the best-performing ML algorithm in postoperative sepsis prediction, with 99.01% validation accuracy and 66.45% precision; Random Forest and Extra Tree Classifier also performed well (99.15% accuracy, 66.38% precision). Explainable AI highlighted TFL, reusable scopes, and stone characteristics (size, multiplicity, location) as key predictors of postoperative fever, while stone diameter, RP stones, and use of fibreoptic scope use emerged as key predictive features for sepsis. According to correlation analysis, preoperative MSU emerged as a key factor in postoperative fever, alongside age, sex, creatinine levels, fever, stone characteristics, and certain instrument choices. Logistic regression identified significant predictors: elevated creatinine, fever, positive MSU, tamsulosin, multiple stones, stone diameter, and MPe stones. Conversely, use of single use scopes and TFL reduced the likelihood of fever. Postoperative sepsis showed weak correlations with preoperative characteristics, and the use of TFL, suction UAS, fibreoptic scope, and Moses Fibre. Positive associations were also observed with other complications. Logistic regression highlighted significant predictors of postoperative sepsis: positive MSU, tamsulosin, increasing stone diameter, and RP stones, all increasing the odds of sepsis.
Prediction of same-day discharge
Gradient Boost was the best-performing ML algorithm, with 83.85% validation accuracy and 76.18% precision. Cat Boost Classifier (83.66% accuracy, 75.57% precision) and XGBoost (81.53% accuracy, 72.35% precision) also performed well. Explainable AI identified type of scopes, use and type of UAS and use of tamsulosin as key predictive features, with digital scopes being the most important. Same day discharged showed weak negative correlation with patient’s demographics and preoperative features. It had a moderate negative correlation with Moses Fibre, a strong negative correlation with reusable scope, and a very strong negative correlation with fibreoptic scope. On the other hand, it has a weak positive correlation with use of TFL and suction UAS, presence of pain at presentation, fever, and stone located in UP. According to logistic regression, stone diameter, use of reusable scopes, fibreoptic scopes, Moses Fibre have negative coefficients; while use of tamsulosin and suction UAS showed positive coefficients.
Prediction of need for reintervention
Random Forest was the best-performing ML algorithm, with 88.49% validation accuracy and 76.30% precision. XGBoost (88.21% accuracy, 75.62% precision) and Extra Tree Classifier (87.07% accuracy, 73.19% precision) also performed well. Explainable AI identified type of scopes, type of laser and stone diameters as key predictive features, with fibreoptic scopes being the most important. At correlation analysis, stone diameter, elevated creatinine, multiple stones, and stones in the RP or LP have positive correlations with reintervention. Presence of pain, patient sex, use of digital/single use scope, TFL, presence of fever, positive preoperative MSU, and use of Tamsulosin have negative correlations. Logistic regression showed negative coefficients for sex, single stone, single use, digital, Moses Fibre, and TFL have negative coefficients. Positive OR were found for elevated creatinine, fever, abnormal anatomy, stone diameter, LP stones, and use of tamsulosin.
Discussion
This study represents the first ML-driven evaluation of operative outcomes, complications, and their correlations in a large cohort of patients undergoing fURSL for urolithiasis. 15 ML algorithms were evaluated on a substantial dataset to accurately predict complications based on preoperative and intraoperative factors, with a particular emphasis on infectious complications. We included models such as the Extra Tree Classifier, Random Forest, and CatBoost Classifier, which demonstrated high accuracy in predicting intra- and postoperative outcomes. Notably, the CatBoost Classifier achieved the highest overall performance, with 99% accuracy in predicting postoperative sepsis, offering promising potential for precise complication prediction following stone treatments. Similarly, Random Forest delivered excellent accuracy in predicting intraoperative PCS injury (97.73%) and the need for reintervention (88.49%), and XGBoost in predicting ureteric injury (96.88%). Even if with lower percentages, our ML models achieved strong accuracy in the prediction of RFs (81.10%), need for postoperative drainage (80.97%) and feasibility of same-day discharge (83.85%).
XAI techniques were also applied in this study. While being a relatively new tool, somehow still not well-known in the urological community, XAI is a powerful method to increase the transparency and interpretability of “black-box” ML models, clarifying machine-generated predictions, fostering trust in the models, and enabling informed, data-driven decisions [23]. It does not reflect correlation and regression analysis alone, but produces new insights in the analysis interpreting the standard ML results [24]. For example, the explainable decision tree revealed associations between ureteric injury and factors such as specific stone locations. Additionally, XAI tools can enhance patient consultations by visually presenting risks and personalizing treatment discussions, serving as a reference for individual risk assessments.
Our analysis revealed a significant correlation between preoperative MSU and postoperative fever and sepsis (p = 0.001). While the importance of performing fURS in a sterile urinary tract has been previously explored [25], this finding underscores the clinical relevance of infection control and thorough patient preparation, particularly for high-risk individuals identified through preoperative MSU. Despite tailored preoperative antibiotic therapy, patients with an initially positive urine culture remained at an elevated risk for postoperative infectious complications.
In relation to RFs and need for reintervention, our analysis revealed a strong correlation with both stone characteristics (such as diameter and position within the PCS) and instrumentation choices. The introduction of new devices like the TFL and digital scopes appears to significantly influence SFS and reintervention rates. These findings align with recent research [26, 27] and pave the way for developing ML-driven scoring systems for automated risk and outcome prediction.
While ML is a relatively novel approach in urology, prior studies have demonstrated its applicability. For instance, Vigneswaran and colleagues [28] studied the correlation between SFS and clinical and radiological factors through a ML analysis on 330 patients, revealing stone volume to be the most important factor in the prediction of SFS, with an accuracy of 74.5%. In a previous study conducted by our team on a paediatric population [29], an ML ensemble model produced high accuracy (90%) in predicting SFS, finding correlation with overall stone size, presence of multiple stones, and presence of a preoperative stent. Pietropaolo et al. [30] employed ML to predict severe sepsis following fURS in 114 patients, identifying predictors like proximal stone location, prolonged stent use, large stone size, and extended operative times, with an accuracy of 81.3%. Similarly, Castellani et al. [31] analysed data from over 1,500 patients to identify predictors of post-ureteroscopy sepsis, reporting associations with age, stone volume, and operative time, with 92% accuracy. Other research has explored ML’s potential in predicting SFS based on total stone burden or intraoperative features, further emphasizing its value in the field [32].
A notable strength of the study lies in its extensive dataset from the FLEXOR database, encompassing data from over 6,500 patients across multiple centers. This large-scale dataset enhances the robustness and generalizability of the findings. Moreover, by evaluating 15 different ML algorithms, the study identified optimal configurations that delivered exceptional performance with the top-performing models. However, the authors acknowledge several limitations, including the retrospective design, which might result in missing data, and potential biases arising from multicenter variability in procedural techniques and instrumentation. Nevertheless, these limitations arguably mirror real-world clinical practices.
Future research could focus on enhancing ML models to achieve higher precision and incorporating real-time intraoperative data to improve predictive accuracy during procedures. Long-term investigations could also assess the impact of predictive models on patient outcomes, healthcare costs, and satisfaction, further solidifying their role in clinical decision-making.
Conclusion
Our study depicts ML as a valuable tool for the accurate prediction of outcomes by analysing pre-existing datasets, demonstrating strong performance in outcomes and risks prediction, with particular accuracy for complications. Ultimately, the integration of ML into urolithiasis treatment protocols may represent a significant step forward in achieving automatic outcome prediction and optimizing patient safety in clinical practice.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Author contributions
CN wrote the main manuscript. VG, DC, FP, JYCT, BHC, KYF, MB, NG, AGS, TRWH, OT, BKS provided and reviewed research data. CN, SA, NN, SG, HS performed statistical and ML analysis. CN, PT, SA, NN, SG, HS created figures and tables. All authors reviewed the manuscript.
Funding
Open access funding provided by Università Politecnica delle Marche within the CRUI-CARE Agreement.
Data availability
Data is provided within the manuscript or supplementary information files.
Declarations
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Romero V, Akpinar H, Assimos DG (2010) Kidney stones: a global picture of prevalence, incidence, and associated risk factors. Rev Urol 12(2–3):e86–96 [PMC free article] [PubMed] [Google Scholar]
- 2.Doizi S, Traxer O (2018) Flexible ureteroscopy: technique, tips and tricks. Urolithiasis 46(1):47–58 [DOI] [PubMed] [Google Scholar]
- 3.Skolarikos A, Neisius A, Petřík A, Somani B, Thomas K, Gambaro G (2022) EAU Guidelines on Urolithiasis. EAU guidelines. Edn. presented at the EAU Annual congress Amsterdam. Available from: https://uroweb.org/guidelines/urolithiasis/chapter/introduction
- 4.Grosso AA, Sessa F, Campi R, Viola L, Polverino P, Crisci A et al (2021) Intraoperative and postoperative surgical complications after ureteroscopy, retrograde intrarenal surgery, and percutaneous nephrolithotomy: a systematic review. Minerva Urol Nephrol. 73(3):309–332 [DOI] [PubMed]
- 5.Geraghty RM, Davis NF, Tzelves L, Lombardo R, Yuan C, Thomas K et al (2023) Best practice in interventional management of urolithiasis: an update from the European association of urology guidelines panel for urolithiasis 2022. Eur Urol Focus 9(1):199–208 [DOI] [PubMed] [Google Scholar]
- 6.De Coninck V, Keller EX, Somani B, Giusti G, Proietti S, Rodriguez-Socarras M et al (2020) Complications of ureteroscopy: a complete overview. World J Urol 38(9):2147–2166 [DOI] [PubMed] [Google Scholar]
- 7.Wright AE, Rukin NJ, Somani BK (2014) Ureteroscopy and stones: current status and future expectations. World J Nephrol 3(4):243 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nedbal C, Bres-Niewada E, Dybowski B, Somani BK (2024) The impact of artificial intelligence (AI) in revolutionizing all aspects of urological care: a glimpse in the future. Cent Eur J Urol 1(77):12–14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Obermeyer Z, Emanuel EJ (2016) Predicting the Future — Big data, machine learning, and clinical medicine. N Engl J Med 375(13):1216–1219 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kempa-Liehr AW, Lin CYC, Britten R, Armstrong D, Wallace J, Mordaunt D et al (2020) Healthcare pathway discovery and probabilistic machine learning. Int J Med Inf 137:104087 [DOI] [PubMed] [Google Scholar]
- 11.AlAzab R, Ghammaz O, Ardah N, Al-Bzour A, Zeidat L, Mawali Z et al (2023) Predicting the Stone-Free status of percutaneous nephrolithotomy with the machine learning system. Int J Nephrol Renovasc Dis 16:197–206 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Nedbal C, Cerrato C, Jahrreiss V, Pietropaolo A, Galosi AB, Castellani D et al (2023) Trends of ‘artificial intelligence, machine learning, virtual reality and radiomics in urolithiasis’ over the last 30 years (1994–2023) as published in the literature (PubMed): a comprehensive review. J Endourol 38(8):788–798 [DOI] [PubMed]
- 13.Zhu G, Li C, Guo Y, Sun L, Jin T, Wang Z et al (2024) Predicting stone composition via machine-learning models trained on intra-operative endoscopic digital images. BMC Urol 24(1):5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chen IY, Joshi S, Ghassemi M, Ranganath R (2021) Probabilistic machine learning for healthcare. Annu Rev Biomed Data Sci 4(1):393–415 [DOI] [PubMed] [Google Scholar]
- 15.Blum ES, Porras AR, Biggs E, Tabrizi PR, Sussman RD, Sprague BM et al (2018) Early detection of ureteropelvic junction obstruction using signal analysis and machine learning: A dynamic solution to a dynamic problem. J Urol 199(3):847–852 [DOI] [PubMed] [Google Scholar]
- 16.Elfanagely O, Toyoda Y, Othman S, Mellia JA, Basta M, Liu T et al (2021) Machine learning and surgical outcomes prediction: A systematic review. J Surg Res 264:346–361 [DOI] [PubMed] [Google Scholar]
- 17.Hassan AM, Rajesh A, Asaad M, Nelson JA, Coert JHenk, Mehrara BJ et al (2023) Artificial intelligence and machine learning in prediction of surgical complications: current State, applications, and implications. Am Surg 89(1):25–30 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Nedbal C, Adithya S, Naik N, Gite S, Juliebø-Jones P, Somani BK (2024) Can machine learning correctly predict outcomes of flexible ureteroscopy with laser lithotripsy for kidney stone disease?? Results from a large endourology university centre. Eur Urol Open Sci 64:30–37 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Nedbal C, Cerrato C, Jahrreiss V, Castellani D, Pietropaolo A, Galosi AB (2023) The role of ‘artificial intelligence, machine learning, virtual reality, and radiomics’ in PCNL: a review of publication trends over the last 30 years. Therapeutic Advances in Urology, vol 15. SAGE Publications Inc. [DOI] [PMC free article] [PubMed]
- 20.Aminsharifi A, Irani D, Tayebi S, Jafari Kafash T, Shabanian T, Parsaei H (2020) Predicting the postoperative outcome of percutaneous nephrolithotomy with machine learning system: software validation and comparative analysis with Guy’s stone score and the CROES nomogram. J Endourol 34(6):692–699 [DOI] [PubMed] [Google Scholar]
- 21.Zhao H, Li W, Li J, Li L, Wang H, Guo J (2022) Predicting the stone-Free status of percutaneous nephrolithotomy with the machine learning system: comparative analysis with Guy’s stone score and the S.T.O.N.E score system. Front Mol Biosci. 9:880291 [DOI] [PMC free article] [PubMed]
- 22.Gauhar V, Chew BH, Traxer O, Tailly T, Emiliani E, Inoue T et al (2022) Indications, preferences, global practice patterns and outcomes in retrograde intrarenal surgery (RIRS) for renal stones in adults: results from a multicenter database of 6669 patients of the global flexible ureteroscopy outcomes registry (FLEXOR). World J Urol 41(2):567–574 [DOI] [PubMed] [Google Scholar]
- 23.O’Sullivan S, Janssen M, Holzinger A, Nevejans N, Eminaga O, Meyer CP et al (2022) Explainable artificial intelligence (XAI): closing the gap between image analysis and navigation in complex invasive diagnostic procedures. World J Urol 40(5):1125–1134 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.London AJ (2019) Artificial intelligence and Black-Box medical decisions: Accuracy versus explainability. Hastings Cent Rep 49(1):15–21 [DOI] [PubMed] [Google Scholar]
- 25.Corrales M, Sierra A, Doizi S, Traxer O (2022) Risk of Sepsis in retrograde intrarenal surgery: A systematic review of the literature. European Urology Open Science, Elsevier B.V. 44;84–91 [DOI] [PMC free article] [PubMed]
- 26.Geavlete B, Mareș C, Mulțescu R, Georgescu D, Geavlete P (2022) Second-look flexible ureteroscopy after RIRS – Holmium Moses versus TFL (Soltive). J Med Life 15(10):1218–1223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Uleri A, Farré A, Izquierdo P, Angerri O, Kanashiro A, Balaña J et al (2024) Thulium Fiber laser versus holmium:yttrium aluminum Garnet for lithotripsy: A systematic review and Meta-analysis. Eur Urol 85(6):529–540 [DOI] [PubMed] [Google Scholar]
- 28.Vigneswaran G, Teh R, Ripa F, Pietropaolo A, Modi S, Chauhan J et al (2024) A machine learning approach using stone volume to predict stone-free status at ureteroscopy. World J Urol 42(1):344 [DOI] [PubMed] [Google Scholar]
- 29.Nedbal C, Adithya S, Gite S, Naik N, Griffin S, Somani BK (2024) A machine learning predictive model for ureteroscopy Lasertripsy outcomes in a pediatric Population—Results from a large endourology tertiary center. J Endourol 38(10):1044–1055 [DOI] [PubMed] [Google Scholar]
- 30.Pietropaolo A, Geraghty RM, Veeratterapillay R, Rogers A, Kallidonis P, Villa L et al (2021) A machine learning predictive model for Post-Ureteroscopy Urosepsis needing intensive care unit admission: A Case–Control YAU endourology study from nine European centres. J Clin Med 10(17):3888 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Castellani D, De Stefano V, Brocca C, Mazzon G, Celia A, Bosio A et al (2024) The infection post flexible ureterorenoscopy (I-FUN) predictive model based on machine learning: a new clinical tool to assess the risk of sepsis post retrograde intrarenal surgery for kidney stone disease. World J Urol 42(1):612 [DOI] [PubMed] [Google Scholar]
- 32.Ripa F, Vigneswaran G, Teh R, Pietropaolo A, Modi S, Somani B (2024) A machine learning approach using stone volume to predict stone-free status after ureteroscopy for kidney stones. Eur Urol Open Sci 59:S41–S42 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data is provided within the manuscript or supplementary information files.

