Systematic Reviews
Copyright ©The Author(s) 2025.
World J Gastrointest Surg. Aug 27, 2025; 17(8): 109463
Published online Aug 27, 2025. doi: 10.4240/wjgs.v17.i8.109463
Table 2 Overview of studies focusing on preoperative planning and risk prediction in gastrointestinal surgery
Ref.
Focus area
AI method/model
Key outcome
Number of data points
Results
Galvis-García et al[13], 2023Colorectal polyp detection and classificationDeep learning, CNN, CAD (CADe/CADx)Increased adenoma and polyp detection rates using AI assisted colonoscopy in real time1038 patients (RCT), 8641 images (CNN study), 466 polyps (CADx), 238 lesions (endocytoscopy)Sensitivity up to 96.5%, specificity up to 93%, accuracy up to 96.4%, F1 score approximately 94%, NPV up to 99.6%
Zhang et al[14], 2023Diagnosis of choledocholithiasis in gallstone patientsMachine learning (7 models), AI (ModelArts)Developed and validated AI model with high diagnostic accuracy for CBD stones prediction1199 patients (681 with CBD stones)ModelArts AI: Accuracy 0.97, recall 097, precision 0.971, F1 score 097; machine learning AUCs: 0.77-0.81
Ahmad et al[15], 2022Detection of subtle and advanced colorectal neoplasiaDeep learning (ResNet 101 CNN)High sensitivity for detecting flat lesions, sessile serrated lesions, and advanced colorectal polyps173 polyps, 35114 polyp positive frames, 634988 polyp negative frames across multiple datasetsPer polyp sensitivity: 100%, 98.9%, and 79.5% (in subtle set); F1 score: Up to 87.9%; CNN outperformed expert and trainee endoscopists in detection speed and accuracy
Lei et al[16], 2023Polyp detection via colon capsule endoscopyDeep learning (CNN, AiSPEED)Feasibility and diagnostic accuracy of AI assisted reading compared to clinician interpretationTarget: 674 patients (597 needed for power; both prospective and retrospective recruitment)Sensitivity/specificity of AI to be compared to clinician standard; exact results pending study is ongoing
Eckhoff et al[17], 2023Surgical phase recognition for Ivor-Lewis esophagectomyCNN + LSTM (TEsoNet), transfer learningDemonstrated feasibility of knowledge transfer from sleeve gastrectomy to esophagectomy with moderate accuracy60 sleeve gastrectomy videos, 40 esophagectomy videos (used in combinations across 5 experiments)Single procedure accuracy: 87.7% (sleeve). Transfer learning: 23.4% overall (4 overlapping phases: 58.6%). Co training max accuracy: 40.8%
Blum et al[18], 2024Prediction of choledocholithiasisLogistic regression, RF, XGBoost, KNN; ensemble modelMachine learning models can outperform ASGE guidelines in predicting choledocholithiasis risk using pre MRCP data222 patientsAUROC: 0.83 (RF), accuracy: 0.81 (ensemble), sensitivity: Up to 0.94, F1 score: Up to 0.82
Axon[19], 2020Evolution and future of digestive endoscopyConceptual AI (future prediction)Highlights AI’s potential to surpass expert level diagnostic accuracy and support real time treatment decisionsPredicts that AI will revolutionize diagnosis and treatment in endoscopy with real time support tools
Hsu et al[20], 2023Predicting postoperative GIB after bariatric surgeryRF, XGBoost, NNsMachine learning models outperform logistic regression in predicting GIB, aiding clinical decision making159959 patients (632 with GIB)RF AUROC: 0.764, sensitivity: 75.4%, specificity: 70.0%; XGBoost AUROC: 0.746; NN AUROC: 0.741; logistic regression AUROC: 0.709
Athanasiadis et al[21], 2025Accuracy of self-assessment in laparoscopic cholecystectomy (CVS quality)Surgeons frequently overestimate CVS performance; self-assessment alone is insufficient25 surgeons enrolled, 13 submitted 1 video, 4 submitted 2 videosNo surgeon achieved adequate CVS per expert review; significant discrepancy between self and expert ratings on Strasberg scale
Bisschops et al[22], 2019Advanced endoscopic imaging for colorectal neoplasiaGuidance on when to use advanced imaging (HD WLE, CE, NBI, etc.) for detection and differentiation of colorectal lesionsAI suggested for future use if validated; various imaging techniques reviewed for ADR, miss rates, lesion detection
Han et al[23], 2025Intraoperative recognition of PAN during total mesorectal excisionDeepLabv3+ with ResNet50 backboneAI model (AINS) achieved real time neuro recognition of PAN, aiding nerve preservation during rectal cancer surgery1780 images (1424 training, 356 validation)Accuracy: 0.9609, precision: 0.7494, recall: 0.6587, F1 score: 0.7011; AI outperformed surgeons (F1: 0.4568) and operated faster (3 minutes vs 25 minutes)