Retrospective Study
Copyright ©The Author(s) 2023. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Gastroenterol. Nov 21, 2023; 29(43): 5804-5817
Published online Nov 21, 2023. doi: 10.3748/wjg.v29.i43.5804
Development of a machine learning-based model for predicting risk of early postoperative recurrence of hepatocellular carcinoma
Yu-Bo Zhang, Gang Yang, Yang Bu, Peng Lei, Wei Zhang, Dan-Yang Zhang
Yu-Bo Zhang, Gang Yang, Peng Lei, Wei Zhang, Dan-Yang Zhang, Department of Hepatobiliary Surgery, General Hospital of Ningxia Medical University, Yinchuan 750003, Ningxia Hui Autonomous Region, China
Yang Bu, Department of Hepatobiliary Surgery, People's Hospital of Ningxia Hui Autonomous Region, Yinchuan 750003, Ningxia Hui Autonomous Region, China
Author contributions: Zhang YB analyzed the data and wrote the manuscript; Yang G analyzed the data and wrote the original draft; Bu Y contributed the resources; Lei P designed the research; Zhang W analyzed the data; Zhang DY wrote the original draft.
Supported by Ningxia Key Research and Development Program, No. 2018BEG03001.
Institutional review board statement: The study was approved by the Ethics Committee of the General Hospital of Ningxia Medical University (KYLL-2023-0378).
Informed consent statement: Written informed consent was provided by individual patients.
Conflict-of-interest statement: The authors declare no conflict of interest for this article.
Data sharing statement: No additional data are available.
Open-Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Peng Lei, MD, Doctor, Department of Hepatobiliary Surgery, General Hospital of Ningxia Medical University, No. 84 Shenli Nan Road, Yinchuan 750003, Ningxia Hui Autonomous Region, China. leipengnx@126.com
Received: August 26, 2023
Peer-review started: August 26, 2023
First decision: September 18, 2023
Revised: October 7, 2023
Accepted: November 3, 2023
Article in press: November 3, 2023
Published online: November 21, 2023
Abstract
BACKGROUND

Surgical resection is the primary treatment for hepatocellular carcinoma (HCC). However, studies indicate that nearly 70% of patients experience HCC recurrence within five years following hepatectomy. The earlier the recurrence, the worse the prognosis. Current studies on postoperative recurrence primarily rely on postoperative pathology and patient clinical data, which are lagging. Hence, developing a new pre-operative prediction model for postoperative recurrence is crucial for guiding individualized treatment of HCC patients and enhancing their prognosis.

AIM

To identify key variables in pre-operative clinical and imaging data using machine learning algorithms to construct multiple risk prediction models for early postoperative recurrence of HCC.

METHODS

The demographic and clinical data of 371 HCC patients were collected for this retrospective study. These data were randomly divided into training and test sets at a ratio of 8:2. The training set was analyzed, and key feature variables with predictive value for early HCC recurrence were selected to construct six different machine learning prediction models. Each model was evaluated, and the best-performing model was selected for interpreting the importance of each variable. Finally, an online calculator based on the model was generated for daily clinical practice.

RESULTS

Following machine learning analysis, eight key feature variables (age, intratumoral arteries, alpha-fetoprotein, pre-operative blood glucose, number of tumors, glucose-to-lymphocyte ratio, liver cirrhosis, and pre-operative platelets) were selected to construct six different prediction models. The XGBoost model outperformed other models, with the area under the receiver operating characteristic curve in the training, validation, and test datasets being 0.993 (95% confidence interval: 0.982-1.000), 0.734 (0.601-0.867), and 0.706 (0.585-0.827), respectively. Calibration curve and decision curve analysis indicated that the XGBoost model also had good predictive performance and clinical application value.

CONCLUSION

The XGBoost model exhibits superior performance and is a reliable tool for predicting early postoperative HCC recurrence. This model may guide surgical strategies and postoperative individualized medicine.

Keywords: Machine learning, Hepatocellular carcinoma, Early recurrence, Risk prediction models, Imaging features, Clinical features

Core Tip: The current study aimed at employing machine learning techniques to select imaging and pre-operative clinical characteristic variables, to which the clinicians were easily accessible, to develop six different risk prediction models for early postoperative recurrence of hepatocellular carcinoma (HCC). We compared the sensitivity and specificity of these models in detecting patients at high risk of early postoperative recurrence of HCC. In addition, to increase the feasibility and applicability of the constructed model, we generated a calculator online based on the predictive model to help clinicians apply it in their daily medical practice.