Construction of a machine learning model based on the Ki67 positive index to predict the recurrence risk of hepatocellular carcinoma
10.3760/cma.j.cn501113-20241002-00526
- VernacularTitle:构建基于增殖细胞核抗原阳性指数的机器学习模型预测肝细胞癌的复发风险
- Author:
Haoran LI
1
;
Yan YU
;
Fangying FAN
;
Wenzhen DING
;
Hui FENG
;
Minghua YING
;
Jiawei LI
;
Qingqing SUN
;
Lele BIAN
;
Haokai XU
;
Zhanyue CHEN
;
Jie YU
;
Ping LIANG
Author Information
1. 中国人民解放军总医院第五医学中心超声科,北京 100039
- Publication Type:Journal Article
- Keywords:
Hepatocellular carcinoma;
Recurrence;
Proliferating cell nuclear antigen;
Prediction;
Risk;
Machine learning
- From:
Chinese Journal of Hepatology
2025;33(9):898-909
- CountryChina
- Language:Chinese
-
Abstract:
Objective:To screen the optimal machine learning model for predicting the recurrence condition of hepatocellular carcinoma (HCC) at different time points post-surgery, based on the cutoff value of the Ki67 positive proliferation index condition calculated from recurrence-free survival and combined with various clinical features.Methods:retrospective study included initially treated patients with solitary HCC who underwent radical surgery at the Fifth Medical Center of the PLA General Hospital from January 2013 to March 2023. Data included general clinical data, preoperative laboratory parameters, and surgical pathology information about the subjects. The postoperative recurrence status was assessed by querying the medical record system or by telephone follow-up. The Ki67 positive index cutoff value was determined by the X-tile software based on the patient's recurrence-free survival status and time analysis. Survival rates were calculated using the Kaplan-Meier method, and survival curves were plotted. The study population was randomly divided into training and testing groups in a 7:3 ratio using a computer-generated random number method. The minimum redundancy maximum relevance (mRMR) method was used for feature variable selection. Predictive models for postoperative HCC recurrence conditions in patients with HCC were constructed using random forest, support vector machine, logistic regression, and gradient boosting decision tree machine learning algorithms. Inter-group comparisons for continuous data were performed using the t-test or Mann-Whitney U test. Inter-group comparisons of enumeration data were performed using the Pearson χ2 test, continuity-corrected χ2 test, or Fisher's exact test. Results:The cutoff values for the Ki67 positivity index were 0.3 and 0.5 in 510 cases, with a follow-up time ranging from 1.2 to 11.4 years (median: 6.2 years). The recurrence-free survival time was between 1 and 135 months (median: 32 months), with recurrence-free survival rates post-surgery at 1, 2, 3, and 5 years were 87.5%, 77.1%, 61.2%, and 54.5%, respectively. The top five variables predicted HCC recurrence and non-recurrence conditions following surgical follow-up at 6 months, 1 year, 2 years, and beyond 2 years, in accordance with information obtained by the mRMR screen out. The Ki67 positivity index screened a successfully constructed machine learning model to predict HCC recurrence and non-recurrence conditions following surgical follow-up at 6 months, 1 year, 2 years, and beyond 2 years. The machine learning model based on the gradient boosting decision tree algorithm had the best prediction performance among them (areas under the receiver operating characteristic curves for predicting HCC recurrence within six months in the training and validation sets were 0.996 and 0.946, and accuracies were 0.972 and 0.935, respectively).Conclusion:A machine learning model was successfully constructed using the Ki67 positivity index combined with four readily available clinical features to predict HCC recurrence. The machine learning model based on the gradient boosting decision tree algorithm demonstrated the best performance in terms of predicting HCC recurrence within six months after surgery.