Exploring Risk Factors for Primary Liver Cancer in Patients with Chronic Hepatitis C Based on Machine Learning Prediction Models
10.3971/j.issn.1000-8578.2024.24.0590
- VernacularTitle:基于机器学习预测模型探索慢性丙型肝炎患者发生原发性肝癌的风险因素
- Author:
Rong YANG
1
;
Bin FANG
2
;
Lingling ZHENG
3
;
Jinhua CHEN
1
;
Wenjuan ZHOU
4
Author Information
1. Follow-up Center, Fujian Medical University Union Hospital, Fuzhou 350001, China.
2. Fuzhou Zhizhen Medical Technology Co., Ltd., Fuzhou 350200, China.
3. Department of Public Health, Fujian Medical University Union Hospital, Fuzhou 350001, China.
4. Department of Laboratory Medicine, Fujian Medical University Union Hospital, Fuzhou 350001, China.
- Publication Type:CLINICALRESEARCH
- Keywords:
Machine learning;
Chronic hepatitis C;
Liver cancer;
Prediction model
- From:
Cancer Research on Prevention and Treatment
2024;51(12):1015-1020
- CountryChina
- Language:Chinese
-
Abstract:
Objective To construct a risk prediction model for liver cancer in patients with chronic hepatitis C based on seven different machine learning algorithms and select the optimal model. Methods A total of 236 patients with chronic hepatitis C were selected as the research subjects. Patients were divided into a case group and a control group according to whether liver cancer occurs. Prediction models were constructed based on seven machine learning algorithms including classification and regression tree, random forest, gradient boosting decision tree, extreme gradient boosting (XGBoost), logistic regression, K-near neighbor, and support vector machine. The Shapley additive explanations (SHAP) algorithm was used to interpret the best prediction model. Results Among the seven models, the XGBoost model had the best comprehensive prediction performance (accuracy of 0.933, sensitivity of 0.775, specificity of 0.960, area under the ROC curve of 0.956, F1 score of 0.764). The SHAP algorithm suggested that AFP, age, AST, diabetes, BMI, PLT, ALT, liver cysts, FIB-4, and gender contributed to the model decision and are the risk factors for liver cancer in patients with chronic hepatitis C. Conclusion This study develops an interpretable machine learning model based on the XGBoost algorithm, which has a good reference value for individualized monitoring of liver cancer in patients with chronic hepatitis C.