Establishment and evaluation of a machine learning prediction model for sepsis-related encephalopathy in the elderly.
10.3760/cma.j.cn121430-20250227-00157
- Author:
Xiao YUE
1
;
Yiwen WANG
2
;
Zhifang LI
1
;
Lei WANG
1
;
Li HUANG
1
;
Shuo WANG
1
;
Yiming HOU
1
;
Shu ZHANG
1
;
Zhengbin WANG
1
Author Information
1. Department of Emergency Medicine, the First Affiliated Hospital of Zhengzhou University, Zhengzhou 450052, China.
2. Department of Rehabilitation Medicine, the Third Affiliated Hospital of Zhengzhou University, Zhengzhou 450052, China. Corresponding author: Wang Zhengbin, Email: fccwangzb2@zzu.edu.cn.
- Publication Type:Journal Article
- MeSH:
Humans;
Machine Learning;
Aged;
Sepsis-Associated Encephalopathy;
Sepsis/complications*;
Intensive Care Units;
Logistic Models;
Middle Aged;
Male;
ROC Curve;
Female;
Bayes Theorem;
Nomograms;
Support Vector Machine;
Algorithms
- From:
Chinese Critical Care Medicine
2025;37(10):937-943
- CountryChina
- Language:Chinese
-
Abstract:
OBJECTIVE:To construct machine learning prediction model for sepsis-associated encephalopathy (SAE), and analyze the application value of the model on early identification of SAE risk in elderly septic patients.
METHODS:Patients aged over 60 years with a primary diagnosis of sepsis admitted to intensive care unit (ICU) from 2008 to 2023 were selected from Medical Information Mart for Intensive Care-IV 2.2 (MIMIC-IV 2.2). Demographic variables, disease severity scores, comorbidities, interventions, laboratory indicators, and hospitalization details were collected. Key factors associated with SAE were identified using univariate Logistic regression analysis. The data were randomly divided into training and validation sets in a 7 : 3 ratio. Multivariable Logistic regression analysis was conducted in the training set and visualized using a nomogram model for prediction of SAE. The discrimination of the model was evaluated in the validation set using the receiver operator characteristic curve (ROC curve), and its calibration was assessed using calibration curve. Furthermore, multiple machine learning algorithms, including multi-layer perceptron (MLP), support vector machine (SVM), naive bayes (NB), gradient boosting machine (GBM), random forest (RF), and extreme gradient boosting (XGB), were constructed in the training set. Their predictive performance was subsequently evaluated on the validation set. Taking the XGB model as an example, the interpretability of the model through the SHapley Additive exPlanations (SHAP) algorithm was enhanced to identify the key predictive factors and their contributions.
RESULTS:A total of 2 204 septic patients were finally enrolled, of whom 840 developed SAE (38.1%). A total of 21 variables associated with SAE were screened through univariate Logistic regression analysis. Multivariable Logistic regression analysis showed that endotracheal intubation [odds ratio (OR) = 0.40, 95% confidence interval (95%CI) was 0.19-0.88, P < 0.001], oxygen therapy (OR = 0.76, 95%CI was 0.53-0.95, P = 0.023), tracheotomy (OR = 0.20, 95%CI was 0.07-0.53, P < 0.001), continuous renal replacement therapy (CRRT; OR = 0.32, 95%CI was 0.15-0.70, P < 0.001), cerebrovascular disease (OR = 0.31, 95%CI was 0.16-0.60, P < 0.001), rheumatic disease (OR = 0.44, 95%CI was 0.19-0.99, P < 0.001), male (OR = 0.68, 95%CI was 0.54-0.86, P = 0.001), and maximum anion gap (AG; OR = 0.95, 95%CI was 0.93-0.97, P < 0.001) were associated with an decreased probability of SAE, and age (OR = 1.05, 95%CI was 1.03-1.06, P < 0.001), acute physiology score III (APSIII; OR = 1.02, 95%CI was 1.01-1.02, P < 0.001), Oxford acute severity of illness score (OASIS; OR = 1.04, 95%CI was 1.03-1.06, P < 0.001), and length of hospital stay (OR = 1.01, 95%CI was 1.01-1.02, P < 0.001) were associated with an increased probability of SAE. A nomogram model was constructed based on these variables. In the validation set, ROC curve analysis showed that the model achieved an area under the ROC curve (AUC) of 0.723, and the calibration curve showed good consistency between the predicted probability of the model and the observed probability. Among the machine learning algorithms, including MLP, SVM, NB, GBM, RF, and XGB, the SVM model and RF model demonstrated relatively good predictive performance, with AUC of 0.748 and 0.739, respectively, and the sensitivity was both exceeding 85%. The predictive performance of the XGB model was explained through SHAP analysis, and the results indicated that APSIII score (SHAP value was 0.871), age (SHAP value was 0.521), and OASIS score (SHAP value was 0.443) were important factors affecting the predictive performance of the model.
CONCLUSIONS:The machine learning-based SAE prediction model exhibits good predictive capability and holds significant application value for the early identification of SAE risk in elderly septic patients.