Machine learning-based predictive model for severe pneumonia in children
10.3760/cma.j.cn112150-20250126-00076
- VernacularTitle:基于机器学习的儿童重症肺炎预警模型的研究
- Author:
Qing DU
1
;
Mingzhao HUANG
;
Ying LI
;
Kai CHEN
;
Lianting HU
;
Chao XIONG
;
Xiaoxia LU
Author Information
1. 华中科技大学同济医学院附属武汉儿童医院呼吸内科,武汉 430016
- Publication Type:Journal Article
- Keywords:
Severe pneumonia;
Machine learning;
Predictive modeling
- From:
Chinese Journal of Preventive Medicine
2025;59(10):1716-1724
- CountryChina
- Language:Chinese
-
Abstract:
Objective:To develop and validate a clinical warning model for severe pediatric community-acquired pneumonia (CAP) using electronic health records.Methods:A retrospective cohort study was conducted, analyzing clinical data of 15 750 children hospitalized for CAP at Wuhan Children′s Hospital between January 1, 2019, and December 31, 2023. Patient data were randomly split into training and testing sets at a 7∶3 ratio. Six supervised machine learning models were constructed in the training set, optimized using five-fold cross-validation, and evaluated in the testing set. Model performance was assessed using ROC-AUC, sensitivity, specificity, positive predictive value, negative predictive value, calibration curves, and clinical decision curve analysis at optimal thresholds. The best-performing model was selected, and SHapley Additive exPlanations (SHAP) were used to interpret feature importance. A program interface was developed based on the model results, enabling integration into clinical decision support systems for automated early warning.Results:A total of 15 750 participants, ranging in age from 28 days to 18 years, were included in the study. The median age was 2 years [interquartile range (IQR): 0-4 years], with 9 555 males (60.67%) and 6 195 females (39.33%). Among them, 2 211 (14.04%) developed severe pneumonia. In the prediction models, XGB outperformed other models with an ROC-AUC of 0.884 (95% CI: 0.870-0.898), sensitivity (0.803, 95% CI: 0.772-0.832), specificity (0.828, 95% CI: 0.816-0.839). Calibration analysis showed strong agreement between predicted and observed risks (Brier score: 0.081, 95% CI: 0.075-0.086). The analysis based on the SHAP method revealed that respiratory rate, heart rate, T-lymphocyte subsets, and red blood cell volume distribution width-SD are predictive factors for severe progression of community-acquired pneumonia (CAP) in children. Conclusion:An interpretable machine learning model was developed for the early detection and personalized treatment planning of severe CAP in children, providing valuable support to clinicians.