Prediction of Cardiovascular Complication in Patients with Newly Diagnosed Type 2 Diabetes Using an XGBoost/GRU-ODE-Bayes-Based Machine-Learning Algorithm
- Author:
Joonyub LEE
1
;
Yera CHOI
;
Taehoon KO
;
Kanghyuck LEE
;
Juyoung SHIN
;
Hun-Sung KIM
Author Information
- Publication Type:Original Article
- From:Endocrinology and Metabolism 2024;39(1):176-185
- CountryRepublic of Korea
- Language:English
-
Abstract:
Background:Cardiovascular disease is life-threatening yet preventable for patients with type 2 diabetes mellitus (T2DM). Because each patient with T2DM has a different risk of developing cardiovascular complications, the accurate stratification of cardiovascular risk is critical. In this study, we proposed cardiovascular risk engines based on machine-learning algorithms for newly diagnosed T2DM patients in Korea.
Methods:To develop the machine-learning-based cardiovascular disease engines, we retrospectively analyzed 26,166 newly diagnosed T2DM patients who visited Seoul St. Mary’s Hospital between July 2009 and April 2019. To accurately measure diabetes-related cardiovascular events, we designed a buffer (1 year), an observation (1 year), and an outcome period (5 years). The entire dataset was split into training and testing sets in an 8:2 ratio, and this procedure was repeated 100 times. The area under the receiver operating characteristic curve (AUROC) was calculated by 10-fold cross-validation on the training dataset.
Results:The machine-learning-based risk engines (AUROC XGBoost=0.781±0.014 and AUROC gated recurrent unit [GRU]-ordinary differential equation [ODE]-Bayes=0.812±0.016) outperformed the conventional regression-based model (AUROC=0.723± 0.036).
Conclusion:GRU-ODE-Bayes-based cardiovascular risk engine is highly accurate, easily applicable, and can provide valuable information for the individualized treatment of Korean patients with newly diagnosed T2DM.