Interpretation of Clinical Data Based on C4.5 Algorithm for the Diagnosis of Coronary Heart Disease.
10.4258/hir.2016.22.3.186
- Author:
Wiharto WIHARTO
1
;
Hari KUSNANTO
;
Herianto HERIANTO
Author Information
1. Department of Informatic, Sebelas Maret University, Surakarta, Indonesia. wiharto@staff.uns.ac.id
- Publication Type:Original Article
- Keywords:
Heart Diseases;
Algorithms;
Diagnosis;
Data Mining;
Decision Tree
- MeSH:
Area Under Curve;
Classification;
Coronary Disease*;
Data Mining;
Decision Trees;
Diagnosis*;
Heart Diseases;
Incidence;
Sensitivity and Specificity
- From:Healthcare Informatics Research
2016;22(3):186-195
- CountryRepublic of Korea
- Language:English
-
Abstract:
OBJECTIVES: The interpretation of clinical data for the diagnosis of coronary heart disease can be done using algorithms in data mining. Most clinical data interpretation systems for diagnosis developed using data mining algorithms with a black-box approach cannot recognize examination attribute relationships with the incidence of coronary heart disease. METHODS: This study proposes a system to interpretation clinical examination results for the diagnosis of coronary heart disease based the decision tree algorithm. This system comprises several stages. First, oversampling is carried out by a combination of the synthetic minority oversampling technique (SMOTE), feature selection, and the C4.5 classification algorithm. System testing is done using k-fold cross-validation. The performance parameters are sensitivity, specificity, positive prediction value (PPV), negative prediction value (NPV) and the area under the curve (AUC). RESULTS: The results showed that the performance of the system has a sensitivity of 74.7%, a specificity of 93.7%, a PPV of 74.2%, an NPV of 93.7%, and an AUC of 84.2%. CONCLUSIONS: This study demonstrated that, by using C4.5 algorithms, data can be interpreted in the form of a decision tree, to aid the understanding of the clinician. In addition, the proposed system can provide better performance by category.