1. Stacking Ensemble Technique for Classifying Breast Cancer
Hyunjin KWON ; Jinhyeok PARK ; Youngho LEE
Healthcare Informatics Research 2019;25(4):283-288
OBJECTIVES: Breast cancer is the second most common cancer among Korean women. Because breast cancer is strongly associated with negative emotional and physical changes, early detection and treatment are very important. To support breast cancer classification, we sought to identify the best meta-learner model in a stacking ensemble when the same machine learning models are used for both the base learners and the meta-learner. METHODS: We used machine learning models such as the gradient boosted model (GBM), distributed random forest, generalized linear model (GLM), and deep neural network in a stacking ensemble. These models were used to construct the base learners, and each of them was then used in turn as the meta-learner. We compared the performance of the models in the meta-learner role to determine the best meta-learner model in the stacking ensemble. RESULTS: Using the GBM as the meta-learner led to higher accuracy than that achieved with any other model for breast cancer data, and using the GLM as the meta-learner led to low root-mean-squared error for both sets of breast cancer data. CONCLUSIONS: We compared the performance of every meta-learner model in a stacking ensemble as a supporting tool for classifying breast cancer. The study showed that using specific models as the meta-learner resulted in better performance than single classifiers, and that using the GBM or the GLM as the meta-learner is appropriate in a supporting tool for classifying breast cancer data.
Breast Neoplasms; Breast; Classification; Female; Forests; Humans; Linear Models; Machine Learning; Medical Informatics; Statistics as Topic
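
The comparison described in the abstract (one set of models used as base learners, then each of them tried in turn as the meta-learner) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses scikit-learn rather than whatever toolkit the study used, substitutes scikit-learn's bundled Wisconsin diagnostic breast cancer dataset for the study's data, stands in logistic regression for the GLM and a small multilayer perceptron for the deep neural network, and computes RMSE on predicted probabilities. The helper names make_models and compare_meta_learners, the hyperparameters, and the 70/30 split are all hypothetical choices made for the example.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict, train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def make_models(seed):
    """Return fresh, unfitted instances of the four model families.

    Assumption: these scikit-learn estimators are stand-ins for the
    model families named in the abstract, not the study's own models.
    """
    return {
        "GBM": GradientBoostingClassifier(n_estimators=50, random_state=seed),
        "RF": RandomForestClassifier(n_estimators=50, random_state=seed),
        "GLM": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
        "NN": make_pipeline(
            StandardScaler(),
            MLPClassifier(hidden_layer_sizes=(16,), max_iter=1000, random_state=seed),
        ),
    }


def compare_meta_learners(seed=0):
    """Stack four base learners, then try each model family as meta-learner.

    Returns a dict mapping meta-learner name to (accuracy, rmse) on a
    held-out test split.
    """
    X, y = load_breast_cancer(return_X_y=True)
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed
    )

    # Level 0: out-of-fold probabilities become the meta-learner's training
    # features, so the meta-learner never sees a prediction made on a row
    # that the base learner was fitted on.
    train_cols, test_cols = [], []
    for model in make_models(seed).values():
        oof = cross_val_predict(model, X_tr, y_tr, cv=5, method="predict_proba")
        train_cols.append(oof[:, 1])
        # Refit on the full training split to produce test-time features.
        test_cols.append(model.fit(X_tr, y_tr).predict_proba(X_te)[:, 1])
    Z_tr = np.column_stack(train_cols)
    Z_te = np.column_stack(test_cols)

    # Level 1: fit each model family as the meta-learner on the stacked
    # base-learner probabilities and score it on the test split.
    results = {}
    for name, meta in make_models(seed).items():
        proba = meta.fit(Z_tr, y_tr).predict_proba(Z_te)[:, 1]
        accuracy = float(np.mean((proba >= 0.5) == y_te))
        rmse = float(np.sqrt(np.mean((proba - y_te) ** 2)))
        results[name] = (accuracy, rmse)
    return results


if __name__ == "__main__":
    for name, (accuracy, rmse) in compare_meta_learners().items():
        print(f"{name} as meta-learner: accuracy={accuracy:.3f}, rmse={rmse:.3f}")
```

Building the meta-features from out-of-fold predictions is a common way to keep the meta-learner from simply learning the base learners' training-set overfit. The numbers this sketch prints depend on the split, seed, and stand-in models, so they should not be read as reproducing the study's reported results.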