Current approaches and challenges in addressing class imbalance in medical prediction models
10.3760/cma.j.cn112338-20250109-00025
- VernacularTitle:医学预测模型中类不平衡问题处理策略的现状与挑战
- Author:
Xianglong MENG
1
;
Yutong WANG
;
Xin ZHANG
;
Siyan ZHAN
;
Shengfeng WANG
Author Information
1. 北京大学公共卫生学院流行病与卫生统计学系,北京 100191
- Publication Type:Journal Article
- Keywords:
Class imbalance;
Prediction model;
Sampling;
Cost-sensitive learning;
Generative adversarial networks;
Transfer learning
- From:
Chinese Journal of Epidemiology
2025;46(9):1632-1639
- CountryChina
- Language:Chinese
-
Abstract:
With the rise of personalized medicine and the rapid development of big data technology, medical prediction models have become increasingly important in disease diagnosis, prognosis assessment, and risk stratification. However, class imbalance is a common problem in medical data, which can result in models being overly trained toward the majority class rather than the minority class, influencing the detection power and clinical application value. This paper systematically summarizes traditional methods in addressing class imbalance, including data pre-processing and algorithm level strategies, and introduces the applications of new technologies such as generative adversarial networks and transfer learning and suggests key considerations and potential research focus for addressing class imbalance to provide reference for researchers to select appropriate strategies.