1.Application of Data Mining Technology in Risk Prediction Model for Lung Cancer
Zibo GAO ; Di LI ; Shuyin DUAN ; Xiaolei ZHOU ; Hong LIU ; Jing WANG ; Wei WANG ; Yongjun WU
Cancer Research on Prevention and Treatment 2021;48(5):479-483
Objective To establish a lung cancer risk prediction model using data mining technology and compare the performance of decision tree C5.0 and artificial neural networks in the application of risk prediction model, and to explore the value of data mining techniques in lung cancer risk prediction. Methods We collected the data of 180 patients with lung cancer and 240 patients with benign lung lesion which contained 17 variables of risk factors and clinical symptoms. Decision tree C5.0 and artificial neural networks models were established to compare the prediction performance. Results There were 420 valid samples collected in total and proportioned with the ratio of 7:3 for the training set and testing set. The accuracy, sensitivity, specificity, Youden index, positive predictive value, negative predictive value and AUC of artificial neural networks model were 65.3%, 61.7%, 73.3%, 0.350, 54.9%, 73.1% and 0.675 (95%