1.Genome-wide association study based risk prediction model in predicting lung cancer risk in Chinese.
Meng ZHU ; Yang CHENG ; Juncheng DAI ; Lan XIE ; Guangfu JIN ; Hongxia MA ; Zhibin HU ; Yongyong SHI ; Dongxin LIN ; Hongbing SHEN ; Email: HBSHEN@NJMU.EDU.CN.
Chinese Journal of Epidemiology 2015;36(10):1047-1052
OBJECTIVETo evaluate the predictive power of risk model by combining traditional epidemiological factors and genetic factors.
METHODSOur previous GWAS data of lung cancer in Chinese were used in training set (Nanjing and Shanghai: 1473 cases vs. 1962 control) and testing set (Beijing and Wuhan: 858 cases vs. 1 115 control). All the single nucleotide polymorphisms (SNPs) associated with lung cancer risk were systematically selected and stepwise logistic regression analysis was used to select independent factors in the training set. The wGRS (weighted genetic score) was further used to calculate genetic risk score. To evaluate the contribution of the genetic factors, 3 risk models were established by using the training set, i.e. smoking model (based on smoking status) , genetic risk model (based on genetic risk score) and combined model (based on smoke and genetic risk score). The predictability of the models were evaluated by the areas under the receiver operating characteristic (ROC) curves, area under curve (AUC), net reclassification improvement (NRI) and integrated discrimination index (IDI). Besides, the results were further verified in the testing set.
RESULTSIn the training set, it was found that the AUC of the smoking, genetic risk and combined models were 0.65 (0.63-0.66), 0.60 (0.59-0.62) and 0.69 (0.67-0.71), respectively. Compared with combined model, the predictive power of other two models significantly declined, the difference was statistically significant (P<0.001). Furthermore, compared with the smoking model, the NRI of the combined model increased by 4.57% (2.23%-6.91%) and IDI increased by 3.11% (2.52%-3.69%) in the training set, the difference was statistically significant (P<0.001). Similarly, in the testing set NRI increased by 2.77%, the difference was not statistically significant (P=0.069) , and IDI increased by 3.16%, the difference was statistically significant (P<0.001).
CONCLUSIONThis study showed that combining 14 genetic variants with traditional epidemiological factors could improve the predictive power of risk model for lung cancer. The model could be used in the screening of high-risk population of lung cancer in Chinese and provide evidence for the early diagnosis and treatment of lung cancer.
Area Under Curve ; Asian Continental Ancestry Group ; Beijing ; Case-Control Studies ; China ; Genetic Predisposition to Disease ; Genetic Variation ; Genome-Wide Association Study ; Humans ; Lung Neoplasms ; epidemiology ; genetics ; Polymorphism, Single Nucleotide ; ROC Curve ; Risk Factors