Gene-based principal component logistic regression model and its application on genome-wide association study
10.3760/cma.j.issn.0254-6450.2012.06.018
- VernacularTitle:基于基因水平主成分logistic回归模型在全基因组关联研究中的应用
- Author:
Hong-Gang YI
1
;
Hong-Mei WO
;
Yang ZHAO
;
Ru-Yang ZHANG
;
Jian-Ling BAI
;
Yong-Yue WEI
;
Feng CHEN
Author Information
1. 南京医科大学
- Keywords:
Principal component analysis;
Logistic regression;
Genome-wide association study;
Association
- From:
Chinese Journal of Epidemiology
2012;33(6):622-625
- CountryChina
- Language:Chinese
-
Abstract:
To explore the gene-based principal component logistic regression model and its application in genome-wide association study.Using the simulated genome-wide single nucleotide polymorphism (SNPs) genotypes data,we proposed a practical statistical analysis strategy-'the principal component logistic regression model',based on the gene levels to assess the association between genetic variations and complex diseases.The simulation results showed that the P value of genes in related diseases was the smallest among the results from all the genes.The results of simulation indicated that not only it could reduce the degree of freedom through hypothesis testing but could also better understand the correlations between SNPs.The gene-based principal component logistic regression model seemed to have certain statistical power for testing the association between genetic genes and diseases in the genome-wide association studies.