1.How Many SNPs Should Be Used for the Human Phylogeny of Highly Related Ethnicities? A Case of Pan Asian 63 Ethnicities.
Hoyoung GHANG ; Youngjoo HAN ; Sangjin JEONG ; Jong BHAK ; Sunghoon LEE ; Tae Hyung KIM ; Chulhong KIM ; Sangsoo KIM ; Fahd AL-MULLA ; Chan Hyun YOUN ; Hyang Sook YOO
Genomics & Informatics 2011;9(4):181-188
In planning a model-based phylogenic study for highly related ethnic data, the SNP marker number is an important factor to determine for relationship inferences. Genotype frequency data, utilizing a sub sampling method, from 63 Pan Asian ethnic groups was used for determining the minimum SNP number required to establish such relationships. Bootstrap random sub-samplings were done from 5.6K PASNPi SNP data. DA distance was calculated and neighbour-joining trees were drawn with every re-sampling data set. Consensus trees were made with the same 100 sub-samples and bootstrap proportions were calculated. The tree consistency to the one obtained from the whole marker set, improved with increasing marker numbers. The bootstrap proportions became reliable when more than 7,000 SNPs were used at a time. Within highly related ethnic groups, the minimum SNPs number for a robust neighbor-joining tree inference was about 7,000 for a 95% bootstrap support.
Asian Continental Ancestry Group
;
Consensus
;
Ethnic Groups
;
Genotype
;
Humans
;
Phylogeny
;
Polymorphism, Single Nucleotide