论文信息 - Testing Untyped Alleles (TUNA)—applications to genome‐wide association studies

Testing Untyped Alleles (TUNA)—applications to genome‐wide association studies

The large number of tests performed in analyzing data from genome‐wide association studies has a large impact on the power of detecting risk variants, and analytic strategies specifying the optimal set of hypotheses to be tested are necessary. We propose a genome‐wide strategy that is based on one degree of freedom tests for all the genotyped variants, and for all the untyped variants for which there is sufficient information in the observed data. The set of untyped variants to be tested is found using multi‐locus measures of linkage disequilibrium and haplotype frequencies from a reference database such as HapMap (The International HapMap Consortium [2003] Nature 426:789–796). We introduce a novel statistic for testing differences in allele frequencies for untyped variation that is based on linear combinations of estimable haplotype frequencies. Algorithms for finding the sets of genotyped markers to be used in testing an untyped allele, and ways of incorporating haplotypes observed in the study data but not in the reference database are also described. The proposed testing strategy can be used as the first step in the analysis of genome‐wide association data, and, because every performed test is directed to a marker, it can be used to specify the set of polymorphisms to genotype in follow‐up studies. The described methodology provides also a tool for joint analysis of data from studies done on different platforms. Genet. Epidemiol. 2006.© 2006 Wiley‐Liss, Inc.

Dan L Nicolae | D. Nicolae

[1] H. Akaike. A Bayesian analysis of the minimum AIC procedure , 1978 .

[2] Nicole Soranzo,et al. A single-nucleotide polymorphism tagging set for human drug metabolism and transport , 2005, Nature Genetics.

[3] M. Stephens,et al. Accounting for Decay of Linkage Disequilibrium in Haplotype Inference and Missing-data Imputation , 2022 .

[4] Eric Boerwinkle,et al. Determinants of the success of whole-genome association testing. , 2005, Genome research.

[5] Xiaoquan Wen,et al. Coverage and Characteristics of the Affymetrix GeneChip Human Mapping 100K SNP Set , 2006, PLoS genetics.

[6] S. Gabriel,et al. Efficiency and power in genetic association studies , 2005, Nature Genetics.

[7] Daniel O Stram,et al. Tag SNP selection for association studies , 2004, Genetic epidemiology.

[8] Paul Scheet,et al. A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. , 2006, American journal of human genetics.

[9] J. Long,et al. An E-M algorithm and testing strategy for multiple-locus haplotypes. , 1995, American journal of human genetics.

[10] Nicholas W Wood,et al. Genome scans and candidate gene approaches in the study of common diseases and variable drug responses. , 2003, Trends in genetics : TIG.

[11] Juliet M Chapman,et al. Detecting Disease Associations due to Linkage Disequilibrium Using Haplotype Tags: A Class of Tests and the Determinants of Statistical Power , 2003, Human Heredity.