Evolutionary‐based association analysis using haplotype data

Association studies, both family-based and population-based, can be powerful means of detecting disease-liability alleles. To increase the information of the test, various researchers have proposed targeting haplotypes. The larger number of haplotypes, however, relative to alleles at individual loci, could decrease power because of the additional degrees of freedom required for the test. An optimal strategy would focus the test on particular haplotypes or groups of haplotypes, much as is done with cladistic-based association analysis. First suggested by Templeton et al. ([1987] Genetics 117:343-351), such analyses use the evolutionary relationships among haplotypes to produce a limited set of hypothesis tests and to increase the interpretability of these tests. To more fully utilize the information contained in the evolutionary relationships among haplotypes and in the sample, we propose generalized linear models (GLM) for the analysis of data from family-based and population-based studies. These models fully account for haplotype phase ambiguity and allow for covariates. The models are encoded into a software package (the Evolutionary-Based Haplotype Analysis Package, EHAP), which also provides for various kinds of exploratory data analysis. The exploratory analyses, such as error checking, estimation of haplotype frequencies, and tools for building cladograms, should facilitate the implementation of cladistic-based association analysis with haplotypes.

[1]  G. Abecasis,et al.  A general test of association for quantitative traits in nuclear families. , 2000, American journal of human genetics.

[2]  K. Roeder,et al.  A Semiparametric Mixture Approach to Case-Control Studies with Errors in Covariables , 1996 .

[3]  E J Hollox,et al.  Lactase haplotype diversity in the Old World. , 2001, American journal of human genetics.

[4]  K. Roeder,et al.  Transmission/disequilibrium test meets measured haplotype analysis: family-based association analysis guided by evolution of haplotypes. , 2001, American journal of human genetics.

[5]  M Farrall,et al.  Measured haplotype analysis of the angiotensin-I converting enzyme gene. , 1998, Human molecular genetics.

[6]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[7]  C. Sing,et al.  A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. IV. Nested analyses with cladogram uncertainty and recombination. , 1993, Genetics.

[8]  D. Clayton,et al.  Transmission/disequilibrium tests for extended marker haplotypes. , 1999, American journal of human genetics.

[9]  N. Breslow,et al.  Statistical methods in cancer research: volume 1- The analysis of case-control studies , 1980 .

[10]  C. Sing,et al.  A cladistic analysis of phenotype associations with haplotypes inferred from restriction endonuclease mapping. II. The analysis of natural populations. , 1988, Genetics.

[11]  C. Sing,et al.  A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping and DNA sequence data. III. Cladogram estimation. , 1992, Genetics.

[12]  M. Slatkin,et al.  Estimating the age of alleles by use of intraallelic variability. , 1997, American journal of human genetics.

[13]  P. Sham Statistics in human genetics , 1997 .

[14]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[15]  E. Boerwinkle,et al.  A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. I. Basic theory and an analysis of alcohol dehydrogenase activity in Drosophila. , 1987, Genetics.

[16]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[17]  W. Ewens,et al.  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). , 1993, American journal of human genetics.

[18]  H. Bandelt,et al.  Median-joining networks for inferring intraspecific phylogenies. , 1999, Molecular biology and evolution.

[19]  M. Spence,et al.  Analysis of human genetic linkage , 1986 .

[20]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[21]  A. Templeton,et al.  A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping or DNA sequencing. V. Analysis of case/control sampling designs: Alzheimer's disease and the apoprotein E locus. , 1995, Genetics.

[22]  J. O’Connell Zero‐recombinant haplotyping: Applications to fine mapping using SNPs , 2000, Genetic epidemiology.

[23]  D. Schaid,et al.  Score tests for association between traits and haplotypes when linkage phase is ambiguous. , 2002, American journal of human genetics.

[24]  Siavash Ghaffari,et al.  A candidate prostate cancer susceptibility gene at chromosome 17p , 2001, Nature Genetics.

[25]  Zhaohui S. Qin,et al.  Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. , 2002, American journal of human genetics.

[26]  D. Clayton,et al.  A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission. , 1999, American journal of human genetics.

[27]  M. Waterman,et al.  A dynamic programming algorithm for haplotype block partitioning , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  D J Schaid,et al.  Evaluation of candidate genes in case-control studies: a statistical method to account for related subjects. , 2001, American journal of human genetics.

[29]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[30]  M. Plummer,et al.  International agency for research on cancer. , 2020, Archives of pathology.

[31]  C. Sing,et al.  Association between common alleles of the low-density lipoprotein receptor gene region and interindividual variation in plasma lipid and apolipoprotein levels in a population-based sample from Rochester, Minnesota , 1996, Human Genetics.

[32]  K. Crandall,et al.  Empirical tests of some predictions from coalescent theory with applications to intraspecific phylogeny reconstruction. , 1993, Genetics.

[33]  G. Lathrop,et al.  High-resolution genetic mapping of the ACE-linked QTL influencing circulating ACE activity , 2002, European Journal of Human Genetics.

[34]  E. Boerwinkle,et al.  Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. , 1998, American journal of human genetics.

[35]  K. Roeder,et al.  The power of genomic control. , 2000, American journal of human genetics.

[36]  J. Kingman On the genealogy of large populations , 1982, Journal of Applied Probability.