Score tests for association between traits and haplotypes when linkage phase is ambiguous.

A key step toward the discovery of a gene related to a trait is the finding of an association between the trait and one or more haplotypes. Haplotype analyses can also provide critical information regarding the function of a gene; however, when unrelated subjects are sampled, haplotypes are often ambiguous because the linkage phase of the measured sites along a chromosome is unknown. A popular method of accounting for this ambiguity in case-control studies uses a likelihood that depends on haplotype frequencies, so that the haplotype frequencies can be compared between cases and controls. This traditional method, however, is limited to a binary trait (case vs. control), and it does not provide a way to test the statistical significance of specific haplotypes. To address these limitations, we developed new methods of testing the statistical association between haplotypes and a wide variety of traits, including binary, ordinal, and quantitative traits. Our methods allow adjustment for nongenetic covariates, which may be critical when analyzing genetically complex traits. Furthermore, our methods provide several different global tests for association, as well as haplotype-specific tests, which are advantageous in attempts to understand the roles of many different haplotypes. The statistics can be computed rapidly, making it feasible to evaluate the associations between many haplotypes and a trait. To illustrate the use of our new methods, we apply them to a study of the association of haplotypes (composed of genes from the human-leukocyte-antigen complex) with humoral immune response to measles vaccination. Limited simulations are also presented to demonstrate the validity of our methods and to provide guidelines on how they could be used.
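As background for the phase ambiguity described above, the following is a minimal sketch of the standard expectation-maximization (EM) approach to estimating haplotype frequencies from unphased genotypes. It is not the score tests developed in this paper. It assumes two biallelic loci, Hardy-Weinberg equilibrium, and genotypes coded as counts (0, 1, or 2) of one allele at each locus; the function name `em_haplotype_freqs` and its parameters are hypothetical and chosen only for illustration.

```python
def em_haplotype_freqs(genotypes, n_iter=200, tol=1e-10):
    """Estimate frequencies of the four two-locus haplotypes by EM.

    genotypes: list of (g1, g2) tuples, where g1 and g2 are the counts
    (0, 1, or 2) of the '1' allele at locus 1 and locus 2.
    Assumes Hardy-Weinberg equilibrium (an assumption of this sketch).
    """
    haps = [(0, 0), (0, 1), (1, 0), (1, 1)]
    freqs = {h: 0.25 for h in haps}  # uniform starting values

    def compatible_pairs(g):
        # All ordered haplotype pairs whose allele counts match genotype g.
        return [(h1, h2) for h1 in haps for h2 in haps
                if h1[0] + h2[0] == g[0] and h1[1] + h2[1] == g[1]]

    for _ in range(n_iter):
        counts = {h: 0.0 for h in haps}
        # E-step: distribute each subject's two haplotypes over the
        # compatible pairs, weighted by their posterior probabilities.
        for g in genotypes:
            pairs = compatible_pairs(g)
            weights = [freqs[h1] * freqs[h2] for h1, h2 in pairs]
            total = sum(weights)
            for (h1, h2), w in zip(pairs, weights):
                counts[h1] += w / total
                counts[h2] += w / total
        # M-step: expected haplotype counts divided by 2N chromosomes.
        new = {h: counts[h] / (2 * len(genotypes)) for h in haps}
        delta = max(abs(new[h] - freqs[h]) for h in haps)
        freqs = new
        if delta < tol:
            break
    return freqs


# Example: only the double heterozygotes (1, 1) are phase-ambiguous.
sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
estimates = em_haplotype_freqs(sample)
```

In this toy sample the unambiguous homozygotes carry only the (0, 0) and (1, 1) haplotypes, so the EM iterations assign the double heterozygotes almost entirely to that phase. With more loci the number of compatible haplotype pairs grows quickly, which is one reason rapidly computed statistics matter.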
