On the use of haplotype phylogeny to detect disease susceptibility loci

Background: The cladistic approach proposed by Templeton has been presented as promising for the study of the genetic factors involved in common diseases. This approach allows the joint study of multiple markers within a gene by considering haplotypes and grouping them in nested clades. The idea is to search for clades with an excess of cases compared to the whole sample and to identify the mutations defining these clades as candidate disease susceptibility sites. However, the performance of this approach for the study of the genetic factors involved in complex diseases has never been studied.

Results: In this paper, we propose a new method to perform such a cladistic analysis and we estimate its power through simulations. We show that under models where susceptibility to the disease is caused by a single genetic variant, the cladistic test is neither substantially more powerful at detecting an association nor substantially more efficient at localizing the susceptibility site than individual SNP testing. However, when two interacting sites are responsible for the disease, the cladistic analysis greatly improves the probability of finding both susceptibility sites. The impact of linkage disequilibrium and of the tree characteristics on the efficiency of the cladistic analysis is also discussed. An application to a real data set concerning the CARD15 gene and Crohn's disease shows that the method can successfully identify the three variant sites that are involved in disease susceptibility.

Conclusion: The use of phylogenies to group haplotypes is especially useful for pinpointing, among the different markers identified within a gene, the sites that are likely to be involved in disease susceptibility.
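The core idea described in the Background, looking for clades that carry an excess of cases relative to the whole sample, can be illustrated with a minimal Python sketch. This is not the method proposed in the paper, whose details are not given here: the function names, the example haplotype counts, the clade definitions, and the use of an uncorrected Pearson chi-square on a 2x2 table (clade versus rest, cases versus controls) are all assumptions made for illustration only.

```python
def clade_chi2(cases_in, controls_in, cases_total, controls_total):
    """Pearson chi-square (1 df) for the 2x2 table: clade vs. rest, cases vs. controls."""
    a, b = cases_in, controls_in
    c, d = cases_total - cases_in, controls_total - controls_in
    n = a + b + c + d
    margins = (a + b) * (c + d) * (a + c) * (b + d)
    if margins == 0:
        return 0.0  # degenerate table: no information
    return n * (a * d - b * c) ** 2 / margins


def clades_with_case_excess(haplotype_counts, clades, threshold=3.84):
    """Return (clade, chi2) pairs for clades enriched in cases.

    haplotype_counts: {haplotype: (n_case_copies, n_control_copies)}
    clades: {clade_name: set of haplotypes}; clades may be nested.
    threshold: 3.84 is the 5% critical value for 1 df, with no
               correction for multiple testing (an illustrative choice).
    """
    cases_total = sum(c for c, _ in haplotype_counts.values())
    controls_total = sum(k for _, k in haplotype_counts.values())
    overall_case_fraction = cases_total / (cases_total + controls_total)

    hits = []
    for name, members in clades.items():
        cases_in = sum(haplotype_counts[h][0] for h in members)
        controls_in = sum(haplotype_counts[h][1] for h in members)
        if cases_in + controls_in == 0:
            continue  # empty clade
        case_fraction = cases_in / (cases_in + controls_in)
        chi2 = clade_chi2(cases_in, controls_in, cases_total, controls_total)
        # Keep only clades with an excess (not a deficit) of cases.
        if case_fraction > overall_case_fraction and chi2 > threshold:
            hits.append((name, chi2))
    return sorted(hits, key=lambda item: -item[1])


# Hypothetical counts, invented for illustration only.
example_counts = {"H1": (30, 10), "H2": (25, 15), "H3": (20, 30), "H4": (25, 45)}
example_clades = {"A": {"H1", "H2"}, "B": {"H3", "H4"}, "C": {"H1"}}
example_hits = clades_with_case_excess(example_counts, example_clades)
```

In this toy example, clade A (and its nested clade C) would be flagged, and the mutations on the branches defining those clades would be the candidates to examine. A real analysis would need to account for the multiple, non-independent tests that nested clades imply, for example through permutation.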
