Association analysis for quantitative traits by data mining: QHPM

Previously, we have presented a data mining-based algorithmic approach to genetic association analysis, Haplotype Pattern Mining. We have now extended the approach with the possibility of analysing quantitative traits and utilising covariates. This is accomplished by using a linear model for measuring association. We present results with the extended version, QHPM, with simulated quantitative trait data. One data set was simulated with the population simulator package Populus, and another was obtained from GAW12. In the former, there were 2-3 underlying susceptibility genes for a trait, each with several ancestral disease mutations, and 1 or 2 environmental components. We show that QHPM is capable of finding the susceptibility loci, even when there is strong allelic heterogeneity and environmental effects in the disease models. The power of finding quantitative trait loci is dependent on the ascertainment scheme of the data: collecting the study subjects from both ends of the quantitative trait distribution is more effective than using unselected individuals or individuals ascertained based on disease status, but QHPM has good power to localize the genes even with unselected individuals. Comparison with quantitative trait TDT (QTDT) showed that QHPM has better localization accuracy when the gene effect is weak.

[1]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[2]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[3]  J. Kere,et al.  Data mining applied to linkage disequilibrium mapping. , 2000, American journal of human genetics.

[4]  N. Risch,et al.  Mapping quantitative-trait loci in humans by use of extreme concordant sib pairs: selected sampling by parental phenotypes. , 1996, American journal of human genetics.

[5]  M. Daly,et al.  High-resolution haplotype structure in the human genome , 2001, Nature Genetics.

[6]  N. Risch Searching for genetic determinants in the new millennium , 2000, Nature.

[7]  L. Helmuth Genome research: map of the human genome 3.0. , 2001, Science.

[8]  L. Helmuth Map of the Human Genome 3.0 , 2001, Science.

[9]  S. Zhang,et al.  Quantitative similarity-based association tests using population samples. , 2001, American journal of human genetics.

[10]  L R Cardon,et al.  Extent and distribution of linkage disequilibrium in three genomic regions. , 2001, American journal of human genetics.

[11]  Vesa Ollikainen,et al.  Simulation Techniques for Disease Gene Localization in Isolated Populations , 2002 .

[12]  N Risch,et al.  Extreme discordant sib pairs for mapping quantitative trait loci in humans. , 1995, Science.

[13]  J. Kere,et al.  Mining Associations Between Genetic Markers, Phenotypes, and Covariates , 2001, Genetic epidemiology.

[14]  G. Abecasis,et al.  A general test of association for quantitative traits in nuclear families. , 2000, American journal of human genetics.

[15]  J K Hewitt,et al.  Combined linkage and association sib-pair analysis for quantitative traits. , 1999, American journal of human genetics.

[16]  N. Risch,et al.  Mapping quantitative trait loci with extreme discordant sib pairs: sampling considerations. , 1996, American journal of human genetics.

[17]  R C Elston,et al.  Transmission/disequilibrium tests for quantitative traits , 2001, Genetic epidemiology.

[18]  D Rabinowitz,et al.  A transmission disequilibrium test for quantitative trait loci. , 1997, Human heredity.

[19]  L R Cardon,et al.  The power to detect linkage disequilibrium with quantitative traits in selected samples. , 2001, American journal of human genetics.

[20]  W. Ewens,et al.  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). , 1993, American journal of human genetics.

[21]  N. Risch,et al.  Affected‐sib‐pair interval mapping and exclusion for complex genetic traits: Sampling considerations , 1996, Genetic epidemiology.

[22]  Pardis C Sabeti,et al.  Linkage disequilibrium in the human genome , 2001, Nature.

[23]  P. Sham,et al.  Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data. , 2000, American journal of human genetics.

[24]  T D Dyer,et al.  GAW12: Simulated Genome Scan, Sequence, and Family Data for a Common Disease , 2001, Genetic epidemiology.

[25]  B. Rannala,et al.  High-resolution multipoint linkage-disequilibrium mapping in the context of a human genome sequence. , 2001, American journal of human genetics.

[26]  D Curtis,et al.  Use of an artificial neural network to detect association between a disease and multiple marker genotypes , 2001, Annals of human genetics.