Haplotype interaction analysis of unlinked regions

Genetically complex diseases are caused by interacting environmental factors and genes. As a consequence, statistical methods that consider multiple unlinked genomic regions simultaneously are desirable. Such consideration, however, may lead to a vast number of different high‐dimensional tests whose appropriate analysis pose a problem. Here, we present a method to analyze case‐control studies with multiple SNP data without phase information that considers gene‐gene interaction effects while correcting appropriately for multiple testing. In particular, we allow for interactions of haplotypes that belong to different unlinked regions, as haplotype analysis often proves to be more powerful than single marker analysis. In addition, we consider different marker combinations at each unlinked region. The multiple testing issue is settled via the minP approach; the P value of the “best” marker/region configuration is corrected via Monte‐Carlo simulations. Thus, we do not explicitly test for a specific pre‐defined interaction model, but test for the global hypothesis that none of the considered haplotype interactions shows association with the disease. We carry out a simulation study for case‐control data that confirms the validity of our approach. When simulating two‐locus disease models, our test proves to be more powerful than association methods that analyze each linked region separately. In addition, when one of the tested regions is not involved in the etiology of the disease, only a small amount of power is lost with interaction analysis as compared to analysis without interaction. We successfully applied our method to a real case‐control data set with markers from two genes controlling a common pathway. While classical analysis failed to reach significance, we obtained a significant result even after correction for multiple testing with our proposed haplotype interaction analysis. The method described here has been implemented in FAMHAP. Genet. Epidemiol. 2005. © 2005 Wiley‐Liss, Inc.

[1]  S. Cichon,et al.  Examination of G72 and D-amino-acid oxidase as genetic risk factors for schizophrenia and bipolar affective disorder , 2004, Molecular Psychiatry.

[2]  J. A. Dunston,et al.  Phenotype Severity and Genetic Variation at the Disease Locus: An Investigation of Nail Dysplasia in the Nail Patella Syndrome , 2005, Annals of human genetics.

[3]  Michael Knapp,et al.  Maximum‐likelihood estimation of haplotype frequencies in nuclear families , 2004, Genetic epidemiology.

[4]  S A Seuchter,et al.  Two-locus disease models with two marker loci: the power of affected-sib-pair tests. , 1994, American journal of human genetics.

[5]  M. G. Reese,et al.  Improved use of SNP information to detect the role of genes , 2003, Genetic epidemiology.

[6]  B. Manly Randomization, Bootstrap and Monte Carlo Methods in Biology , 2018 .

[7]  P. Sham,et al.  Genetic and physiological data implicating the new human gene G72 and the gene for d-amino acid oxidase in schizophrenia , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Dudoit,et al.  Resampling-based multiple testing for microarray data analysis , 2003 .

[9]  C Charles Gu,et al.  Genetic association mapping under founder heterogeneity via weighted haplotype similarity analysis in candidate genes , 2004, Genetic epidemiology.

[10]  Heather J Cordell,et al.  Case/pseudocontrol analysis in genetic association studies: A unified framework for detection of genotype and haplotype associations, gene‐gene and gene‐environment interactions, and parent‐of‐origin effects , 2004, Genetic epidemiology.

[11]  Michael Knapp,et al.  A powerful strategy to account for multiple testing in the context of haplotype analysis. , 2004, American journal of human genetics.

[12]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[13]  L. Wasserman,et al.  Analysis of multilocus models of association , 2003, Genetic epidemiology.

[14]  M Knapp,et al.  Multiple Testing in the Context of Haplotype Analysis Revisited: Application to Case‐Control Data , 2005, Annals of human genetics.

[15]  S. S. Young,et al.  Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[16]  J. Ott,et al.  Trimming, weighting, and grouping SNPs in human case-control association studies. , 2001, Genome research.

[17]  Daniel J Schaid,et al.  Nonparametric tests of association of multiple genes with human disease. , 2005, American journal of human genetics.