Comparing Logic Regression Based Methods for Identifying SNP Interactions

In single-nucleotide polymorphism (SNP) association studies interactions are often of main interest. Logic regression is a regression methodology that can identify complex Boolean interactions of binary variables. It has been applied successfully to SNP data but only identifies a single best model, while usually there is a number of models that are almost as good. Extensions of logic regression that consider several plausible models are Monte Carlo logic regression (MCLR) and a full Bayesian version of logic regression (FBLR) proposed in this paper. FBLR allows the incorporation of biological knowledge such as known pathways. We compare the performance in identifying SNP interactions associated with the case-control status of the three logic regression based methods and stepwise logistic regression in a simulation study and in a study of breast cancer.

[1]  M. De Iorio,et al.  Bayesian logistic regression using a perfect phylogeny. , 2007, Biostatistics.

[2]  Thomas Brüning,et al.  ERCC2 genotypes and a corresponding haplotype are linked with breast cancer risk in a German population. , 2004, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[3]  K. Golka,et al.  The enhanced bladder cancer susceptibility of NAT2 slow acetylators towards aromatic amines: a review considering ethnic differences. , 2002, Toxicology letters.

[4]  R. H. J. M. Otten,et al.  The Annealing Algorithm , 1989 .

[5]  S. Chanock,et al.  SNPs in cancer research and treatment , 2004, British Journal of Cancer.

[6]  Christopher C. Holmes,et al.  Classification with Bayesian MARS , 2004, Machine Learning.

[7]  Ingo Ruczinski,et al.  Identifying interacting SNPs using Monte Carlo logic regression , 2005, Genetic epidemiology.

[8]  C Kooperberg,et al.  Sequence Analysis Using Logic Regression , 2001, Genetic epidemiology.

[9]  Katja Ickstadt,et al.  Analyzing SNPs: Are There Needles in the Haystack? , 2006 .

[10]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .