Detection for gene-gene co-association via kernel canonical correlation analysis

BackgroundCurrently, most methods for detecting gene-gene interaction (GGI) in genomewide association studies (GWASs) are limited in their use of single nucleotide polymorphism (SNP) as the unit of association. One way to address this drawback is to consider higher level units such as genes or regions in the analysis. Earlier we proposed a statistic based on canonical correlations (CCU) as a gene-based method for detecting gene-gene co-association. However, it can only capture linear relationship and not nonlinear correlation between genes. We therefore proposed a counterpart (KCCU) based on kernel canonical correlation analysis (KCCA).ResultsThrough simulation the KCCU statistic was shown to be a valid test and more powerful than CCU statistic with respect to sample size and interaction odds ratio. Analysis of data from regions involving three genes on rheumatoid arthritis (RA) from Genetic Analysis Workshop 16 (GAW16) indicated that only KCCU statistic was able to identify interactions reported earlier.ConclusionsKCCU statistic is a valid and powerful gene-based method for detecting gene-gene co-association.

[1]  Anbupalam Thalamuthu,et al.  TRAF1-C5 as a risk locus for rheumatoid arthritis--a genomewide study. , 2007, The New England journal of medicine.

[2]  D. Nicolae,et al.  Restricted parameter space models for testing gene‐gene interaction , 2009, Genetic epidemiology.

[3]  C. Cockerham,et al.  An Extension of the Concept of Partitioning Hereditary Variance for Analysis of Covariances among Relatives When Epistasis Is Present. , 1954, Genetics.

[4]  Yi Wang,et al.  Exploration of gene–gene interaction effects using entropy-based methods , 2008, European Journal of Human Genetics.

[5]  Andreas Ziegler,et al.  On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data , 2010, Bioinform..

[6]  M. Xiong,et al.  Test for interaction between two unlinked loci. , 2006, American journal of human genetics.

[7]  Franck Davoine,et al.  Face tracking using canonical correlation analysis , 2007, VISAPP.

[8]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[9]  J. Booth,et al.  Resampling-Based Multiple Testing. , 1994 .

[10]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[11]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[12]  Qianqian Peng,et al.  A gene-based method for detecting gene–gene co-association in a case–control association study , 2010, European Journal of Human Genetics.

[13]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[14]  M. Xiong,et al.  Composite measure of linkage disequilibrium for testing interaction between unlinked loci , 2008, European Journal of Human Genetics.

[15]  H. Cordell Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. , 2002, Human molecular genetics.

[16]  O. Kempthorne,et al.  The correlation between relatives in a random mating population , 1954, Proceedings of the Royal Society of London. Series B - Biological Sciences.

[17]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[18]  G. Mendel,et al.  Mendel's Principles of Heredity , 1910, Nature.

[19]  Jason H. Moore,et al.  Tuning ReliefF for Genome-Wide Genetic Analysis , 2007, EvoBIO.

[20]  Jing Li,et al.  Generating samples for association studies based on HapMap data , 2008, BMC Bioinformatics.

[21]  Jun S. Liu,et al.  Bayesian inference of epistatic interactions in case-control studies , 2007, Nature Genetics.

[22]  Yoshihiro Yamanishi,et al.  Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis , 2003, ISMB.

[23]  Andreas Ziegler,et al.  On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data , 2010, Bioinform..

[24]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[25]  P. Phillips Epistasis — the essential role of gene interactions in the structure and evolution of genetic systems , 2008, Nature Reviews Genetics.

[26]  J. Golinval,et al.  Fault detection based on Kernel Principal Component Analysis , 2010 .

[27]  W. Zheng,et al.  Facial expression recognition using kernel canonical correlation analysis (KCCA) , 2006, IEEE Transactions on Neural Networks.

[28]  Dechang Chen,et al.  Gene Expression Data Classification With Kernel Principal Component Analysis , 2005, Journal of biomedicine & biotechnology.

[29]  Qiang Yang,et al.  BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies , 2010, American journal of human genetics.

[30]  Daoqiang Zhang,et al.  Adaptive Kernel Principal Component Analysis with Unsupervised Learning of Kernels , 2006, Sixth International Conference on Data Mining (ICDM'06).

[31]  Tian Zheng,et al.  Rheumatoid arthritis-associated gene-gene interaction network for rheumatoid arthritis candidate genes , 2009, BMC proceedings.

[32]  Yijun Zuo,et al.  An entropy-based approach for testing genetic epistasis underlying complex diseases. , 2008, Journal of theoretical biology.