Multivariate Phenotype Association Analysis by Marker‐Set Kernel Machine Regression

Genetic studies of complex diseases often collect multiple phenotypes relevant to the disorders. As these phenotypes can be correlated and share common genetic mechanisms, jointly analyzing these traits may bring more power to detect genes influencing individual or multiple phenotypes. Given the advances brought by multivariate phenotype approaches and multimarker kernel machine regression, we construct a multivariate regression based on the kernel machine to facilitate the joint evaluation of multimarker effects on multiple phenotypes. The kernel machine serves as a powerful dimension‐reduction tool to capture complex effects among markers. The multivariate framework incorporates the potentially correlated multidimensional phenotypic information and accommodates common or different environmental covariates for each trait. We derive the multivariate kernel machine test based on a score‐like statistic, and conduct simulations to evaluate the validity and efficacy of the method. We also study the performance of the commonly adopted strategies for kernel machine analysis of multiple phenotypes, including multiple univariate kernel machine tests with the original phenotypes or with their principal components. Our results suggest that none of these approaches has uniformly best power, and that the optimal test depends on the magnitude of the phenotype correlation and the effect patterns. However, the multivariate test remains a reasonable approach when the multiple phenotypes have no or only mild correlation, and gives the best power once the correlation becomes stronger or when there exist genes that affect more than one phenotype. We illustrate the utility of the multivariate kernel machine method through the Clinical Antipsychotic Trials of Intervention Effectiveness antibody study.
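
To make the idea of a score-like kernel statistic over several phenotypes concrete, the following is a minimal sketch and not the test derived in the paper. It assumes a linear kernel on the genotype matrix, residualizes every trait on one shared covariate matrix, sums the per-trait quadratic forms without weighting by the phenotype covariance, and uses a permutation null in place of an analytic null distribution. All function names (`linear_kernel`, `residualize`, `mv_kernel_stat`, `permutation_pvalue`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def linear_kernel(G):
    """n x n similarity matrix from an n x p genotype matrix (assumed linear kernel)."""
    return G @ G.T

def residualize(Y, X):
    """Residuals of each phenotype column after a least-squares fit on covariates X.

    Assumption: all traits share the same covariates; X should contain an intercept.
    """
    beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return Y - X @ beta

def mv_kernel_stat(R, K):
    """Unweighted score-like statistic: sum over traits of r_k' K r_k = trace(R' K R)."""
    return float(np.trace(R.T @ K @ R))

def permutation_pvalue(Y, X, G, n_perm=999, seed=0):
    """Permutation p-value for the unweighted statistic (an illustrative simplification)."""
    rng = np.random.default_rng(seed)
    K = linear_kernel(G)
    R = residualize(Y, X)
    observed = mv_kernel_stat(R, K)
    n = R.shape[0]
    exceed = 0
    for _ in range(n_perm):
        idx = rng.permutation(n)
        # Shuffle subjects in the kernel only, so the rows of R (and hence the
        # correlation among traits within a subject) stay intact.
        if mv_kernel_stat(R, K[np.ix_(idx, idx)]) >= observed:
            exceed += 1
    return observed, (exceed + 1) / (n_perm + 1)
```

With `Y` an n x m matrix of phenotypes, `X` an n x q covariate matrix, and `G` an n x p matrix of marker genotypes, `permutation_pvalue(Y, X, G)` returns the observed statistic and its permutation p-value. Permuting residuals is only approximately valid when covariates are present, and a statistic that accounts for the phenotype correlation would weight the residuals by an estimate of the inverse residual covariance; both points are left out here to keep the sketch short.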
