Analysis of multiple related phenotypes in genome-wide association studies

Most genome-wide association studies (GWAS) have been conducted by focusing on one phenotype of interest for identifying genetic variants associated with common complex phenotypes. However, despite many successful results from GWAS, only a small number of genetic variants tend to be identified and replicated given a very stringent genome-wide significance criterion, and explain only a small fraction of phenotype heritability. In order to improve power by using more information from data, we propose an alternative multivariate approach, which considers multiple related phenotypes simultaneously. We demonstrate through computer simulation that the multivariate approach can improve power for detecting disease-predisposing genetic variants and pleiotropic variants that have simultaneous effects on multiple related phenotypes. We apply the multivariate approach to a GWA dataset of 8,842 Korean individuals genotyped for 327,872 SNPs, and detect novel genetic variants associated with metabolic syndrome related phenotypes. Considering several related phenotype simultaneously, the multivariate approach provides not only more powerful results than the conventional univariate approach but also clue to identify pleiotropic genes that are important to the pathogenesis of many related complex phenotypes.

[1]  C. Barnes,et al.  Genome-Wide Screen for Metabolic Syndrome Susceptibility Loci Reveals Strong Lipid Gene Contribution But No Evidence for Common Genetic Basis for Clustering of Metabolic Syndrome Traits , 2012, Circulation. Cardiovascular genetics.

[2]  Wei Lu,et al.  Meta-analysis of genome-wide association studies identifies eight new loci for type 2 diabetes in east Asians , 2011, Nature Genetics.

[3]  Tien Yin Wong,et al.  Meta-analysis of genome-wide association studies identifies common variants associated with blood pressure variation in east Asians , 2011, Nature Genetics.

[4]  Fernando Costa,et al.  Diagnosis and management of the metabolic syndrome: an American Heart Association/National Heart, Lung, and Blood Institute Scientific Statement. , 2005, Circulation.

[5]  E. Demerath,et al.  Meta-analysis of genome-wide association data identifies two loci influencing age at menarche , 2009, Nature Genetics.

[6]  Y. J. Kim,et al.  Genome-Wide Association Study Identifies Two Novel Loci with Sex-Specific Effects for Type 2 Diabetes Mellitus and Glycemic Traits in a Korean Population , 2014, Diabetes & metabolism journal.

[7]  R. Collins,et al.  Newly identified loci that influence lipid concentrations and risk of coronary artery disease , 2008, Nature Genetics.

[8]  K. Frazer,et al.  Common vs. rare allele hypotheses for complex diseases. , 2009, Current opinion in genetics & development.

[9]  R. Hegele,et al.  Genetic determinants of the metabolic syndrome , 2006, Nature Clinical Practice Cardiovascular Medicine.

[10]  R. Krauss,et al.  Diagnosis and Management of the Metabolic Syndrome: An American Heart Association/National Heart, Lung, and Blood Institute Scientific Statement , 2005, Current opinion in cardiology.

[11]  Thomas Meitinger,et al.  Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution , 2010, Nature Genetics.

[12]  Ikuo Saito,et al.  Metabolic syndrome: recent prevalence in East and Southeast Asian populations. , 2007, Asia Pacific journal of clinical nutrition.

[13]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[14]  J. Ioannidis,et al.  Meta-analysis methods for genome-wide association studies and beyond , 2013, Nature Reviews Genetics.

[15]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[16]  Wei Pan,et al.  Penalized regression and risk prediction in genome‐wide association studies , 2013, Stat. Anal. Data Min..

[17]  C. Gieger,et al.  Human metabolic individuality in biomedical and pharmaceutical research , 2011, Nature.

[18]  K. Hong,et al.  The dietary monounsaturated to saturated fatty acid ratio modulates the genetic effects of GCKR on serum lipid levels in children. , 2015, Clinica chimica acta; international journal of clinical chemistry.

[19]  Dolores Corella,et al.  Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans , 2008, Nature Genetics.

[20]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[21]  Taesung Park,et al.  Large-scale genome-wide association studies in east Asians identify new genetic loci influencing metabolic traits , 2011, Nature Genetics.

[22]  Chong Shen,et al.  Genome-wide association study in Han Chinese identifies four new susceptibility loci for coronary artery disease , 2012, Nature Genetics.

[23]  Marcia M. Nizzari,et al.  Genome-Wide Association Analysis Identifies Loci for Type 2 Diabetes and Triglyceride Levels , 2007, Science.

[24]  Yan Guo,et al.  Powerful Bivariate Genome-Wide Association Analyses Suggest the SOX6 Gene Influencing Both Obesity and Osteoporosis Phenotypes in Males , 2009, PloS one.

[25]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[26]  Hui Shen,et al.  Bivariate genome-wide association study suggests fatty acid desaturase genes and cadherin DCHS2 for variation of both compressive strength index and appendicular lean mass in males. , 2012, Bone.

[27]  Erik Ingelsson,et al.  Genome-wide association studies of obesity and metabolic syndrome , 2014, Molecular and Cellular Endocrinology.

[28]  Xiaofeng Zhu,et al.  ARTICLE Genome-wide Characterization of Shared and Distinct Genetic Components that Influence Blood Lipid Levels in Ethnically Diverse Human Populations , 2022 .

[29]  Peter Kraft,et al.  Replication in genome-wide association studies. , 2009, Statistical science : a review journal of the Institute of Mathematical Statistics.

[30]  S. Jee,et al.  Cholesterol ester transfer protein gene is associated with high-density lipoprotein cholesterol levels in Korean population , 2012, Genes & Genomics.

[31]  Taesung Park,et al.  A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits , 2009, Nature Genetics.

[32]  P. Froguel,et al.  The genetics of human obesity , 2005, Nature Reviews Genetics.

[33]  Y. J. Kim,et al.  New susceptibility loci in MYL2, C12orf51 and OAS1 associated with 1-h plasma glucose as predisposing risk factors for type 2 diabetes in the Korean population , 2013, Journal of Human Genetics.

[34]  P. O’Reilly,et al.  MultiPhen: Joint Model of Multiple Phenotypes Can Increase Discovery in GWAS , 2012, PloS one.

[35]  Ellen Kampman,et al.  Genome-wide association yields new sequence variants at seven loci that associate with measures of obesity , 2009, Nature Genetics.

[36]  Taesung Park,et al.  Joint Identification of Multiple Genetic Variants via Elastic‐Net Variable Selection in a Genome‐Wide Association Analysis , 2010, Annals of human genetics.

[37]  Luigi Ferrucci,et al.  Population-based genome-wide association studies reveal six loci influencing plasma levels of liver enzymes. , 2008, American journal of human genetics.

[38]  Nilanjan Chatterjee,et al.  Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies , 2013, Nature Genetics.

[39]  Sung Hee Choi,et al.  Increasing Prevalence of Metabolic Syndrome in Korea , 2011, Diabetes Care.

[40]  D W Fulker,et al.  Increase in Power through Multivariate Analyses , 1998, Behavior genetics.

[41]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[42]  Mark I McCarthy,et al.  Genomic inflation factors under polygenic inheritance , 2011, European Journal of Human Genetics.

[43]  P. Ridker,et al.  Polymorphism in the CETP Gene Region, HDL Cholesterol, and Risk of Future Myocardial Infarction: Genomewide Analysis Among 18 245 Initially Healthy Women From the Women’s Genome Health Study , 2009, Circulation. Cardiovascular genetics.

[44]  C. Hoggart,et al.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population , 2008, Nature Genetics.

[45]  Patrick Neven,et al.  Genome-wide association studies identify four ER negative–specific breast cancer risk loci , 2013, Nature Genetics.

[46]  Steven Gallinger,et al.  Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer , 2008, Nature Genetics.

[47]  S. Grundy,et al.  The metabolic syndrome , 2003, The Lancet.

[48]  M. Fornage,et al.  Genetic Loci Associated with Plasma Phospholipid n-3 Fatty Acids: A Meta-Analysis of Genome-Wide Association Studies from the CHARGE Consortium , 2011, PLoS genetics.

[49]  Anders D. Børglum,et al.  Genome-wide association study identifies five new schizophrenia loci , 2011, Nature Genetics.

[50]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[51]  R. Elston,et al.  On the Analysis of a Repeated Measure Design in Genome-Wide Association Analysis , 2014, International journal of environmental research and public health.