CPAG: software for leveraging pleiotropy in GWAS to reveal similarity between human traits links plasma fatty acids and intestinal inflammation

Meta-analyses of genome-wide association studies (GWAS) have demonstrated that the same genetic variants can be associated with multiple diseases and other complex traits. We present software called CPAG (Cross-Phenotype Analysis of GWAS) to look for similarities between 700 traits, build trees with informative clusters, and highlight underlying pathways. Clusters are consistent with pre-defined groups and literature-based validation but also reveal novel connections. We report similarity between plasma palmitoleic acid and Crohn's disease and find that specific fatty acids exacerbate enterocolitis in zebrafish. CPAG will become increasingly powerful as more genetic variants are uncovered, leading to a deeper understanding of complex traits. CPAG is freely available at www.sourceforge.net/projects/CPAG/.

[1]  W. Guan,et al.  Genome-Wide Association Study Identifies Novel Loci Associated With Concentrations of Four Plasma Phospholipid Fatty Acids in the De Novo Lipogenesis Pathway: Results From the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium , 2013, Circulation. Cardiovascular genetics.

[2]  E. Manzato,et al.  Plasma lipids and inflammation in active inflammatory bowel diseases , 2009, Alimentary pharmacology & therapeutics.

[3]  D. Ko,et al.  Understanding Human Variation in Infectious Disease Susceptibility through Clinical and Cellular GWAS , 2013, PLoS pathogens.

[4]  Kasper Lage,et al.  Pervasive Sharing of Genetic Effects in Autoimmune Disease , 2011, PLoS genetics.

[5]  R. Recker,et al.  Genetic and environmental correlations between obesity phenotypes and age at menarche , 2006, International Journal of Obesity.

[6]  Robert K. Colwell,et al.  A new statistical approach for assessing similarity of species composition with incidence and abundance data , 2004 .

[7]  Stefan Wirtz,et al.  Chemically induced mouse models of intestinal inflammation , 2007, Nature Protocols.

[8]  Manabu T. Nakamura,et al.  Disruption of FADS2 gene in mice impairs male reproduction and causes dermal and intestinal ulceration , 2009, Journal of Lipid Research.

[9]  A. Butte,et al.  Disease Risk Factors Identified Through Shared Genetic Architecture and Electronic Medical Records , 2014, Science Translational Medicine.

[10]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[11]  Richard T. Lee,et al.  Targeted deletion of caspase-1 reduces early mortality and left ventricular dilatation following myocardial infarction. , 2003, Journal of molecular and cellular cardiology.

[12]  Elizabeth W Karlson,et al.  Replication of putative candidate-gene associations with rheumatoid arthritis in >4,000 samples from North America and Sweden: association of susceptibility with PTPN22, CTLA4, and PADI4. , 2005, American journal of human genetics.

[13]  S. Purcell,et al.  Pleiotropy in complex traits: challenges and strategies , 2013, Nature Reviews Genetics.

[14]  G. McVean,et al.  Psoriasis Patients Are Enriched for Genetic Variants That Protect against HIV-1 Disease , 2012, PLoS genetics.

[15]  N. Voelkel,et al.  The inflammasome promotes adverse cardiac remodeling following acute myocardial infarction in the mouse , 2011, Proceedings of the National Academy of Sciences.

[16]  Michael Wasnick,et al.  A genome-wide in vitro bacterial-infection screen reveals human variation in the host response associated with inflammatory disease. , 2009, American journal of human genetics.

[17]  C. Hall,et al.  The zebrafish lysozyme C promoter drives myeloid-specific expression in transgenic fish , 2007, BMC Developmental Biology.

[18]  Atul J. Butte,et al.  Autoimmune Disease Classification by Inverse Association with SNP Alleles , 2009, PLoS genetics.

[19]  Huaxi Xu,et al.  Apolipoprotein E and Alzheimer disease: risk, mechanisms and therapy , 2013, Nature Reviews Neurology.

[20]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[21]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[22]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[23]  G. Cheng,et al.  Systematic identification of type I and type II interferon-induced antiviral factors , 2012, Proceedings of the National Academy of Sciences.

[24]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .

[25]  R. A. Bailey,et al.  Robust associations of four new chromosome regions from genome-wide analyses of type 1 diabetes , 2007, Nature Genetics.

[26]  G. Greenberg,et al.  Ustekinumab induction and maintenance therapy in refractory Crohn's disease. , 2012, The New England journal of medicine.

[27]  森下 Measuring of interspecific association and similarity between communities. , 1961 .

[28]  F. Agakov,et al.  Abundant pleiotropy in human complex diseases and traits. , 2011, American journal of human genetics.

[29]  H. El‐Serag,et al.  Dietary Intake and Risk of Developing Inflammatory Bowel Disease: A Systematic Review of the Literature , 2011, The American Journal of Gastroenterology.

[30]  F. Piano,et al.  A High-Resolution C. elegans Essential Gene Network Based on Phenotypic Profiling of a Complex Tissue , 2011, Cell.

[31]  P. Rutgeerts,et al.  A randomized trial of Ustekinumab, a human interleukin-12/23 monoclonal antibody, in patients with moderate-to-severe Crohn's disease. , 2008, Gastroenterology.

[32]  C. Myers,et al.  Using networks to measure similarity between genes: association index selection , 2013, Nature Methods.

[33]  K. Wakai,et al.  Dietary Risk Factors for Inflammatory Bowel Disease: A Multicenter Case‐Control Study in Japan , 2005, Inflammatory bowel diseases.

[34]  Kathryn E. Crosier,et al.  A chemical enterocolitis model in zebrafish larvae that is dependent on microbiota and responsive to pharmacological agents , 2011, Developmental dynamics : an official publication of the American Association of Anatomists.

[35]  A. Kimball,et al.  Efficacy and safety of ustekinumab, a human interleukin-12/23 monoclonal antibody, in patients with psoriasis: 76-week results from a randomised, double-blind, placebo-controlled trial (PHOENIX 1) , 2008, The Lancet.

[36]  M. Lohse,et al.  A Role for Caspase-1 in Heart Failure , 2007, Circulation research.

[37]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Dennis C. Ko,et al.  Functional genetic screen of human diversity reveals that a methionine salvage enzyme regulates inflammatory cell death , 2012, Proceedings of the National Academy of Sciences.

[39]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[40]  M. Lebwohl,et al.  Efficacy and safety of ustekinumab, a human interleukin-12/23 monoclonal antibody, in patients with psoriasis: 52-week results from a randomised, double-blind, placebo-controlled trial (PHOENIX 2) , 2008, The Lancet.

[41]  K. Tokunaga,et al.  Association of Fcgamma receptor IIA, but not IIB and IIIA, polymorphisms with systemic lupus erythematosus: A family-based association study in Caucasians. , 2004, Arthritis and rheumatism.

[42]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[43]  R. Bloomfeld,et al.  Conjugated linoleic acid modulates immune responses in patients with mild to moderately active Crohn's disease. , 2012, Clinical nutrition.

[44]  I. T. de Almeida,et al.  Plasma total and free fatty acids composition in human non-alcoholic steatohepatitis. , 2002, Clinical nutrition.

[45]  Kristin G Ardlie,et al.  Genetic association of the R620W polymorphism of protein tyrosine phosphatase PTPN22 with human SLE. , 2004, American journal of human genetics.

[46]  R. McCarter,et al.  Biological Variation and Hemoglobin A1c: Relevance to Diabetes Management and Complications , 2013, Pediatric diabetes.

[47]  Robert K. Colwell,et al.  Abundance‐Based Similarity Indices and Their Estimation When There Are Unseen Species in Samples , 2006, Biometrics.

[48]  M. Daly,et al.  Genetic Mapping in Human Disease , 2008, Science.

[49]  Elaine Nsoesie,et al.  Prediction of Disease and Phenotype Associations from Genome-Wide Association Studies , 2011, PloS one.

[50]  E. Levy,et al.  Altered lipid profile, lipoprotein composition, and oxidant and antioxidant status in pediatric Crohn disease. , 2000, The American journal of clinical nutrition.

[51]  H. S. Horn,et al.  Measurement of "Overlap" in Comparative Ecological Studies , 1966, The American Naturalist.

[52]  E. Levy,et al.  Imbalances in Dietary Consumption of Fatty Acids, Vegetables, and Fruits Are Associated With Risk for Crohn's Disease in Children , 2007, The American Journal of Gastroenterology.

[53]  Kathryn E. Crosier,et al.  Chemically induced intestinal damage models in zebrafish larvae. , 2013, Zebrafish.

[54]  B. Cookson,et al.  Pyroptosis: host cell death and inflammation , 2009, Nature Reviews Microbiology.

[55]  G. Christensen,et al.  The NLRP3 inflammasome is up-regulated in cardiac fibroblasts and mediates myocardial ischaemia-reperfusion injury. , 2013, Cardiovascular research.