Integrated systems approach identifies pathways from the genome to triglycerides through a metabolomic causal network

Introduction: To leverage functionality and clinical relevance into understanding systems biology, one needs to understand the pathway of the genetic effects on risk factors/disease through intermediate molecular levels, such as metabolomics. Systems approaches integrate multi-omic information to find pathways to disease endpoints and make optimal inference decisions. Method: Here, we introduce a multi-stage approach to integrate causal networks in observational studies and GWAS to facilitate mechanistic understanding through identification of pathways from the genome to risk factors/disease via metabolomics. The pathways in causal networks reveal the underlying relationships behind observations, which do not play a significant role in more traditional correlative analyses, where one variable at a time is considered. Results: We identified a causal network over the metabolomic level using the genome directed acyclic graph (G-DAG), to systematically assess whether variations in the genome lead to variations in triglyceride levels as a risk factor of cardiovascular disease. We found LRRC46 and LRRC69 harboring loss-of-function mutations have significant effect on two metabolites with direct effects on triglyceride levels. We also found pathways of FAM198B and C6orf25 to triglycerides through indirect paths from metabolites. Conclusion: Integrating causal networks with GWAS facilitates mechanistic understanding in comparison to one-variable-at-a-time approaches due to accounting for relationships among components at intermediate molecular levels. This approach is complementary to experimental studies to identify efficacious targets in the age of big data sets.

[1]  Siamak Zamani Dadaneh,et al.  Sequential Sampling for Optimal Bayesian Classification of Sequencing Count Data , 2018, 2018 52nd Asilomar Conference on Signals, Systems, and Computers.

[2]  Ahmad Samiei,et al.  Arachidonic acid as a target for treating hypertriglyceridemia reproduced by a causal network analysis and an intervention study , 2018, Metabolomics.

[3]  H. Cordell,et al.  A comparison of methods for inferring causal relationships between genotype and phenotype using additional biological measurements , 2017, Genetic epidemiology.

[4]  A. Elwany,et al.  Mechanical properties and microstructural characterization of selective laser melted 17-4 PH stainless steel , 2017 .

[5]  V. M. Chandrasekaran,et al.  Identification of cluster of proteins in the network of MAPK pathways as cancer drug targets , 2017 .

[6]  E. Boerwinkle,et al.  Erratum to: A causal network analysis in an observational study identifies metabolomics pathways influencing plasma triglyceride levels , 2016, Metabolomics.

[7]  H. Kwasnicka,et al.  Applying bounded fuzzy possibilistic method on critical objects , 2016, 2016 IEEE 17th International Symposium on Computational Intelligence and Informatics (CINTI).

[8]  Hossein Yazdani,et al.  Fuzzy possibilistic on different search spaces , 2016, 2016 IEEE 17th International Symposium on Computational Intelligence and Informatics (CINTI).

[9]  Ahmad Samiei,et al.  Identification, analysis, and interpretation of a human serum metabolomics causal network in an observational study , 2016, J. Biomed. Informatics.

[10]  Eric Boerwinkle,et al.  Loss-of-function variants influence the human serum metabolome , 2016, Science Advances.

[11]  E. Boerwinkle,et al.  A Causal Network Analysis of the Fatty Acid Metabolome in African-Americans Reveals a Critical Role for Palmitoleate and Margarate , 2016, Omics : a journal of integrative biology.

[12]  E. Boerwinkle,et al.  Identification of Rare Variants in Metabolites of the Carnitine Pathway by Whole Genome Sequencing Analysis , 2016, Genetic epidemiology.

[13]  Eric Boerwinkle,et al.  A causal network analysis in an observational study identifies metabolomics pathways influencing plasma triglyceride levels , 2016, Metabolomics.

[14]  Ahmad Samiei,et al.  Generating a robust statistical causal structure over 13 cardiovascular disease risk factors using genomics data , 2016, J. Biomed. Informatics.

[15]  Eloisa Arbustini,et al.  A targeted metabolomics assay for cardiac metabolism and demonstration using a mouse model of dilated cardiomyopathy , 2016, Metabolomics.

[16]  P. Vourc'h,et al.  Metabolomics in amyotrophic lateral sclerosis: how far can it take us? , 2016, European journal of neurology.

[17]  Eric Boerwinkle,et al.  Conceptual Aspects of Causal Networks in an Applied Context , 2016 .

[18]  P. Corcia,et al.  Biomarkers in amyotrophic lateral sclerosis: combining metabolomic and clinical parameters to define disease progression , 2016, European journal of neurology.

[19]  David B. Dunson,et al.  A hybrid bayesian approach for genome-wide association studies on related individuals , 2015, Bioinform..

[20]  Eric Boerwinkle,et al.  Rare variants analysis using penalization methods for whole genome sequence data , 2015, BMC Bioinformatics.

[21]  J. Borén,et al.  The small leucine‐rich repeat proteoglycans in tissue repair and atherosclerosis , 2015, Journal of internal medicine.

[22]  Edward R. Dougherty,et al.  Discrete optimal Bayesian classification with error-conditioned sequential sampling , 2015, Pattern Recognit..

[23]  C. Newgard,et al.  Integrated metabolomics and genomics: systems approaches to biomarkers and mechanisms of cardiovascular disease. , 2015, Circulation. Cardiovascular genetics.

[24]  Nobutaka Hattori,et al.  Identification of novel biomarkers for Parkinson's disease by metabolomic technologies , 2015, Journal of Neurology, Neurosurgery & Psychiatry.

[25]  E. Boerwinkle,et al.  Causal Inference in the Age of Decision Medicine , 2014, Journal of data mining in genomics & proteomics.

[26]  Eric Boerwinkle,et al.  Causal inference at the population level , 2014 .

[27]  M. Baiocchi,et al.  Instrumental variable methods for causal inference , 2014, Statistics in medicine.

[28]  K. Maki,et al.  Plasma Fatty Acids as Predictors of Triglyceride and Non-HDL Cholesterol Responses to Omega-3 Free Fatty Acid Therapy in Hypertriglyceridemia , 2014 .

[29]  A. Peters,et al.  Plasma Metabolomics Reveal Alterations of Sphingo- and Glycerophospholipid Levels in Non-Diabetic Carriers of the Transcription Factor 7-Like 2 Polymorphism rs7903146 , 2013, PloS one.

[30]  S. Thompson,et al.  Use of allele scores as instrumental variables for Mendelian randomization , 2013, International journal of epidemiology.

[31]  David J. Fleet,et al.  Hamming Distance Metric Learning , 2012, NIPS.

[32]  S. Fischer,et al.  Untargeted Plasma Metabolite Profiling Reveals the Broad Systemic Consequences of Xanthine Oxidoreductase Inactivation in Mice , 2012, PloS one.

[33]  Joseph K. Pickrell,et al.  A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes , 2012, Science.

[34]  Aleksandar Milosavljevic,et al.  An integrative variant analysis suite for whole exome next-generation sequencing data , 2012, BMC Bioinformatics.

[35]  Mark R. Viant,et al.  Missing values in mass spectrometry based metabolomics: an undervalued step in the data processing pipeline , 2011, Metabolomics.

[36]  J. Perkel Metabolomics: where seeing is believing. , 2011, BioTechniques.

[37]  R. Mayeux,et al.  Epidemiology of Alzheimer disease , 2011, Nature Reviews Neurology.

[38]  Markus Perola,et al.  Metabonomic, transcriptomic, and genomic variation of a population cohort , 2010, Molecular systems biology.

[39]  W. Rottbauer,et al.  Myomasp/LRRC39, a Heart- and Muscle-Specific Protein, Is a Novel Component of the Sarcomeric M-Band and Is Involved in Stretch Sensing , 2010, Circulation research.

[40]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[41]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[42]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[43]  Christian Baumgartner,et al.  Metabolite profiling of blood from individuals undergoing planned myocardial infarction reveals early markers of myocardial injury. , 2008, The Journal of clinical investigation.

[44]  D. Raftery,et al.  Metabolomics-based methods for early disease diagnostics , 2008, Expert review of molecular diagnostics.

[45]  M. Tobin,et al.  Mendelian Randomisation and Causal Inference in Observational Epidemiology , 2008, PLoS medicine.

[46]  Fengyu Zhang,et al.  An approach to incorporate linkage disequilibrium structure into genomic association analysis. , 2008, Journal of genetics and genomics = Yi chuan xue bao.

[47]  R. O’Doherty,et al.  A moderate increase in carnitine palmitoyltransferase 1a activity is sufficient to substantially reduce hepatic triglyceride levels. , 2008, American journal of physiology. Endocrinology and metabolism.

[48]  George Davey Smith,et al.  Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology , 2008, Statistics in medicine.

[49]  E. Feskens,et al.  Genetic variation in thioredoxin interacting protein (TXNIP) is associated with hypertriglyceridaemia and blood pressure in diabetes mellitus , 2007, Diabetic medicine : a journal of the British Diabetic Association.

[50]  A. Dawid FUNDAMENTALS OF STATISTICAL CAUSALITY , 2007 .

[51]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[52]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[53]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[54]  I. Wilson,et al.  Understanding 'Global' Systems Biology: Metabonomics and the Continuum of Metabolism , 2003, Nature Reviews Drug Discovery.

[55]  M. Otagiri,et al.  Effects of uremic toxins and fatty acids on serum protein binding of furosemide: possible mechanism of the binding defect in uremia. , 1997, Clinical chemistry.

[56]  P. Schuster,et al.  From sequences to shapes and back: a case study in RNA secondary structures , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[57]  A. Folsom,et al.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. , 1989, American journal of epidemiology.