Investigating perturbed pathway modules from gene expression data via structural equation models

BackgroundIt is currently accepted that the perturbation of complex intracellular networks, rather than the dysregulation of a single gene, is the basis for phenotypical diversity. High-throughput gene expression data allow to investigate changes in gene expression profiles among different conditions. Recently, many efforts have been made to individuate which biological pathways are perturbed, given a list of differentially expressed genes (DEGs). In order to understand these mechanisms, it is necessary to unveil the variation of genes in relation to each other, considering the different phenotypes. In this paper, we illustrate a pipeline, based on Structural Equation Modeling (SEM) that allowed to investigate pathway modules, considering not only deregulated genes but also the connections between the perturbed ones.ResultsThe procedure was tested on microarray experiments relative to two neurological diseases: frontotemporal lobar degeneration with ubiquitinated inclusions (FTLD-U) and multiple sclerosis (MS). Starting from DEGs and dysregulated biological pathways, a model for each pathway was generated using databases information biological databases, in order to design how DEGs were connected in a causal structure. Successively, SEM analysis proved if pathways differ globally, between groups, and for specific path relationships. The results confirmed the importance of certain genes in the analyzed diseases, and unveiled which connections are modified among them.ConclusionsWe propose a framework to perform differential gene expression analysis on microarray data based on SEM, which is able to: 1) find relevant genes and perturbed biological pathways, investigating putative sub-pathway models based on the concept of disease module; 2) test and improve the generated models; 3) detect a differential expression level of one gene, and differential connection between two genes. This could shed light, not only on the mechanisms affecting variations in gene expression, but also on the causes of gene-gene relationship modifications in diseased phenotypes.

[1]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[2]  M A Province,et al.  Multivariate and multilocus variance components method, based on structural relationships to assess quantitative trait linkage via SEGPATH , 2003, Genetic epidemiology.

[3]  B. Meldrum,et al.  Glutamate as a neurotransmitter in the brain: review of physiology and pathology. , 2000, The Journal of nutrition.

[4]  W. Oertel,et al.  Impaired inhibitory Fcγ receptor IIB expression on B cells in chronic inflammatory demyelinating polyneuropathy , 2009, Proceedings of the National Academy of Sciences.

[5]  Lei Wang,et al.  Network-enabled gene expression analysis , 2012, BMC Bioinformatics.

[6]  Kenneth A. Bollen,et al.  Representing general theoretical concepts in structural equation models: the role of composite variables , 2008, Environmental and Ecological Statistics.

[7]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[8]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Harri T. Kiiveri,et al.  Multivariate analysis of microarray data: differential expression and differential connection , 2011, BMC Bioinformatics.

[10]  Xihong Lin,et al.  Sparse linear discriminant analysis for simultaneous testing for the significance of a gene set/pathway and gene selection , 2009, Bioinform..

[11]  Xiao-Lin Wu,et al.  Inferring causal phenotype networks using structural equation models , 2011, Genetics Selection Evolution.

[12]  Pooja Mittal,et al.  A novel signaling pathway impact analysis , 2009, Bioinform..

[13]  Alexander R. Pico,et al.  The public road to high-quality curated biological pathways. , 2008, Drug discovery today.

[14]  C. Turner,et al.  Crk Associates with a Multimolecular Paxillin / GIT 2 /-PIX Complex and Promotes Rac-dependent Relocalization of Paxillin to Focal Contacts , 2002 .

[15]  J. Trojanowski,et al.  Variations in the progranulin gene affect global gene expression in frontotemporal lobar degeneration. , 2008, Human molecular genetics.

[16]  Yves Rosseel,et al.  lavaan: An R Package for Structural Equation Modeling , 2012 .

[17]  R. Stine,et al.  Bootstrapping Goodness-of-Fit Measures in Structural Equation Models , 1992 .

[18]  Xiao Wu,et al.  Comparative genetic pathway analysis using structural equation Modeling , 2011, 2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS).

[19]  J. Ravetch,et al.  Fcγ receptors as regulators of immune responses , 2008, Nature Reviews Immunology.

[20]  J. Pearl,et al.  A New Identification Condition for Recursive Models With Correlated Errors , 2002 .

[21]  J. Bennett,et al.  The B cell response in multiple sclerosis , 2006, Neurological research.

[22]  P. Dodd,et al.  Glutamate-mediated excitotoxicity and neurodegeneration in Alzheimer’s disease , 2004, Neurochemistry International.

[23]  Gang Li,et al.  R Functions for Sample Size and Probability Calculations for Assessing Consistency of Treatment Effects in Multi-Regional Clinical Trials , 2012 .

[24]  S. Aburatani Application of Structure Equation Modeling for Inferring a Serial Transcriptional Regulation in Yeast , 2011, Gene regulation and systems biology.

[25]  Calbindin D-28k and parvalbumin immunoreactivity in the frontal cortex in patients with frontal lobe dementia of non-Alzheimer type associated with amyotrophic lateral sclerosis. , 1993, Journal of neurology, neurosurgery, and psychiatry.

[26]  D. A. Kenny,et al.  Correlation and Causation. , 1982 .

[27]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[28]  Edward E. Rigdon,et al.  A Necessary and Sufficient Identification Rule for Structural Models Estimated in Practice. , 1995, Multivariate behavioral research.

[29]  Cathy H. Wu,et al.  PIRSF Family Classification System for Protein Functional and Evolutionary Analysis , 2006, Evolutionary bioinformatics online.

[30]  Monica Chiogna,et al.  Along signal paths: an empirical gene set approach exploiting pathway topology , 2012, Nucleic acids research.

[31]  Robert V Farese,et al.  Functional Genomic Analyses Identify Pathways Dysregulated by Progranulin Deficiency, Implicating Wnt Signaling , 2011, Neuron.

[32]  Interleukin-1β Promotes Long-Term Potentiation in Patients with Multiple Sclerosis , 2014, NeuroMolecular Medicine.

[33]  Dean Y. Li,et al.  Interleukin receptor activates a MYD88-ARNO-ARF6 cascade to disrupt vascular stability , 2012, Nature.

[34]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[35]  J. Pearl Graphs, Causality, and Structural Equation Models , 1998 .

[36]  Xiaojuan Mi,et al.  Structural Equation Modeling of Gene–Environment Interactions in Coronary Heart Disease , 2011, Annals of human genetics.

[37]  Keith Shockley,et al.  Structural Model Analysis of Multiple Quantitative Traits , 2006, PLoS genetics.

[38]  J. Ravetch,et al.  Fcgamma receptors as regulators of immune responses. , 2008, Nature reviews. Immunology.

[39]  Atul J. Butte,et al.  Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges , 2012, PLoS Comput. Biol..

[40]  Isabel M. Tienda-Luna,et al.  Reverse engineering gene regulatory networks , 2009, IEEE Signal Processing Magazine.

[41]  M. Browne,et al.  Alternative Ways of Assessing Model Fit , 1992 .

[42]  I. Ferrer Neurons and Their Dendrites in Frontotemporal Dementia , 1999, Dementia and Geriatric Cognitive Disorders.

[43]  D. Cox,et al.  B cell exchange across the blood-brain barrier in multiple sclerosis. , 2012, The Journal of clinical investigation.

[44]  Bill Shipley,et al.  Cause and Correlation in Biology: A User''s Guide to Path Analysis , 2016 .

[45]  M. Xiong,et al.  Identification of genetic networks. , 2004, Genetics.

[46]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[47]  Susmita Datta,et al.  A statistical framework for differential network analysis from microarray data , 2010, BMC Bioinformatics.

[48]  Jun Covariance Structure Models for Gene Expression Microarray Data , 2006 .

[49]  George Stephanopoulos,et al.  Microarray detection of E2F pathway activation and other targets in multiple sclerosis peripheral blood mononuclear cells , 2004, Journal of Neuroimmunology.

[50]  Peter Bühlmann,et al.  Causal Inference Using Graphical Models with the R Package pcalg , 2012 .

[51]  Taesung Park,et al.  Application of Structural Equation Models to Genome-wide Association Analysis , 2010 .

[52]  C. Lippa,et al.  Review: Disruption of the Postsynaptic Density in Alzheimer’s Disease and Other Neurodegenerative Dementias , 2010, American journal of Alzheimer's disease and other dementias.

[53]  P. Bentler,et al.  Cutoff criteria for fit indexes in covariance structure analysis : Conventional criteria versus new alternatives , 1999 .

[54]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[55]  R. Fitzsimonds,et al.  Impaired PtdIns(4,5)P2 synthesis in nerve terminals produces defects in synaptic vesicle trafficking , 2004, Nature.

[56]  M A Province,et al.  The Future of Path Analysis, Segregation Analysis, and Combined Models for Genetic Dissection of Complex Traits , 1999, Human Heredity.

[57]  M. Sheng,et al.  PDZ domain proteins of synapses , 2004, Nature Reviews Neuroscience.

[58]  M. Kubo,et al.  PLD4 as a novel susceptibility gene for systemic sclerosis in a Japanese population. , 2013, Arthritis and rheumatism.

[59]  Paola Giunti,et al.  Deletion at ITPR1 Underlies Ataxia in Mice and Spinocerebellar Ataxia 15 in Humans , 2007, PLoS genetics.

[60]  B. Diamond,et al.  Selective dysregulation of the FcγIIB receptor on memory B cells in SLE , 2006, The Journal of Experimental Medicine.

[61]  F. Gilli Role of differential expression of interferon receptor isoforms on the response of multiple sclerosis patients to therapy with interferon beta. , 2010, Journal of interferon & cytokine research : the official journal of the International Society for Interferon and Cytokine Research.

[62]  M. Robinson,et al.  The role of glutamate transporters in neurodegenerative diseases and potential opportunities for intervention , 2007, Neurochemistry International.

[63]  Andrea L. Rosso,et al.  Disruption of glutamate receptors at Shank-postsynaptic platform in Alzheimer's disease , 2009, Brain Research.