Competitive pathway analysis using Structural Equation Models (CPA-SEM) for gene expression data

There is an increasing interest in the pathway analysis of multiple genes and complex traits in association studies. Recently, a number of methods of pathway analysis have been developed to detect the novel pathways associated with human complex traits. In this paper, we propose a novel statistical approach for competitive pathway analysis based on Structural Equation Modeling (CPA-SEM), taking advantage of prior knowledge on existing relationships between genes in a pathway. Our CPA-SEM identifies pathways associated with traits of interest. The CPA-SEM approach is different from the previous SEM-based approaches in that it considers all possible sub-pathways into account and performs permutation based robust analysis. We applied the proposed CPA-SEM method to gene expression data of gastric cancer (GSE27342), and found that mTOR signaling pathway was significantly associated with gastric cancer. This pathway has previously been reported to be associated with gastric cancer. In conclusion, our CPA-SEM analysis provides a better understanding of biological mechanism by identifying pathways associated with a trait of interest.

[1]  P. Khatri,et al.  Global functional profiling of gene expression ? ? This work was funded in part by a Sun Microsystem , 2003 .

[2]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[3]  En-ling Li,et al.  [The relationship of mTOR signaling pathway and histone acetylation in human gastric cancer cell lines]. , 2007, Zhonghua yi xue yi chuan xue za zhi = Zhonghua yixue yichuanxue zazhi = Chinese journal of medical genetics.

[4]  Pooja Mittal,et al.  A novel signaling pathway impact analysis , 2009, Bioinform..

[5]  Haiqiang Jiang,et al.  The Functional Variant in the 3'UTR of IGF1 with the Risk of Gastric Cancer in a Chinese Population , 2015, Cellular Physiology and Biochemistry.

[6]  Mark R Chance,et al.  Pathway Analyses and Understanding Disease Associations , 2013, Current Genetic Medicine Reports.

[7]  M. Xiong,et al.  Identification of genetic networks. , 2004, Genetics.

[8]  Omar Abdel-Rahman,et al.  Targeting vascular endothelial growth factor (VEGF) pathway in gastric cancer: preclinical and clinical aspects. , 2015, Critical reviews in oncology/hematology.

[9]  Yan Tao,et al.  PKG II Inhibits EGF/EGFR-Induced Migration of Gastric Cancer Cells , 2013, PloS one.

[10]  Yan Li,et al.  Biphasic ER-α36-mediated estrogen signaling regulates growth of gastric cancer cells. , 2014, International journal of oncology.

[11]  P. Khatri,et al.  Global functional profiling of gene expression. , 2003, Genomics.

[12]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[13]  Xiao-Yan Xu,et al.  PI3K/Akt/mTOR signaling pathway in cancer stem cells: from basic research to clinical application. , 2015, American journal of cancer research.

[14]  P. Park,et al.  Discovering statistically significant pathways in expression profiling studies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  T. Speed,et al.  GOstat: find statistically overrepresented Gene Ontologies within a group of genes. , 2004, Bioinformatics.

[16]  Xiaodong Zhou,et al.  Lymphangiogenesis in Gastric Cancer regulated through Akt/mTOR-VEGF-C/VEGF-D axis , 2015, BMC Cancer.

[17]  Xiaojuan Mi,et al.  Structural Equation Modeling of Gene–Environment Interactions in Coronary Heart Disease , 2011, Annals of human genetics.

[18]  Bill Shipley,et al.  Cause and Correlation in Biology: A User''s Guide to Path Analysis , 2016 .

[19]  David Martin,et al.  GOToolBox: functional analysis of gene datasets based on Gene Ontology , 2004, Genome Biology.

[20]  Daniele Pepe,et al.  Investigating perturbed pathway modules from gene expression data via structural equation models , 2014, BMC Bioinformatics.

[21]  V. Arango,et al.  Using the Gene Ontology for Microarray Data Mining: A Comparison of Methods and Application to Age Effects in Human Prefrontal Cortex , 2004, Neurochemical Research.

[22]  Yan Song,et al.  Identification of genes and long non-coding RNAs associated with the pathogenesis of gastric cancer. , 2015, Oncology reports.

[23]  Steven C. Lawlor,et al.  MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data , 2003, Genome Biology.

[24]  Lincoln Stein,et al.  Reactome: a database of reactions, pathways and biological processes , 2010, Nucleic Acids Res..

[25]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[26]  A. Shapiro,et al.  On the multivariate asymptotic distribution of sequential Chi-square statistics , 1985 .

[27]  G. Mcmahon,et al.  VEGF receptor signaling in tumor angiogenesis. , 2000, The oncologist.

[28]  Peter M. Bentler,et al.  Covariance Structure Models for Gene Expression Microarray Data , 2003 .

[29]  Xiao Wu,et al.  Comparative genetic pathway analysis using structural equation Modeling , 2011, 2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS).

[30]  Korbinian Strimmer,et al.  BMC Bioinformatics BioMed Central Methodology article A general modular framework for gene set enrichment analysis , 2009 .

[31]  Kenneth A. Bollen,et al.  Structural Equations with Latent Variables , 1989 .

[32]  Keith Shockley,et al.  Structural Model Analysis of Multiple Quantitative Traits , 2006, PLoS genetics.

[33]  M A Province,et al.  Multivariate and multilocus variance components method, based on structural relationships to assess quantitative trait linkage via SEGPATH , 2003, Genetic epidemiology.

[34]  Wen-Chi Chou,et al.  An integrated transcriptomic and computational analysis for biomarker identification in gastric cancer , 2010, Nucleic acids research.

[35]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[36]  Taesung Park,et al.  Application of Structural Equation Models to Genome-wide Association Analysis , 2010 .

[37]  Yves Rosseel,et al.  lavaan: An R Package for Structural Equation Modeling , 2012 .

[38]  P. Khatri,et al.  Profiling gene expression using onto-express. , 2002, Genomics.

[39]  Dennis B. Troup,et al.  NCBI GEO: mining tens of millions of expression profiles—database and tools update , 2006, Nucleic Acids Res..