相关论文

Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges

Abstract:Pathway analysis has become the first choice for gaining insight into the underlying biology of differentially expressed genes and proteins, as it reduces complexity and has increased explanatory power. We discuss the evolution of knowledge base–driven pathway analysis over its first decade, distinctly divided into three generations. We also discuss the limitations that are specific to each generation, and how they are addressed by successive generations of methods. We identify a number of annotation challenges that must be addressed to enable development of the next generation of pathway analysis methods. Furthermore, we identify a number of methodological challenges that the next generation of methods must tackle to take advantage of the technological advances in genomics and proteomics in order to improve specificity, sensitivity, and relevance of pathway analysis.

参考文献

[1]  Tao Chen,et al.  GOFFA: Gene Ontology For Functional Analysis – A FDA Gene Ontology Tool for Analysis of Genomic and Proteomic Data , 2006, BMC Bioinformatics.

[2]  Hao Xiong,et al.  Non-linear tests for identifying differentially expressed genes or genetic networks , 2006, Bioinform..

[3]  D. Botstein,et al.  WISP genes are members of the connective tissue growth factor family that are up-regulated in wnt-1-transformed cells and aberrantly expressed in human colon tumors. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[4]  W. Fantl,et al.  Regulation of Cyclooxygenase-2 and Periostin by Wnt-3 in Mouse Mammary Epithelial Cells* , 2000, The Journal of Biological Chemistry.

[5]  Daniel J. Vis,et al.  T-profiler: scoring the activity of predefined groups of genes using gene expression data , 2005, Nucleic Acids Res..

[6]  William Stafford Noble,et al.  Exploring Gene Expression Data with Class Scores , 2001, Pacific Symposium on Biocomputing.

[7]  Denis Thieffry,et al.  RegulonDB: a database on transcriptional regulation in Escherichia coli , 1998, Nucleic Acids Res..

[8]  John D. Storey,et al.  A network-based analysis of systemic inflammation in humans , 2005, Nature.

[9]  A. Butte,et al.  AILUN: reannotating gene expression data automatically , 2007, Nature Methods.

[10]  Steven C. Lawlor,et al.  GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways , 2002, Nature Genetics.

[11]  Angel Rubio,et al.  GARBAN: genomic analysis and rapid biological annotation of cDNA microarray and proteomic data , 2003, Bioinform..

[12]  Purvesh Khatri,et al.  A System Biology Approach for the Steady-State Analysis of Gene Signaling Networks , 2007, CIARP.

[13]  Daniel L. Hartl,et al.  GeneMerge - Post-genomic Analysis, Data Mining, and Hypothesis Testing , 2003, Bioinform..

[14]  Korbinian Strimmer,et al.  A general modular framework for gene set enrichment analysis , 2009, BMC Bioinformatics.

[15]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[16]  Oliver H. Tam,et al.  Pseudogene-derived small interfering RNAs regulate gene expression in mouse oocytes , 2008, Nature.

[17]  Ker-Chau Li,et al.  Genome-wide coexpression dynamics: Theory and application , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Steven C. Lawlor,et al.  MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data , 2003, Genome Biology.

[19]  E Birney,et al.  The Genome Knowledgebase: a resource for biologists and bioinformaticists. , 2003, Cold Spring Harbor symposia on quantitative biology.

[20]  Peter D. Karp,et al.  The MetaCyc Database , 2002, Nucleic Acids Res..

[21]  John N. Weinstein,et al.  High-Throughput GoMiner, an 'industrial-strength' integrative gene ontology tool for interpretation of multiple-microarray experiments, with application to studies of Common Variable Immune Deficiency (CVID) , 2005, BMC Bioinformatics.

[22]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[23]  P. D. Karp,et al.  The outcomes of pathway database computations depend on pathway ontology , 2006, Nucleic acids research.

[24]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[25]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[26]  U. Mansmann,et al.  Testing Differential Gene Expression in Functional Groups , 2005, Methods of Information in Medicine.

[27]  Lin Fang,et al.  WEGO: a web tool for plotting GO annotations , 2006, Nucleic Acids Res..

[28]  Rainer Breitling,et al.  Iterative Group Analysis (iGA): A simple tool to enhance sensitivity and facilitate interpretation of microarray experiments , 2004, BMC Bioinformatics.

[29]  May D. Wang,et al.  GoMiner: a resource for biological interpretation of genomic and proteomic data , 2003, Genome Biology.

[30]  Enrica Calura,et al.  The Biological Connection Markup Language: a SBGN-compliant format for visualization, filtering and analysis of biological pathways , 2011, Bioinform..

[31]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2006, Nucleic Acids Res..

[32]  Pooja Mittal,et al.  A novel signaling pathway impact analysis , 2009, Bioinform..

[33]  Mone Zaidi,et al.  Complexity in signal transduction , 2010, Annals of the New York Academy of Sciences.

[34]  Chris Sander,et al.  Characterizing gene sets with FuncAssociate , 2003, Bioinform..

[35]  Pornpimol Charoentong,et al.  ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks , 2009, Bioinform..

[36]  Tim Hubbard Finishing the euchromatic sequence of the human genome , 2004 .

[37]  Qi Zheng,et al.  GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis , 2008, Nucleic Acids Res..

[38]  Galina V. Glazko,et al.  A Multivariate Extension of the gene Set Enrichment Analysis , 2007, J. Bioinform. Comput. Biol..

[39]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[40]  Seon-Young Kim,et al.  PAGE: Parametric Analysis of Gene Set Enrichment , 2005, BMC Bioinform..

[41]  W F Bodmer,et al.  Target genes of beta-catenin-T cell-factor/lymphoid-enhancer-factor signaling in human colorectal carcinomas. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Peter J. Park,et al.  A multivariate approach for integrating genome-wide expression data and biological knowledge , 2006, Bioinform..

[43]  A. Bauer-Mehren,et al.  Pathway databases and tools for their exploitation: benefits, current limitations and challenges , 2009, Molecular systems biology.

[44]  Jean-Daniel Zucker,et al.  Clustering Biological Annotations and Gene Expression Data to Identify Putatively Co-regulated Biological Processes , 2006, J. Bioinform. Comput. Biol..

[45]  Purvesh Khatri,et al.  Babel's tower revisited: a universal resource for cross-referencing across annotation databases , 2006, Bioinform..

[46]  Bing Zhang,et al.  WebGestalt: an integrated system for exploring gene sets in various biological contexts , 2005, Nucleic Acids Res..

[47]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[48]  Qi Liu,et al.  Improving gene set analysis of microarray data by SAM-GS , 2007, BMC Bioinformatics.

[49]  Zhen Jiang,et al.  Bioconductor Project Bioconductor Project Working Papers Year Paper Extensions to Gene Set Enrichment , 2013 .

[50]  T. Golub,et al.  mTOR inhibition reverses Akt-dependent prostate intraepithelial neoplasia through regulation of apoptotic and HIF-1-dependent pathways , 2004, Nature Medicine.

[51]  P. Khatri,et al.  Profiling gene expression using onto-express. , 2002, Genomics.

[52]  P. Park,et al.  Discovering statistically significant pathways in expression profiling studies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[53]  David Martin,et al.  GOToolBox: functional analysis of gene datasets based on Gene Ontology , 2004, Genome Biology.

[54]  Purvesh Khatri,et al.  Ontological analysis of gene expression data: current tools, limitations, and open problems , 2005, Bioinform..

[55]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[56]  C. Ji,et al.  A novel splice variant of human XRN2 gene is mainly expressed in blood leukocyte† , 2005, DNA sequence : the journal of DNA sequencing and mapping.

[57]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[58]  Peng Xiao,et al.  Hotelling's T2 multivariate profiling for detecting differential expression in microarrays , 2005, Bioinform..

[59]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[60]  K. Buetow,et al.  Pathways of Distinction Analysis: A New Technique for Multi–SNP Analysis of GWAS Data , 2010, PLoS genetics.

[61]  V. Arango,et al.  Using the Gene Ontology for Microarray Data Mining: A Comparison of Methods and Application to Age Effects in Human Prefrontal Cortex , 2004, Neurochemical Research.

[62]  Joaquín Dopazo,et al.  Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information , 2005, Bioinform..

[63]  Christina Backes,et al.  GeneTrail—advanced gene set enrichment analysis , 2007, Nucleic Acids Res..

[64]  Andrew B. Nobel,et al.  Significance analysis of functional categories in gene expression studies: a structured permutation approach , 2005, Bioinform..

[65]  Sang-Bae Kim,et al.  GAzer: gene set analyzer , 2007, Bioinform..

[66]  K. Bretonnel Cohen,et al.  Manual curation is not sufficient for annotation of genomic databases , 2007, ISMB/ECCB.

[67]  T. Nikolcheva,et al.  Deconvoluting Post-Transplant Immunity: Cell Subset-Specific Mapping Reveals Pathways for Activation and Expansion of Memory T, Monocytes and B Cells , 2010, PloS one.

[68]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[69]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[70]  Ulrich Mansmann,et al.  GlobalANCOVA: exploration and assessment of gene group effects , 2008, Bioinform..

[71]  T. Speed,et al.  GOstat: find statistically overrepresented Gene Ontologies within a group of genes. , 2004, Bioinformatics.

[72]  Thomas Lengauer,et al.  Statistical Applications in Genetics and Molecular Biology Calculating the Statistical Significance of Changes in Pathway Activity From Gene Expression Data , 2011 .

[73]  Joaquín Dopazo,et al.  FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes , 2004, Bioinform..

[74]  R. Moon,et al.  A beta-catenin/XTcf-3 complex binds to the siamois promoter to regulate dorsal axis specification in Xenopus. , 1997, Genes & development.

[75]  P. Khatri,et al.  Global functional profiling of gene expression. , 2003, Genomics.

[76]  R. Gentleman,et al.  Gene expression profile of adult T-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival. , 2004, Blood.

[77]  Patrik Edén,et al.  Comparing Functional Annotation Analyses with Catmap Comparing Functional Annotation Analyses with Catmap , 2004 .

[78]  Mark D. Robinson,et al.  FunSpec: a web-based cluster interpreter for yeast , 2002, BMC Bioinformatics.

[79]  K. Dolinski,et al.  Use and misuse of the gene ontology annotations , 2008, Nature Reviews Genetics.

[80]  Keli Xu,et al.  Calcium oscillations increase the efficiency and specificity of gene expression , 1998, Nature.

[81]  Zhou Du,et al.  agriGO: a GO analysis toolkit for the agricultural community , 2010, Nucleic Acids Res..

[82]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[83]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[84]  Miguel A. Andrade-Navarro,et al.  Inconsistencies over time in 5% of NetAffx probe-to-gene annotations , 2005, BMC Bioinformatics.

[85]  Ali Shojaie,et al.  Analysis of Gene Sets Based on the Underlying Regulatory Network , 2009, J. Comput. Biol..

[86]  International Human Genome Sequencing Consortium Finishing the euchromatic sequence of the human genome , 2004 .

[87]  Frank Emmert-Streib,et al.  Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets , 2009, Bioinform..

[88]  Jelle J. Goeman,et al.  A global test for groups of genes: testing association with a clinical outcome , 2004, Bioinform..

[89]  R. Tibshirani,et al.  On testing the significance of sets of genes , 2006, math/0610667.

[90]  P. Khatri,et al.  A systems biology approach for pathway level analysis. , 2007, Genome research.

[91]  Temple F. Smith,et al.  Overview of the Alliance for Cellular Signaling , 2002, Nature.

引用
Signaling pathway impact analysis by incorporating the importance and specificity of genes (SPIA-IS)
Comput. Biol. Chem.
2017
Protein-protein docking for interactomic studies and its aplication to personalized medicine
2017
Biostatistics, Data Mining and Computational Modeling
2016
Differential expression and feature selection in the analysis of multiple omics studies
2018
Integrated Omics: Tools, Advances, and Future Approaches.
Journal of molecular endocrinology
2019
Kernel-Based Pathway Approaches for Testing and Selection
2017
Bladder cancer specific pathway interaction networks
ECAL
2013
A flexible ontology for inference of emergent whole cell function from relationships between subcellular processes
bioRxiv
2017
A flexible ontology for inference of emergent whole cell function from relationships between subcellular processes
Scientific Reports
2017
Data Analysis for Antibody Arrays.
Methods in molecular biology
2021
Molecular network analysis enhances understanding of the biology of mental disorders
BioEssays : news and reviews in molecular, cellular and developmental biology
2014
Bioinformatics Meets Biomedicine: OncoFinder, a Quantitative Approach for Interrogating Molecular Pathways Using Gene Expression Data.
Methods in molecular biology
2017
Identification of gene pathways implicated in Alzheimer's disease using longitudinal imaging phenotypes with sparse regression☆
NeuroImage
2012
Gene annotation bias impedes biomedical research
Scientific Reports
2017
A multi-functional analyzer uses parameter constraints to improve the efficiency of model-based gene-set analysis
1310.6322
2013
A mixture model to detect edges in sparse co-expression graphs with an application for comparing breast cancer subtypes.
PloS one
2018
Developing Network-Based Systems Toxicology by Combining Transcriptomics Data with Literature Mining and Multiscale Quantitative Modeling
Bioinformatics in the Era of Post Genomics and Big Data
2018
Differential pathway network analysis used to identify key pathways associated with pediatric pneumonia.
Microbial pathogenesis
2016
Integration Of Mutation And Gene Expression Data To Identify Disease Subtypes
2017
Prioritizing network communities
Nature Communications
2018