ComPath: An ecosystem for exploring, analyzing, and curating mappings across pathway databases

Although pathways are widely used for the analysis and representation of biological systems, their lack of clear boundaries, their dispersion across numerous databases, and the lack of interoperability impedes the evaluation of the coverage, agreements, and discrepancies between them. Here, we present ComPath, an ecosystem that supports curation of pathway mappings between databases and fosters the exploration of pathway knowledge through several novel visualizations. We have curated mappings between three of the major pathway databases and present a case study focusing on Parkinson’s disease that illustrates how ComPath can generate new biological insights by identifying pathway modules, clusters, and cross-talks with these mappings. The ComPath source code and resources are available at https://github.com/ComPath and the web application can be accessed at http://compath.scai.fraunhofer.de/.

[1]  Chris T. A. Evelo,et al.  Reactome from a WikiPathways Perspective , 2016, PLoS Comput. Biol..

[2]  Ram Rup Sarkar,et al.  Comparison of human cell signaling pathway databases—evolution, drawbacks and challenges , 2015, Database J. Biol. Databases Curation.

[3]  Subramanian Rajagopalan,et al.  Regulation of ATP13A2 via PHD2-HIF1α Signaling Is Critical for Cellular Iron Homeostasis: Implications for Parkinson's Disease , 2016, The Journal of Neuroscience.

[4]  Tim Beißbarth,et al.  Comparative study on gene set and pathway topology-based enrichment methods , 2015, BMC Bioinformatics.

[5]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[6]  Ryan Miller,et al.  WikiPathways: a multifaceted pathway database bridging metabolomics to other omics research , 2017, Nucleic Acids Res..

[7]  Lincoln D. Stein,et al.  Impact of outdated gene annotations on pathway enrichment analysis , 2016, Nature Methods.

[8]  Gerbert A. Jansen,et al.  Critical assessment of human metabolic pathway databases: a stepping stone for future integration , 2011, BMC Systems Biology.

[9]  Gary D Bader,et al.  BioPAX – A community standard for pathway data sharing , 2010, Nature Biotechnology.

[10]  Adeeb Rahman,et al.  Clustergrammer, a web-based heatmap visualization and analysis tool for high-dimensional biological data , 2017, Scientific Data.

[11]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[12]  Ryan Miller,et al.  WikiPathways: capturing the full diversity of pathway knowledge , 2015, Nucleic Acids Res..

[13]  David S. Wishart,et al.  HMDB 4.0: the human metabolome database for 2018 , 2017, Nucleic Acids Res..

[14]  A Hofman,et al.  Dietary folate, vitamin B12, and vitamin B6 and the risk of Parkinson disease , 2006, Neurology.

[15]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[16]  Atul J. Butte,et al.  Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges , 2012, PLoS Comput. Biol..

[17]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[18]  Alexander JR Bishop,et al.  Pathway Distiller - multisource biological pathway consolidation , 2012, BMC Genomics.

[19]  Chris Sander,et al.  Pathway information for systems biology , 2005, FEBS letters.

[20]  C. Chu,et al.  ATP13A2 regulates mitochondrial bioenergetics through macroautophagy , 2012, Neurobiology of Disease.

[21]  Gary D. Bader,et al.  Cytoscape.js: a graph theory library for visualisation and analysis , 2015, Bioinform..

[22]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[23]  Andreas Krämer,et al.  Causal analysis approaches in Ingenuity Pathway Analysis , 2013, Bioinform..

[24]  Doron Lancet,et al.  PathCards: multi-source consolidation of human biological pathways , 2015, Database J. Biol. Databases Curation.

[25]  Peer Bork,et al.  iPath2.0: interactive pathway explorer , 2011, Nucleic Acids Res..

[26]  Martin Hofmann-Apitius,et al.  Towards a Pathway Inventory of the Human Brain for Modeling Disease Mechanisms Underlying Neurodegeneration. , 2016, Journal of Alzheimer's disease : JAD.

[27]  Kenji Mizuguchi,et al.  Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation , 2014, PloS one.

[28]  Patrizia Agostinis,et al.  A lipid switch unlocks Parkinson’s disease-associated ATP13A2 , 2015, Proceedings of the National Academy of Sciences.

[29]  W. Tatton,et al.  Apoptosis in Parkinson's disease: Signals for neuronal degradation , 2003, Annals of neurology.

[30]  A. Goldenberg,et al.  Intertumoral Heterogeneity within Medulloblastoma Subgroups. , 2017, Cancer cell.

[31]  George M. Spyrou,et al.  PathwayConnector: finding complementary pathways to enhance functional analysis , 2018, Bioinform..

[32]  Punit Kaur,et al.  Identification of Shared Molecular Signatures Indicate the Susceptibility of Endometriosis to Multiple Sclerosis , 2018, Front. Genet..

[33]  Sara Ciucci,et al.  LIPEA: Lipid Pathway Enrichment Analysis , 2018, bioRxiv.

[34]  Wenbin Wei,et al.  The Pathway Coexpression Network: Revealing pathway relationships , 2018, PLoS Comput. Biol..

[35]  Hedi Peterson,et al.  g:Profiler—a web server for functional interpretation of gene lists (2016 update) , 2016, Nucleic Acids Res..

[36]  M. Obulesu,et al.  Apoptosis in Alzheimer’s Disease: An Understanding of the Physiology, Pathology and Therapeutic Avenues , 2014, Neurochemical Research.

[37]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Eva Budinska,et al.  A critical comparison of topology-based pathway analysis methods , 2018, PloS one.

[39]  Melinda R. Dwinell,et al.  The pathway ontology – updates and applications , 2014, Journal of Biomedical Semantics.

[40]  Helga Thorvaldsdóttir,et al.  Molecular signatures database (MSigDB) 3.0 , 2011, Bioinform..

[41]  Christophe G. Giraud-Carrier,et al.  Learning the Threshold in Hierarchical Agglomerative Clustering , 2006, 2006 5th International Conference on Machine Learning and Applications (ICMLA'06).

[42]  John Hardy,et al.  SnapShot: Genetics of Parkinson’s Disease , 2015, Cell.

[43]  Rebecca M. Perrett,et al.  The endosomal pathway in Parkinson's disease , 2015, Molecular and Cellular Neuroscience.

[44]  Brad T. Sherman,et al.  The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists , 2007, Genome Biology.

[45]  A. Bauer-Mehren,et al.  Pathway databases and tools for their exploitation: benefits, current limitations and challenges , 2009, Molecular systems biology.

[46]  Mohammad Asif Emon,et al.  Multimodal mechanistic signatures for neurodegenerative diseases (NeuroMMSig): a web server for mechanism enrichment , 2017, Bioinform..

[47]  Livia Perfetto,et al.  SIGNOR: a database of causal relationships between biological entities , 2015, Nucleic Acids Res..

[48]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[49]  Ching-Seng Ang,et al.  FunRich: An open access standalone functional enrichment and interaction network analysis tool , 2015, Proteomics.

[50]  Andrew D. Rouillard,et al.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update , 2016, Nucleic Acids Res..

[51]  Marc W Fariss,et al.  Vitamin E therapy in Parkinson's disease. , 2003, Toxicology.