Fast Approximate Shortest Hyperpaths for Inferring Pathways in Cell Signaling Hypergraphs

Cell signaling pathways, which are a series of reactions that start at receptors and end at transcription factors, are basic to systems biology. Properly modeling the reactions in such pathways requires directed hypergraphs, where an edge is now directed between two sets of vertices. Inferring a pathway by the most parsimonious series of reactions then corresponds to finding a shortest hyperpath in a directed hypergraph, which is NP-complete. The state of the art for shortest hyperpaths in cell-signaling hypergraphs solves a mixed-integer linear program to find an optimal hyperpath that is restricted to be acyclic, and offers no efficiency guarantees. We present for the first time a heuristic for general shortest hyperpaths that properly handles cycles, and is guaranteed to be efficient. Its accuracy is demonstrated through exhaustive experiments on all instances from the standard NCI-PID and Reactome pathway databases, which show the heuristic finds a hyperpath that matches the state-of-the-art mixed-integer linear program on over 99% of all instances that are acyclic. On instances where only cyclic hyperpaths exist, the heuristic surpasses the state-of-the-art, which finds no solution; on every such cyclic instance, enumerating all possible hyperpaths shows that the solution found by the heuristic is in fact optimal. 2012 ACM Subject Classification Applied computing → Bioinformatics; Applied computing → Systems biology; Theory of computation → Shortest paths; Mathematics of computing → Hypergraphs

[1]  M. Queyranne,et al.  K best solutions to combinatorial optimization problems , 1985 .

[2]  Steffen Klamt,et al.  Hypergraphs and Cellular Networks , 2009, PLoS Comput. Biol..

[3]  Daniele Pretolani,et al.  A remark on the definition of B-hyperpath , 2001 .

[4]  M. Duñach,et al.  Tyrosine Phosphorylation of Plakoglobin Causes Contrary Effects on Its Association with Desmosomes and Adherens Junction Components and Modulates β-Catenin-Mediated Transcription , 2003, Molecular and Cellular Biology.

[5]  Jens Nielsen,et al.  Reconstruction and logical modeling of glucose repression signaling pathways in Saccharomyces cerevisiae , 2009, BMC Systems Biology.

[6]  X. Hou,et al.  A new role of NUAK1: directly phosphorylating p53 and regulating cell proliferation , 2011, Oncogene.

[7]  Giuseppe F. Italiano,et al.  Online Maintenance of Minimal Directed Hypergraphs , 1989 .

[8]  B. Oneda,et al.  The Metalloprotease Meprinβ Processes E-Cadherin and Weakens Intercellular Adhesion , 2008, PloS one.

[9]  Lincoln Stein,et al.  Reactome: a knowledgebase of biological pathways , 2004, Nucleic Acids Res..

[10]  David Tuck,et al.  A hyper-graph approach for analyzing transcriptional networks in breast cancer , 2010, BCB '10.

[11]  T M Murali,et al.  Hypergraph-based connectivity measures for signaling pathway topologies , 2019, PLoS Comput. Biol..

[12]  Gary D. Bader,et al.  Using Biological Pathway Data with Paxtools , 2013, PLoS Comput. Biol..

[13]  Luay Nakhleh,et al.  Properties of metabolic graphs: biological organization or representation artifacts? , 2011, BMC Bioinformatics.

[14]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[15]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[16]  Giorgio Ausiello,et al.  Directed hypergraphs: Introduction and fundamental algorithms - A survey , 2017, Theor. Comput. Sci..

[17]  Giorgio Gallo,et al.  Directed Hypergraphs and Applications , 1993, Discret. Appl. Math..

[18]  Pablo Carbonell,et al.  Enumerating metabolic pathways for the production of heterologous target chemicals in chassis organisms , 2012, BMC Systems Biology.

[19]  T. Ideker,et al.  Modeling cellular machinery through biological network comparison , 2006, Nature Biotechnology.

[20]  Emad Ramadan,et al.  A hypergraph model for the yeast protein complex network , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[21]  Lenwood S. Heath,et al.  Semantics of Multimodal Network Models , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[22]  Leen Stougie,et al.  Enumerating Precursor Sets of Target Metabolites in a Metabolic Network , 2008, WABI.

[23]  Justin Zhan,et al.  Modeling Cell Communication with Time-Dependent Signaling Hypergraphs , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[24]  Leen Stougie,et al.  Enumeration of minimal stoichiometric precursor sets in metabolic networks , 2016, Algorithms for Molecular Biology.

[25]  C. D’Souza-Schorey,et al.  Lysosomal Targeting of E-Cadherin: a Unique Mechanism for the Down-Regulation of Cell-Cell Adhesion during Epithelial to Mesenchymal Transitions , 2005, Molecular and Cellular Biology.

[26]  Zhenjun Hu,et al.  Towards zoomable multidimensional maps of the cell , 2007, Nature Biotechnology.

[27]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[28]  T. M. Murali,et al.  Pathway Analysis with Signaling Hypergraphs , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[29]  Kei-Hoi Cheung,et al.  BioPAX – A community standard for pathway data sharing , 2010, Nature Biotechnology.

[30]  T M Murali,et al.  Signaling hypergraphs. , 2014, Trends in biotechnology.

[31]  Leen Stougie,et al.  Algorithms and complexity of enumerating minimal precursor sets in genome-wide metabolic networks , 2012, Bioinform..