The latent geometry of the human protein interaction network

Abstract Motivation A series of recently introduced algorithms and models advocates for the existence of a hyperbolic geometry underlying the network representation of complex systems. Since the human protein interaction network (hPIN) has a complex architecture, we hypothesized that uncovering its latent geometry could ease challenging problems in systems biology, translating them into measuring distances between proteins. Results We embedded the hPIN to hyperbolic space and found that the inferred coordinates of nodes capture biologically relevant features, like protein age, function and cellular localization. This means that the representation of the hPIN in the two-dimensional hyperbolic plane offers a novel and informative way to visualize proteins and their interactions. We then used these coordinates to compute hyperbolic distances between proteins, which served as likelihood scores for the prediction of plausible protein interactions. Finally, we observed that proteins can efficiently communicate with each other via a greedy routing process, guided by the latent geometry of the hPIN. We show that these efficient communication channels can be used to determine the core members of signal transduction pathways and to study how system perturbations impact their efficiency. Availability and implementation An R implementation of our network embedder is available at https://github.com/galanisl/NetHypGeom. Also, a web tool for the geometric analysis of the hPIN accompanies this text at http://cbdm-01.zdv.uni-mainz.de/~galanisl/gapi. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Anupam Gupta,et al.  Discovering pathways by orienting edges in protein interaction networks , 2010, Nucleic acids research.

[2]  P. Mehlen,et al.  Patched-1 Proapoptotic Activity Is Downregulated by Modification of K1413 by the E3 Ubiquitin-Protein Ligase Itchy Homolog , 2014, Molecular and Cellular Biology.

[3]  S. Oliver Proteomics: Guilt-by-association goes global , 2000, Nature.

[4]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[5]  Marián Boguñá,et al.  Popularity versus similarity in growing networks , 2011, Nature.

[6]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[7]  Carlo Vittorio Cannistraci,et al.  Minimum curvilinearity to enhance topological prediction of protein interactions by network embedding , 2013, Bioinform..

[8]  Lubert Stryer,et al.  Signal-Transduction Pathways: An Introduction to Information Metabolism , 2002 .

[9]  S. Gerstberger,et al.  A census of human RNA-binding proteins , 2014, Nature Reviews Genetics.

[10]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[11]  Timothy Ravasi,et al.  From link-prediction in brain connectomes and protein interactomes to the local-community-paradigm in complex networks , 2013, Scientific Reports.

[12]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[13]  Miguel A. Andrade-Navarro,et al.  Manifold learning and maximum likelihood estimation for hyperbolic network embedding , 2016, Applied Network Science.

[14]  Martin H. Schaefer,et al.  HIPPIE: Integrating Protein Interaction Networks with Experiment Based Quality Scores , 2012, PloS one.

[15]  Andreas Zell,et al.  BowTieBuilder: modeling signal transduction pathways , 2009, BMC Systems Biology.

[16]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[17]  Amin Vahdat,et al.  Hyperbolic Geometry of Complex Networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Marián Boguñá,et al.  Network Cosmology , 2012, Scientific Reports.

[19]  Yoshihide Hayashizaki,et al.  Construction of reliable protein-protein interaction networks with a new interaction generality measure , 2003, Bioinform..

[20]  Martin H. Schaefer,et al.  HIPPIE v2.0: enhancing meaningfulness and reliability of protein–protein interaction networks , 2016, Nucleic Acids Res..

[21]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[22]  Desmond J. Higham,et al.  Geometric De-noising of Protein-Protein Interaction Networks , 2009, PLoS Comput. Biol..

[23]  Yusuke Nakamura,et al.  DKK1, a negative regulator of Wnt signaling, is a target of the β-catenin/TCF pathway , 2004, Oncogene.

[24]  Yujun Shen,et al.  Small ubiquitin-related modifier 2/3 interacts with p65 and stabilizes it in the cytoplasm in HBV-associated hepatocellular carcinoma , 2015, BMC Cancer.

[25]  Linyuan Lü,et al.  Toward link predictability of complex networks , 2015, Proceedings of the National Academy of Sciences.

[26]  R. Sharan,et al.  Toward accurate reconstruction of functional protein networks , 2009, Molecular systems biology.

[27]  Dmitri V. Krioukov,et al.  Network Geometry Inference using Common Neighbors , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Miguel A. Andrade-Navarro,et al.  Distance Distribution between Complex Network Nodes in Hyperbolic Space , 2016, Complex Syst..

[29]  Zhu-Hong You,et al.  Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data , 2010, Bioinform..

[30]  Devin K. Schweppe,et al.  Architecture of the human interactome defines protein communities and disease networks , 2017, Nature.

[31]  Ying Zhang,et al.  Requirement of Smurf-mediated endocytosis of Patched1 in sonic hedgehog signal reception , 2014, eLife.

[32]  Marc Barthelemy,et al.  Spatial Networks , 2010, Encyclopedia of Social Network Analysis and Mining.

[33]  Ryan Miller,et al.  WikiPathways: capturing the full diversity of pathway knowledge , 2015, Nucleic Acids Res..

[34]  Miguel A. Andrade-Navarro,et al.  Efficient embedding of complex networks to hyperbolic space via their Laplacian , 2016, Scientific Reports.

[35]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[36]  C. Niehrs,et al.  Function and biological roles of the Dickkopf family of Wnt modulators , 2006, Oncogene.

[37]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[38]  Timothy Ravasi,et al.  Exploitation of genetic interaction network topology for the prediction of epistatic behavior. , 2013, Genomics.

[39]  Marián Boguñá,et al.  Navigability of Complex Networks , 2007, ArXiv.

[40]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  Joanna L. Sharman,et al.  The IUPHAR/BPS Guide to PHARMACOLOGY in 2016: towards curated quantitative interactions between 1300 protein targets and 6000 ligands , 2015, Nucleic Acids Res..

[42]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[43]  Bairong Shen,et al.  New genes drive the evolution of gene interaction networks in the human and mouse genomes , 2015, Genome Biology.

[44]  Miguel A. Andrade-Navarro,et al.  FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases , 2016, J. Comput. Biol..

[45]  Eun Mi Kim,et al.  The mouse small ubiquitin-like modifier-2 (SUMO-2) inhibits interleukin-12 (IL-12) production in mature dendritic cells by blocking the translocation of the p65 subunit of NFκB into the nucleus. , 2011, Molecular immunology.

[46]  Albert Y. Zomaya,et al.  A Survey of Mobile Device Virtualization , 2016, ACM Comput. Surv..

[47]  Anna Ritz,et al.  Pathways on demand: automated reconstruction of human signaling networks , 2016, npj Systems Biology and Applications.

[48]  Yoshihide Hayashizaki,et al.  Interaction generality, a measurement to assess the reliability of a protein-protein interaction. , 2002, Nucleic acids research.

[49]  Allen Li,et al.  Induction of sonic hedgehog mediators by transforming growth factor-beta: Smad3-dependent activation of Gli2 and Gli1 expression in vitro and in vivo. , 2007, Cancer research.

[50]  Marián Boguñá,et al.  Sustaining the Internet with Hyperbolic Mapping , 2010, Nature communications.

[51]  Yu Xue,et al.  AnimalTFDB 2.0: a resource for expression, prediction and functional study of animal transcription factors , 2014, Nucleic Acids Res..

[52]  Gloria M. Sheynkman,et al.  Proteome-Scale Human Interactomics. , 2017, Trends in biochemical sciences.

[53]  Antoine Allard,et al.  The hidden hyperbolic geometry of international trade: World Trade Atlas 1870–2013 , 2015, Scientific Reports.

[54]  Mong-Li Lee,et al.  Increasing confidence of protein interactomes using network topological metrics , 2006, Bioinform..

[55]  I. Screpanti,et al.  Numb activates the E3 ligase Itch to control Gli1 function through a novel degradation signal , 2011, Oncogene.

[56]  I. Taylor,et al.  Protein interaction networks in medicine and disease , 2012, Proteomics.

[57]  Fernando Berzal Galiano,et al.  A Survey of Link Prediction in Complex Networks , 2016, ACM Comput. Surv..

[58]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[59]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[60]  T. Ideker,et al.  A decade of systems biology. , 2010, Annual review of cell and developmental biology.

[61]  Marián Boguñá,et al.  Uncovering the hidden geometry behind metabolic networks. , 2011, Molecular bioSystems.

[62]  Sreeurpa Ray,et al.  The Cell: A Molecular Approach , 1996 .

[63]  Gregorio Alanis-Lobato,et al.  Mining protein interactomes to improve their reliability and support the advancement of network medicine , 2015, Front. Genet..

[64]  Pedro Beltrão,et al.  Specificity and Evolvability in Eukaryotic Protein Interaction Networks , 2007, PLoS Comput. Biol..

[65]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[66]  Igor Jurisica,et al.  Modeling interactome: scale-free or geometric? , 2004, Bioinform..