Scale-space measures for graph topology link protein network architecture to function

Motivation: The network architecture of physical protein interactions is an important determinant for the molecular functions that are carried out within each cell. To study this relation, the network architecture can be characterized by graph topological characteristics such as shortest paths and network hubs. These characteristics have an important shortcoming: they do not take into account that interactions occur across different scales. This is important because some cellular functions may involve a single direct protein interaction (small scale), whereas others require more and/or indirect interactions, such as protein complexes (medium scale) and interactions between large modules of proteins (large scale). Results: In this work, we derive generalized scale-aware versions of known graph topological measures based on diffusion kernels. We apply these to characterize the topology of networks across all scales simultaneously, generating a so-called graph topological scale-space. The comprehensive physical interaction network in yeast is used to show that scale-space based measures consistently give superior performance when distinguishing protein functional categories and three major types of functional interactions—genetic interaction, co-expression and perturbation interactions. Moreover, we demonstrate that graph topological scale spaces capture biologically meaningful features that provide new insights into the link between function and protein network architecture. Availability and implementation: MatlabTM code to calculate the scale-aware topological measures (STMs) is available at http://bioinformatics.tudelft.nl/TSSA Contact: j.deridder@tudelft.nl Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[2]  M. Gerstein,et al.  Relating whole-genome expression data with protein-protein interactions. , 2002, Genome research.

[3]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[4]  Robert Patro,et al.  Global network alignment using multiscale spectral signatures , 2012, Bioinform..

[5]  Pooja Jain,et al.  The YEASTRACT database: a tool for the analysis of transcription regulatory associations in Saccharomyces cerevisiae , 2005, Nucleic Acids Res..

[6]  Ernesto Estrada Generalized walks-based centrality measures for complex biological networks. , 2010, Journal of theoretical biology.

[7]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[8]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[9]  Patrick J. Killion,et al.  Genetic reconstruction of a functional transcriptional regulatory network , 2007, Nature Genetics.

[10]  C. Dangalchev Residual closeness in networks , 2006 .

[11]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[12]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[13]  Andrew P. Witkin,et al.  Scale-space filtering: A new approach to multi-scale description , 1984, ICASSP.

[14]  Daniel E. Newburger,et al.  High-resolution DNA-binding specificity analysis of yeast transcription factors. , 2009, Genome research.

[15]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[16]  Mike Tyers,et al.  PhosphoGRID: a database of experimentally verified in vivo protein phosphorylation sites from the budding yeast Saccharomyces cerevisiae , 2010, Database J. Biol. Databases Curation.

[17]  Gary D Bader,et al.  The Genetic Landscape of a Cell , 2010, Science.

[18]  B. Palsson,et al.  The evolution of molecular biology into systems biology , 2004, Nature Biotechnology.

[19]  P. Bonacich Factoring and weighting approaches to status scores and clique identification , 1972 .

[20]  P. Phillips Epistasis — the essential role of gene interactions in the structure and evolution of genetic systems , 2008, Nature Reviews Genetics.

[21]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[22]  Karl J. Friston,et al.  Structural and Functional Brain Networks: From Connections to Cognition , 2013, Science.

[23]  Charles Boone,et al.  Identifying transcription factor functions and targets by phenotypic activation , 2006, Proceedings of the National Academy of Sciences.

[24]  Dianne P. O'Leary,et al.  Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality , 2008, PLoS Comput. Biol..

[25]  J. A. Rodríguez-Velázquez,et al.  Subgraph centrality in complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Li Sheng,et al.  On an ensemble algorithm for clustering cancer patient data , 2013, BMC Systems Biology.

[27]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[28]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[29]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[30]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[31]  KingsfordCarl,et al.  Global network alignment using multiscale spectral signatures , 2012 .

[32]  P. Kemmeren,et al.  Functional Overlap and Regulatory Links Shape Genetic Interactions between Signaling Pathways , 2010, Cell.

[33]  Mona Singh,et al.  Disentangling function from topology to infer the network properties of disease genes , 2013, BMC Systems Biology.

[34]  H. Kitano Systems Biology: A Brief Overview , 2002, Science.

[35]  M. Gerstein,et al.  Global analysis of protein phosphorylation in yeast , 2005, Nature.

[36]  Roded Sharan,et al.  SPINE: a framework for signaling-regulatory pathway inference from cause-effect experiments , 2007, ISMB/ECCB.

[37]  T. Ideker,et al.  Systematic interpretation of genetic interactions using protein networks , 2005, Nature Biotechnology.

[38]  Tijana Milenkoviæ,et al.  Uncovering Biological Network Function via Graphlet Degree Signatures , 2008, Cancer informatics.

[39]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[40]  Ferenc Jordán,et al.  Searching for keystones in ecological networks , 2002 .

[41]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[42]  Piet Van Mieghem,et al.  Local Topological Signatures for Network-Based Prediction of Biological Function , 2013, PRIB.

[43]  Piet Van Mieghem,et al.  Topology of molecular interaction networks , 2013, BMC Systems Biology.

[44]  C. Myers,et al.  Using networks to measure similarity between genes: association index selection , 2013, Nature Methods.

[45]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2011 update , 2010, Nucleic Acids Res..