Exploiting locational and topological overlap model to identify modules in protein interaction networks

BackgroundClustering molecular network is a typical method in system biology, which is effective in predicting protein complexes or functional modules. However, few studies have realized that biological molecules are spatial-temporally regulated to form a dynamic cellular network and only a subset of interactions take place at the same location in cells.ResultsIn this study, considering the subcellular localization of proteins, we first construct a co-localization human protein interaction network (PIN) and systematically investigate the relationship between subcellular localization and biological functions. After that, we propose a Locational and Topological Overlap Model (LTOM) to preprocess the co-localization PIN to identify functional modules. LTOM requires the topological overlaps, the common partners shared by two proteins, to be annotated in the same localization as the two proteins. We observed the model has better correspondence with the reference protein complexes and shows more relevance to cancers based on both human and yeast datasets and two clustering algorithms, ClusterONE and MCL.ConclusionTaking into consideration of protein localization and topological overlap can improve the performance of module detection from protein interaction networks.

[1]  María Martín,et al.  The Universal Protein Resource (UniProt) in 2010 , 2010 .

[2]  L. Stein,et al.  A human functional protein interaction network and its application to cancer data analysis , 2010, Genome Biology.

[3]  S. Tsuboi,et al.  Autophagy in yeast demonstrated with proteinase-deficient mutants and conditions for its induction , 1992, The Journal of cell biology.

[4]  Dong Wang,et al.  CrossNorm: a novel normalization strategy for microarray data in cancers , 2016, Scientific Reports.

[5]  S. Horvath,et al.  Conservation and evolution of gene coexpression networks in human and chimpanzee brains , 2006, Proceedings of the National Academy of Sciences.

[6]  Juyong Park,et al.  Protein localization as a principal feature of the etiology and comorbidity of genetic diseases , 2011, Molecular systems biology.

[7]  Mona Singh,et al.  Simple Topological Features Reflect Dynamics and Modularity in Protein Interaction Networks , 2013, PLoS Comput. Biol..

[8]  Nektarios Tavernarakis,et al.  A dual role of p 53 in the control of autophagy , 2022 .

[9]  Yi Pan,et al.  Protein-protein interactions: detection, reliability assessment and applications , 2016, Briefings Bioinform..

[10]  Youping Deng,et al.  Recent advances in clustering methods for protein interaction networks , 2010, BMC Genomics.

[11]  Dong Wang,et al.  Full Characterization of Localization Diversity in the Human Protein Interactome. , 2017, Journal of proteome research.

[12]  Yan Huang,et al.  RNALocate: a resource for RNA subcellular localizations , 2016, Nucleic Acids Res..

[13]  Jianzhen Xu,et al.  Connect the dots , 2013, Autophagy.

[14]  Kwong-Sak Leung,et al.  SMILE: a novel procedure for subcellular module identification with localisation expansion , 2018, IET systems biology.

[15]  Changning Liu,et al.  ncFANs: a web server for functional annotation of long non-coding RNAs , 2011, Nucleic Acids Res..

[16]  M. Stratton,et al.  A census of amplified and overexpressed human cancer genes , 2010, Nature Reviews Cancer.

[17]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[18]  Nektarios Tavernarakis,et al.  Regulation of autophagy by cytoplasmic p53 , 2008, Nature Cell Biology.

[19]  A. Sali,et al.  The molecular sociology of the cell , 2007, Nature.

[20]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[21]  Nektarios Tavernarakis,et al.  A dual role of p53 in the control of autophagy , 2008, Autophagy.

[22]  Baris E. Suzek,et al.  The Universal Protein Resource (UniProt) in 2010 , 2009, Nucleic Acids Res..

[23]  Kwong-Sak Leung,et al.  Identification and characterization of moonlighting long non‐coding RNAs based on RNA and protein interactome , 2018, Bioinform..

[24]  Manik Sharma,et al.  STATIC AND DYNAMIC BNP PARALLEL SCHEDULING ALGORITHMS FOR DISTRIBUTED DATABASE , 2011, BIOINFORMATICS 2011.

[25]  Xia Li,et al.  RAID: a comprehensive resource for human RNA-associated (RNA–RNA/RNA–protein) interaction , 2014, RNA.

[26]  L. Tran,et al.  Integrated Systems Approach Identifies Genetic Nodes and Networks in Late-Onset Alzheimer’s Disease , 2013, Cell.

[27]  Andy M. Yip,et al.  Gene network interconnectedness and the generalized topological overlap measure , 2007, BMC Bioinformatics.

[28]  Tamás Korcsmáros,et al.  ComPPI: a cellular compartment-specific database for protein–protein interaction network analysis , 2014, Nucleic Acids Res..

[29]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[30]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.

[31]  Kwong-Sak Leung,et al.  Discovering approximate-associated sequence patterns for protein-DNA interactions , 2011, Bioinform..

[32]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[33]  Kwong-Sak Leung,et al.  ICN: a normalization method for gene expression data considering the over-expression of informative genes. , 2016, Molecular bioSystems.

[34]  Ozlem Keskin,et al.  Human Cancer Protein-Protein Interaction Network: A Structural Perspective , 2009, PLoS Comput. Biol..

[35]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[36]  Fabian J. Theis,et al.  MIPS: curated databases and comprehensive secondary data resources in 2010 , 2010, Nucleic Acids Res..

[37]  Haiyuan Yu,et al.  Detecting overlapping protein complexes in protein-protein interaction networks , 2012, Nature Methods.

[38]  Y. Leea,et al.  Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target , 2006 .

[39]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[40]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[41]  Limsoon Wong,et al.  Prediction of problematic complexes from PPI networks: sparse, embedded, and small complexes , 2015, Biology Direct.

[42]  Martin H. Schaefer,et al.  HIPPIE v2.0: enhancing meaningfulness and reliability of protein–protein interaction networks , 2016, Nucleic Acids Res..

[43]  Kara Dolinski,et al.  The BioGRID interaction database: 2015 update , 2014, Nucleic Acids Res..

[44]  Kwong-Sak Leung,et al.  Quantification of non-coding RNA target localization diversity and its application in cancers , 2018, Journal of molecular cell biology.