Viewing the Meso-Scale Structures in Protein-Protein Interaction Networks Using 2-Clubs

Many current analyses on protein-protein interaction (PPI) networks concentrate on developing the algorithms for identifying the functional modules within the PPI networks. However, understanding the internal structure in the functional modules is needed to define their roles within the entire network. We propose the use of a two-club meso-scale structure that possesses refined inner topological structural properties, useful for deciphering the aforementioned obscure structural supporting evidences. In this paper, we: 1) illustrate the feasibility and advantages of modeling functional modules as two-clubs in PPI networks by taking statistics on the diameter distribution of benchmark functional modules within several golden standard sets; 2) categorize the two-clubs into six subcategories through the use of well-defined internal graph-theoretic characterizations; and 3) analyze these six subcategories based on factors, such as their structure, and various metrics, such as topological centralities, the numbers of involved transcription factors and essential genes, and the numbers of matched protein complexes or GO terms. Our structure-driven analysis allows us to predict the roles of identified functional modules from their network structure alone. Our subcategories of coteries and social circles serve as a classification scheme for determining which functional modules are central, regional, and satellite and which are coordinating. Experimental results show that in order to precisely control and examine the PPI networks for further research, a clear understanding of how the internal topology of modules affects their function is essential.

[1]  Bang Ye Wu,et al.  Influence Clubs in Social Networks , 2010, ICCCI.

[2]  Carl Kingsford,et al.  The power of protein interaction networks for associating genes with diseases , 2010, Bioinform..

[3]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[4]  Derek G. Corneil,et al.  Complement reducible graphs , 1981, Discret. Appl. Math..

[5]  Xiaoli Li,et al.  Computational approaches for detecting protein complexes from protein interaction networks: a survey , 2010, BMC Genomics.

[6]  Bachelor Opleiding,et al.  Fast Finding of 2-clubs , 2012 .

[7]  Bang Ye Wu,et al.  A Structural Approach for Finding Real-Friend Links in Internet Social Networks , 2011, 2011 International Conference on Internet of Things and 4th International Conference on Cyber, Physical and Social Computing.

[8]  Natasa Przulj,et al.  Predicting disease associations via biological network analysis , 2014, BMC Bioinformatics.

[9]  Guimei Liu,et al.  Complex discovery from weighted PPI networks , 2009, Bioinform..

[10]  Haiyuan Yu,et al.  Detecting overlapping protein complexes in protein-protein interaction networks , 2012, Nature Methods.

[11]  Srinivas Pasupuleti,et al.  Detection of Protein Complexes in Protein Interaction Networks Using n-Clubs , 2008, EvoBIO.

[12]  Christian Komusiewicz,et al.  Parameterized Algorithmics and Computational Experiments for Finding 2-Clubs , 2015, J. Graph Algorithms Appl..

[13]  Lin Gao,et al.  Defining and identifying cograph communities in complex networks , 2015 .

[14]  Yan Xiaoyan,et al.  On fuzzy cliques in fuzzy networks , 1988 .

[15]  Mona Singh,et al.  Interaction-based discovery of functionally important genes in cancers , 2013, Nucleic acids research.

[16]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[17]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[18]  Sergiy Butenko,et al.  On clique relaxation models in network analysis , 2013, Eur. J. Oper. Res..

[19]  Nasrullah Memon,et al.  Structural Analysis and Mathematical Methods for Destabilizing Terrorist Networks Using Investigative Data Mining , 2006, ADMA.

[20]  Yijie Wang,et al.  Functional module identification in protein interaction networks by interaction patterns , 2014, Bioinform..

[21]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[22]  Wei Jiang,et al.  The analysis of the drug–targets based on the topological properties in the human protein–protein interaction network , 2009, Journal of drug targeting.

[23]  Sergiy Butenko,et al.  Clique Relaxations in Social Network Analysis: The Maximum k-Plex Problem , 2011, Oper. Res..

[24]  R. Luce,et al.  Connectivity and generalized cliques in sociometric group structure , 1950, Psychometrika.

[25]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[26]  Linton C. Freeman,et al.  Cliques, Galois lattices, and the structure of human social groups☆ , 1996 .

[27]  Keith C. C. Chan,et al.  Evolutionary Graph Clustering for Protein Complex Identification , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  D. Latchman Transcription factors: an overview. , 1997, The international journal of biochemistry & cell biology.

[29]  Kara Dolinski,et al.  Gene Ontology annotations at SGD: new data sources and annotation methods , 2007, Nucleic Acids Res..

[30]  Sergiy Butenko,et al.  Novel Approaches for Analyzing Biological Networks , 2005, J. Comb. Optim..

[31]  Yan Lin,et al.  DEG 10, an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements , 2013, Nucleic Acids Res..

[32]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[33]  Jaques Reifman,et al.  Exploiting large-scale drug-protein interaction information for computational drug repurposing , 2014, BMC Bioinformatics.

[34]  Weinan Zhang,et al.  A Bootstrapping Framework With Interactive Information Modeling for Network Alignment , 2018, IEEE Access.

[35]  Nicolas Wieseke,et al.  On tree representations of relations and graphs: symbolic ultrametrics and cograph edge decompositions , 2015, J. Comb. Optim..

[36]  Ulrich Stelzl,et al.  Dual Coordination of Post Translational Modifications in Human Protein Networks , 2013, PLoS Comput. Biol..

[37]  Hans-Werner Mewes,et al.  CORUM: the comprehensive resource of mammalian protein complexes , 2007, Nucleic Acids Res..

[38]  Wei-Po Lee,et al.  Predicting Drug Side Effects Using Data Analytics and the Integration of Multiple Data Sources , 2017, IEEE Access.

[39]  Yong Gao,et al.  Familial groups in social networks , 2013, Soc. Networks.

[40]  Teng-Hung Chen,et al.  Graph theory and stability analysis of protein complex interaction networks. , 2016, IET systems biology.

[41]  Maria Teresa Almeida,et al.  Upper bounds and heuristics for the 2-club problem , 2011, Eur. J. Oper. Res..

[42]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2006, Nucleic Acids Res..

[43]  Hui Sun,et al.  Protein Function Prediction Using Function Associations in Protein–Protein Interaction Network , 2018, IEEE Access.

[44]  Christian Komusiewicz,et al.  A Graph Modification Approach for Finding Core-Periphery Structures in Protein Interaction Networks , 2014, WABI.

[45]  Ulrich Stelzl,et al.  Studying post-translational modifications with protein interaction networks. , 2014, Current opinion in structural biology.

[46]  Jiangning Song,et al.  Structural Principles Analysis of Host-Pathogen Protein-Protein Interactions: A Structural Bioinformatics Survey , 2018, IEEE Access.

[47]  Alexander Veremyev,et al.  Identifying large robust network clusters via new compact formulations of maximum k-club problems , 2012, Eur. J. Oper. Res..

[48]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[49]  Gerard J. Chang,et al.  Quasi-threshold Graphs , 1996, Discret. Appl. Math..

[50]  Giovanni Scardoni,et al.  Analyzing biological network parameters with CentiScaPe , 2009, Bioinform..

[51]  Yong Gao,et al.  Exploring triad-rich substructures by graph-theoretic characterizations in complex networks , 2016, ArXiv.

[52]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[53]  Robert J. Mokken,et al.  Coteries, Social Circles and Hamlets. Close Communities: A Study of Acquaintance Networks , 2011, ArXiv.

[54]  Nicolas Wieseke,et al.  On Symbolic Ultrametrics, Cotree Representations, and Cograph Edge Decompositions and Partitions , 2015, COCOON.

[55]  R. Alba A graph‐theoretic definition of a sociometric clique† , 1973 .

[56]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[57]  Rick Stevens,et al.  Essential genes on metabolic maps. , 2006, Current opinion in biotechnology.

[58]  Katsuhiko Murakami,et al.  PCDq: human protein complex database with quality index which summarizes different levels of evidences of protein complexes predicted from H-Invitational protein-protein interactions integrative dataset , 2012, BMC Systems Biology.

[59]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[60]  Volkhard Helms,et al.  Identifying transcription factor complexes and their roles , 2014, Bioinform..