Determining minimum set of driver nodes in protein-protein interaction networks

BackgroundRecently, several studies have drawn attention to the determination of a minimum set of driver proteins that are important for the control of the underlying protein-protein interaction (PPI) networks. In general, the minimum dominating set (MDS) model is widely adopted. However, because the MDS model does not generate a unique MDS configuration, multiple different MDSs would be generated when using different optimization algorithms. Therefore, among these MDSs, it is difficult to find out the one that represents the true driver set of proteins.ResultsTo address this problem, we develop a centrality-corrected minimum dominating set (CC-MDS) model which includes heterogeneity in degree and betweenness centralities of proteins. Both the MDS model and the CC-MDS model are applied on three human PPI networks. Unlike the MDS model, the CC-MDS model generates almost the same sets of driver proteins when we implement it using different optimization algorithms. The CC-MDS model targets more high-degree and high-betweenness proteins than the uncorrected counterpart. The more central position allows CC-MDS proteins to be more important in maintaining the overall network connectivity than MDS proteins. To indicate the functional significance, we find that CC-MDS proteins are involved in, on average, more protein complexes and GO annotations than MDS proteins. We also find that more essential genes, aging genes, disease-associated genes and virus-targeted genes appear in CC-MDS proteins than in MDS proteins. As for the involvement in regulatory functions, the sets of CC-MDS proteins show much stronger enrichment of transcription factors and protein kinases. The results about topological and functional significance demonstrate that the CC-MDS model can capture more driver proteins than the MDS model.ConclusionsBased on the results obtained, the CC-MDS model presents to be a powerful tool for the determination of driver proteins that can control the underlying PPI networks. The software described in this paper and the datasets used are available at https://github.com/Zhangxf-ccnu/CC-MDS.

[1]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[2]  Stephen T. Hedetniemi,et al.  Bibliography on domination in graphs and some basic definitions of domination parameters , 1991, Discret. Math..

[3]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[4]  M. Daly,et al.  Guilt by association , 2000, Nature Genetics.

[5]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[6]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[7]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[8]  Alexander E. Kel,et al.  TRANSFAC®: transcriptional regulation, from patterns to profiles , 2003, Nucleic Acids Res..

[9]  N. Campbell Genetic association database , 2004, Nature Reviews Genetics.

[10]  Ren Zhang,et al.  DEG: a database of essential genes. , 2004, Nucleic acids research.

[11]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[12]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2005, Nucleic Acids Res..

[13]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[14]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[15]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[16]  Dianne P. O'Leary,et al.  Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality , 2008, PLoS Comput. Biol..

[17]  Octave Noubibou Doudieu,et al.  CORUM: the comprehensive resource of mammalian protein complexes , 2007, Nucleic Acids Res..

[18]  Jianzhi Zhang,et al.  Null mutations in human and mouse orthologs frequently result in different phenotypes , 2008, Proceedings of the National Academy of Sciences.

[19]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.

[20]  Yan Lin,et al.  DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes , 2008, Nucleic Acids Res..

[21]  Shekhar Verma,et al.  A Power Aware Minimum Connected Dominating Set for Wireless Sensor Networks , 2009, J. Networks.

[22]  Michele Tinti,et al.  VirusMINT: a viral protein interaction database , 2008, Nucleic Acids Res..

[23]  Ben Lehner,et al.  Tissue specificity and the human protein interaction network , 2009, Molecular systems biology.

[24]  Hans-Werner Mewes,et al.  CORUM: the comprehensive resource of mammalian protein complexes , 2007, Nucleic Acids Res..

[25]  Jens M. Olesen,et al.  Centrality measures and the importance of generalist species in pollination networks , 2010 .

[26]  Ailsa H. Land,et al.  An Automatic Method of Solving Discrete Programming Problems , 1960 .

[27]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[28]  A. Bonato,et al.  Dominating Biological Networks , 2011, PloS one.

[29]  M. Egerstedt Complex networks: Degrees of control , 2011, Nature.

[30]  F. Müller,et al.  Few inputs can reprogram biological networks , 2011, Nature.

[31]  Jesse Gillis,et al.  The Impact of Multifunctional Genes on "Guilt by Association" Analysis , 2011, PloS one.

[32]  Albert-László Barabási,et al.  Controllability of complex networks , 2011, Nature.

[33]  Haiyuan Yu,et al.  HINT: High-quality protein interactomes and their applications in understanding human disease , 2012, BMC Systems Biology.

[34]  Helga Thorvaldsdóttir,et al.  Molecular signatures database (MSigDB) 3.0 , 2011, Bioinform..

[35]  J. Kurths,et al.  Correction: Identifying Controlling Nodes in Neuronal Networks in Different Scales , 2012, PLoS ONE.

[36]  Andrei L. Turinsky,et al.  A Census of Human Soluble Protein Complexes , 2012, Cell.

[37]  Tatsuya Akutsu,et al.  Dominating scale-free networks with variable scaling exponent: heterogeneous networks are not difficult to control , 2012 .

[38]  Rahul C. Deo,et al.  Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins - eScholarship , 2012 .

[39]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[40]  J. Kurths,et al.  Identifying Controlling Nodes in Neuronal Networks in Different Scales , 2012, PloS one.

[41]  Dao-Qing Dai,et al.  A Framework for Incorporating Functional Interrelationships into Protein Function Prediction Algorithms , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[42]  Alain Guénoche,et al.  Multifunctional proteins revealed by overlapping clustering in protein interaction network , 2011, Bioinform..

[43]  Deok-Sun Lee,et al.  Viral Perturbations of Host Networks Reflect Disease Etiology , 2012, PLoS Comput. Biol..

[44]  Mona Singh,et al.  From Hub Proteins to Hub Modules: The Relationship Between Essentiality and Centrality in the Yeast Interactome at Different Scales of Organization , 2013, PLoS Comput. Biol..

[45]  Tatsuya Akutsu,et al.  Structural controllability of unidirectional bipartite networks , 2013, Scientific Reports.

[46]  M. Bucan,et al.  From Mouse to Human: Evolutionary Genomics Analysis of Human Orthologs of Essential Genes , 2013, PLoS genetics.

[47]  João Pedro de Magalhães,et al.  Human Ageing Genomic Resources: Integrated databases and tools for the biology and genetics of ageing , 2012, Nucleic Acids Res..

[48]  Albert-László Barabási,et al.  Observability of complex systems , 2013, Proceedings of the National Academy of Sciences.

[49]  Li Wang,et al.  Corrigendum: Ultrafast universal quantum control of a quantum-dot charge qubit using Landau–Zener–Stückelberg interference , 2013, Nature Communications.

[50]  Endre Csóka,et al.  Emergence of bimodality in controlling complex networks , 2013, Nature Communications.

[51]  M. Gearing,et al.  Correction: Corrigendum: Tonic inhibition in dentate gyrus impairs long-term potentiation and memory in an Alzheimer’s disease model , 2014, Nature Communications.

[52]  Tatsuya Akutsu,et al.  Analysis of critical and redundant nodes in controlling directed and undirected complex networks using dominating sets , 2014, J. Complex Networks.

[53]  Albert-László Barabási,et al.  Target control of complex networks , 2014, Nature Communications.

[54]  Stefan Wuchty,et al.  Controllability in protein interaction networks , 2014, Proceedings of the National Academy of Sciences.

[55]  Yanhui Hu,et al.  Integrating protein-protein interaction networks with phenotypes reveals signs of interactions , 2013, Nature Methods.

[56]  Hsien-Da Huang,et al.  RegPhos 2.0: an updated resource to explore protein kinase–substrate phosphorylation networks in mammals , 2014, Database J. Biol. Databases Curation.

[57]  Peng Yang,et al.  Detecting temporal protein complexes from dynamic protein-protein interaction networks , 2014, BMC Bioinformatics.

[58]  Bridget E. Begg,et al.  A Proteome-Scale Map of the Human Interactome Network , 2014, Cell.

[59]  Tatsuya Akutsu,et al.  Structurally robust control of complex networks. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[60]  Elspeth A. Bruford,et al.  Genenames.org: the HGNC resources in 2015 , 2014, Nucleic Acids Res..