A systematic survey of centrality measures for protein-protein interaction networks

Background Numerous centrality measures have been introduced to identify “central” nodes in large networks. The availability of a wide range of measures for ranking influential nodes leaves the user to decide which measure may best suit the analysis of a given network. The choice of a suitable measure is furthermore complicated by the impact of the network topology on ranking influential nodes by centrality measures. To approach this problem systematically, we examined the centrality profile of nodes of yeast protein-protein interaction networks (PPINs) in order to detect which centrality measure is succeeding in predicting influential proteins. We studied how different topological network features are reflected in a large set of commonly used centrality measures. Results We used yeast PPINs to compare 27 common of centrality measures. The measures characterize and assort influential nodes of the networks. We applied principal component analysis (PCA) and hierarchical clustering and found that the most informative measures depend on the network’s topology. Interestingly, some measures had a high level of contribution in comparison to others in all PPINs, namely Latora closeness, Decay, Lin, Freeman closeness, Diffusion, Residual closeness and Average distance centralities. Conclusions The choice of a suitable set of centrality measures is crucial for inferring important functional properties of a network. We concluded that undertaking data reduction using unsupervised machine learning methods helps to choose appropriate variables (centrality measures). Hence, we proposed identifying the contribution proportions of the centrality measures with PCA as a prerequisite step of network analysis before inferring functional consequences, e.g., essentiality of a node.

[1]  Seyed Shahriar Arab,et al.  CentiServer: A Comprehensive Resource, Web-Based Application and R Package for Centrality Analysis , 2015, PloS one.

[2]  An-Ping Zeng,et al.  The Connectivity Structure, Giant Strong Component and Centrality of Metabolic Networks , 2003, Bioinform..

[3]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[4]  Falk Schreiber,et al.  Comparison of Centralities for Biological Networks , 2004, German Conference on Bioinformatics.

[5]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[6]  V Latora,et al.  Efficient behavior of small-world networks. , 2001, Physical review letters.

[7]  Falk Schreiber,et al.  Analysis of Biological Networks , 2008 .

[8]  Cynthia M. Lakon,et al.  How Correlated Are Network Centrality Measures? , 2008, Connections.

[9]  M. Niazi,et al.  Towards a Methodology for Validation of Centrality Measures in Complex Networks , 2014, PloS one.

[10]  S. Bergmann,et al.  Similarities and Differences in Genome-Wide Expression Data of Six Organisms , 2003, PLoS biology.

[11]  Falk Schreiber,et al.  Exploration of biological network centralities with CentiBiN , 2006, BMC Bioinformatics.

[12]  I. Xenarios,et al.  UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View. , 2016, Methods in molecular biology.

[13]  Pasquale De Meo,et al.  A Novel Measure of Edge Centrality in Social Networks , 2012, Knowl. Based Syst..

[14]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[15]  Cun-Quan Zhang,et al.  Laplacian centrality: A new centrality measure for weighted networks , 2012, Inf. Sci..

[16]  F. Harary,et al.  Eccentricity and centrality in networks , 1995 .

[17]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[18]  Padhraic Smyth,et al.  Algorithms for estimating relative importance in networks , 2003, KDD '03.

[19]  Ernesto Estrada Virtual identification of essential proteins within the protein interaction network of yeast , 2005, Proteomics.

[20]  Linton C. Freeman,et al.  Going the Wrong Way on a One-Way Street: Centrality in Physics and Biology , 2008, J. Soc. Struct..

[21]  David W. Scott The New S Language , 1990 .

[22]  Tim Dwyer,et al.  Visual analysis of network centralities , 2006, APVIS.

[23]  Carter T. Butts,et al.  network: A Package for Managing Relational Data in R , 2008 .

[24]  Soon-Heng Tan,et al.  Functional centrality: detecting lethality of proteins in protein interaction networks. , 2007, Genome informatics. International Conference on Genome Informatics.

[25]  Falk Schreiber,et al.  CentiLib: comprehensive analysis and exploration of network centralities. , 2012 .

[26]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[27]  Stefan Wuchty,et al.  Essentiality and centrality in protein interaction networks revisited , 2015, BMC Bioinformatics.

[28]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[29]  Olaf Wolkenhauer,et al.  Evolution of Centrality Measurements for the Detection of Essential Proteins in Biological Networks , 2016, Front. Physiol..

[30]  Guy N. Brock,et al.  clValid , an R package for cluster validation , 2008 .

[31]  C. Dangalchev Residual closeness in networks , 2006 .

[32]  D. Fell,et al.  The small world inside large metabolic networks , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[33]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[34]  Ruth Nussinov,et al.  Structure and dynamics of molecular networks: A novel paradigm of drug discovery. A comprehensive review , 2012, Pharmacology & therapeutics.

[35]  Steve Horvath,et al.  Weighted Network Analysis , 2011 .

[36]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[37]  Martin G. Everett,et al.  A Graph-theoretic perspective on centrality , 2006, Soc. Networks.

[38]  E. Wingender,et al.  Topology of mammalian transcription networks. , 2005, Genome informatics. International Conference on Genome Informatics.

[39]  J Craig Venter,et al.  A systems biology tour de force for a near-minimal bacterium , 2009, Molecular systems biology.

[40]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[41]  Mohieddin Jafari,et al.  Updates on drug-target network; facilitating polypharmacology and data integration by growth of DrugBank database , 2015, Briefings Bioinform..

[42]  Leng Han,et al.  Gene co-expression network analysis reveals common system-level properties of prognostic genes across cancer types , 2014, Nature Communications.

[43]  C. A. Murthy,et al.  A New Centrality Measure for Influence Maximization in Social Networks , 2011, PReMI.

[44]  Dirk Koschützki,et al.  How to identify essential genes from molecular networks? , 2009, BMC Systems Biology.

[45]  Meghana Viswanath Ontology-based automatic text summarization , 2009 .

[46]  Andrea Landherr,et al.  A Critical Review of Centrality Measures in Social Networks , 2010, Bus. Inf. Syst. Eng..

[47]  A. Telcs,et al.  Lobby index in networks , 2008, 0809.0514.

[48]  Chung-Yen Lin,et al.  Hubba: hub objects analyzer—a framework of interactome hubs identification for network biology , 2008, Nucleic Acids Res..

[49]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[50]  Damian Szklarczyk,et al.  The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible , 2016, Nucleic Acids Res..

[51]  M. Zelen,et al.  Rethinking centrality: Methods and examples☆ , 1989 .

[52]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[53]  Hui Gao,et al.  Identifying Influential Nodes in Large-Scale Directed Networks: The Role of Clustering , 2013, PloS one.

[54]  Charles B. Shrader,et al.  Structure, context, and centrality in interorganizational networks , 1990 .

[55]  Joachim Krieter,et al.  Social network analysis - centrality parameters and individual network positions of agonistic behavior in pigs over three different age levels , 2015, SpringerPlus.

[56]  Harry Eugene Stanley,et al.  Correlation between centrality metrics and their application to the opinion model , 2014, The European Physical Journal B.

[57]  Yi Pan,et al.  Rechecking the Centrality-Lethality Rule in the Scope of Protein Subcellular Localization Interaction Networks , 2015, PloS one.

[58]  Dianne P. O'Leary,et al.  Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality , 2008, PLoS Comput. Biol..

[59]  Paul J. Laurienti,et al.  A New Measure of Centrality for Brain Networks , 2010, PloS one.

[60]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[61]  J. A. Rodríguez-Velázquez,et al.  Subgraph centrality in complex networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[62]  Michalis Vazirgiannis,et al.  Locating influential nodes in complex networks , 2016, Scientific Reports.

[63]  S. Horvath Weighted Network Analysis: Applications in Genomics and Systems Biology , 2011 .