Mining important nodes in complex networks using nonlinear PCA

An important problem in the analysis of complex networks is to mine the top k important nodes. The existing literature offers several metrics, also called centrality measures which estimates importance using the structural properties of node, namely, degree, closeness, betweenness, eigenvector centrality, Pagerank etc. Though there exists plenty of centrality measures, none of them emphasizes the non-linearity of data. In the current study we propose a non-linear principle component analysis based approach to identify the top k important nodes. The proposed method is evaluated based on the saturation time and the fraction of infected nodes during a susceptible-infected propagation. The experiment on synthetic as well as real life data sets show that the developed method is competitive with the state-of-the-art.

[1]  Phillip Bonacich,et al.  Eigenvector-like measures of centrality for asymmetric relations , 2001, Soc. Networks.

[2]  Florian Probst,et al.  Identifying Key Users in Online Social Networks: A PageRank Based Approach , 2010, ICIS.

[3]  Yu Wang,et al.  Community-based greedy algorithm for mining top-K influential nodes in mobile social networks , 2010, KDD.

[4]  Alex Bavelas,et al.  Communication Patterns in Task‐Oriented Groups , 1950 .

[5]  Christophe Diot,et al.  Impact of Human Mobility on Opportunistic Forwarding Algorithms , 2007, IEEE Transactions on Mobile Computing.

[6]  Sergey Brin,et al.  Reprint of: The anatomy of a large-scale hypertextual web search engine , 2012, Comput. Networks.

[7]  Masahiro Kimura,et al.  Tractable Models for Information Diffusion in Social Networks , 2006, PKDD.

[8]  Yicheng Zhang,et al.  Identifying influential nodes in complex networks , 2012 .

[9]  S. Mahadevan,et al.  Identifying influential nodes in weighted networks based on evidence theory , 2013 .

[10]  Pablo A. Estévez,et al.  Selecting the Most Influential Nodes in Social Networks , 2007, 2007 International Joint Conference on Neural Networks.

[11]  Gert Sabidussi,et al.  The centrality index of a graph , 1966 .

[12]  Masahiro Kimura,et al.  Extracting Influential Nodes for Information Diffusion on a Social Network , 2007, AAAI.

[13]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[14]  Masahiro Kimura,et al.  Extracting influential nodes on a social network for information diffusion , 2009, Data Mining and Knowledge Discovery.

[15]  Weiren Shi,et al.  Evaluating the importance of nodes in complex networks , 2016 .

[16]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[17]  Mohammad Ali Nematbakhsh,et al.  IMPROVING DETECTION OF INFLUENTIAL NODES IN COMPLEX NETWORKS , 2015, ArXiv.

[18]  Vasyl Pihur,et al.  Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach , 2007, Bioinform..

[19]  R. Burt The Social Structure of Competition , 2004 .

[20]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[21]  S. Mahadevan,et al.  A modified evidential methodology of identifying influential nodes in weighted networks , 2013 .

[22]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[23]  Steven B. Andrews,et al.  Structural Holes: The Social Structure of Competition , 1995, The SAGE Encyclopedia of Research Design.

[24]  Stanford,et al.  Learning to Discover Social Circles in Ego Networks , 2012 .

[25]  Marián Boguñá,et al.  How to make the top ten: Approximating PageRank from in-degree , 2005, ArXiv.

[26]  Yu Zhang,et al.  Identifying Key Users for Targeted Marketing by Mining Online Social Network , 2010, 2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops.

[27]  Jennifer Golbeck,et al.  Analyzing the Social Web , 2013 .

[28]  Jérôme Kunegis,et al.  KONECT: the Koblenz network collection , 2013, WWW.