Local representatives in weighted networks

The main features of current real-world networks are their large sizes and structures, which show varying degrees of importance of the nodes in their surroundings. The topic of evaluating the importance of the nodes offers many different approaches that usually work with unweighted networks. We present a novel, simple and straightforward approach for the evaluation of the network's nodes with a focus on local properties in their surroundings. The presented approach is intended for weighted networks where the weight can be interpreted as the proximity between the nodes. Our suggested x-representativeness then takes into account the degree of the node, its nearest neighbors and one other parameter which we call the x-representativeness base. Following that, we also present experiments with three different real-world networks. The aim of these experiments is to show that the x-representativeness can be used to deterministically reduce the network to differently sized samples of representatives, while maintaining the topological properties of the original network.

[1]  Václav Snásel,et al.  Social and swarm aspects of co-authorship network , 2012, Log. J. IGPL.

[2]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Torsten Suel,et al.  Local methods for estimating pagerank values , 2004, CIKM '04.

[4]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[5]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  Christos Faloutsos,et al.  Density biased sampling: an improved method for data mining and clustering , 2000, SIGMOD '00.

[7]  Pairote Sattayatham,et al.  Weighted K-Means for Density-Biased Clustering , 2005, DaWaK.

[8]  Hans-Peter Kriegel,et al.  Metropolis Algorithms for Representative Subgraph Sampling , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[9]  Padhraic Smyth,et al.  Algorithms for estimating relative importance in networks , 2003, KDD '03.

[10]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[11]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[12]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Wenpu Xing,et al.  Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[14]  John Skvoretz,et al.  Node centrality in weighted networks: Generalizing degree and shortest paths , 2010, Soc. Networks.

[15]  Jignesh M. Patel,et al.  Efficient aggregation for graph summarization , 2008, SIGMOD Conference.

[16]  Jürgen Kurths,et al.  Node-weighted interacting network measures improve the representation of real-world complex systems , 2013, ArXiv.

[17]  Nisheeth Shrivastava,et al.  Graph summarization with bounded error , 2008, SIGMOD Conference.

[18]  Yanlei Wu,et al.  2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2014, Beijing, China, August 17-20, 2014 , 2014, ASONAM.

[19]  Stefan Berchtold,et al.  Efficient Biased Sampling for Approximate Clustering and Outlier Detection in Large Data Sets , 2003, IEEE Trans. Knowl. Data Eng..

[20]  Edo Liberty,et al.  Estimating Sizes of Social Networks via Biased Sampling , 2014, Internet Math..

[21]  Ying Ding,et al.  Applying weighted PageRank to author citation networks , 2011, J. Assoc. Inf. Sci. Technol..

[22]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[23]  Hawoong Jeong,et al.  Statistical properties of sampled networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Tanya Y. Berger-Wolf,et al.  Sampling community structure , 2010, WWW '10.

[25]  Huzefa Rangwala,et al.  Sparsification and Sampling of Networks for Collective Classification , 2013, SBP.

[26]  Walter Willinger,et al.  Evaluating Sampling Techniques for Large Dynamic Graphs , 2008 .

[27]  Marek Chrobak,et al.  Sampling large Internet topologies for simulation purposes , 2007, Comput. Networks.

[28]  Jon M Kleinberg,et al.  Hubs, authorities, and communities , 1999, CSUR.

[29]  Pasi Fränti,et al.  Iterative shrinking method for clustering problems , 2006, Pattern Recognit..

[30]  Laks V. S. Lakshmanan,et al.  Proceedings of the 2008 ACM SIGMOD international conference on Management of data , 2008, SIGMOD 2008.

[31]  Stephen Curial,et al.  Effectively visualizing large networks through sampling , 2005, VIS 05. IEEE Visualization, 2005..

[32]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[33]  Christos Faloutsos,et al.  Sampling from large graphs , 2006, KDD '06.

[34]  Yannis Manolopoulos,et al.  An efficient and effective algorithm for density biased sampling , 2002, CIKM '02.

[35]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[36]  M. F. Fuller,et al.  Practical Nonparametric Statistics; Nonparametric Statistical Inference , 1973 .

[37]  Tian Zhang,et al.  BIRCH: A New Data Clustering Algorithm and Its Applications , 1997, Data Mining and Knowledge Discovery.

[38]  Anna C. Gilbert,et al.  Compressing Network Graphs , 2004 .

[39]  Nick Koudas,et al.  Sampling Online Social Networks , 2013, IEEE Transactions on Knowledge and Data Engineering.

[40]  Ramana Rao Kompella,et al.  Reconsidering the Foundations of Network Sampling , 2010 .

[41]  Carl D. Meyer,et al.  Deeper Inside PageRank , 2004, Internet Math..