Leveraging Structural Hierarchy for Scalable Network Comparison

K-core decomposition is a popular method that segments a network revealing the underlying hierarchy. We explore the propensity of this decomposition method for structural discrimination among networks by extracting features from each level of the hierarchy. We propose a novel algorithm for Network Comparison using k-coreDecomposition NCKD. The method is effective, efficient and scalable, with computational complexity of $$O|{\mathcal E}|$$, where $${\mathcal E}$$ is the set of edges in the network. The low computational complexity of the method makes it attractive for scalable network comparison. NCKD algorithm decomposes networks and extracts features from the resulting shells. Jensen-Shannon distance between extracted features quantifies structural differences between networks. We establish that probability distributions of coreness and intra/inter-shell edges are capable of characterizing different genres of networks and capturing finer structural differences between networks of the same genre. We experiment with synthetic and real-life networks upi¾?to eight million edges on a single PC. Comparison with two recent state-of-the-art network comparison methods affirms that NCKD outperforms in terms of effectiveness and scalability.

[1]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[2]  Katherine Faust,et al.  Comparing Social Networks: Size, Density, and Local Structure , 2006 .

[3]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[4]  Alessandro Vespignani,et al.  Large scale networks fingerprinting and visualization using the k-core decomposition , 2005, NIPS.

[5]  Vishesh Karwa,et al.  Statistical models for cores decomposition of an undirected random graph , 2014, ArXiv.

[6]  Xuelong Li,et al.  A survey of graph edit distance , 2010, Pattern Analysis and Applications.

[7]  Sergey N. Dorogovtsev,et al.  Critical phenomena in complex networks , 2007, ArXiv.

[8]  Francesco De Pellegrini,et al.  Distributed k-Core Decomposition , 2013 .

[9]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[10]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[11]  Tamara G. Kolda,et al.  Accelerating Community Detection by Using K-core Subgraphs , 2014, ArXiv.

[12]  Erhard Rahm,et al.  Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.

[13]  Jia Wang,et al.  Truss Decomposition in Massive Networks , 2012, Proc. VLDB Endow..

[14]  Whitman Richards,et al.  Graph Comparison Using Fine Structure Analysis , 2010, 2010 IEEE Second International Conference on Social Computing.

[15]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[16]  Danai Koutra,et al.  DELTACON: A Principled Massive-Graph Similarity Function , 2013, SDM.

[17]  Giorgios Kollias,et al.  Network Similarity Decomposition (NSD): A Fast and Scalable Approach to Network Alignment , 2012, IEEE Transactions on Knowledge and Data Engineering.

[18]  Anirban Banerjee,et al.  Structural distance and evolutionary relationship of networks , 2008, Biosyst..

[19]  Danai Koutra,et al.  Network similarity via multiple social theories , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[20]  Donald F. Towsley,et al.  Complex network comparison using random walks , 2014, WWW '14 Companion.

[21]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[22]  Dimitrios M. Thilikos,et al.  Evaluating Cooperation in Communities with the k-Core Structure , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[23]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[24]  Vladimir Batagelj,et al.  An O(m) Algorithm for Cores Decomposition of Networks , 2003, ArXiv.