Shortest Path Approximate Algorithm for Complex Network Analysis

The tremendous scale of the social networks mined from Internet is the main obstacle of a social network analysis application. The bottleneck of many network analysis algorithms is the extortionate computational complexity of calculating the shortest path. Real-World networks usually exhibit the same topological features as complex networks such as the "scale-free" and etc,which indicate the intrinsic laws of the shortest paths in complex networks. Based on the topological features of real-world networks,a novel shortest path approximate algorithm which uses an existent short path passing through some local center nodes to estimate the shortest path in complex networks,is proposed. This paper illustrates the advantage and feasibility of incorporating the proposed algorithm within the network properties,which suggests a new idea for complex social network analysis. The proposed algorithm has been evaluated both on synthetic network stage and real world network stage. Experimental results show that the proposed algorithm can largely reduce the computational complexity and remain highly effective in complex networks.

[1]  Edmond Chow,et al.  A Graph Search Heuristic for Shortest Distance Paths , 2005, AAAI 2005.

[2]  Andrew V. Goldberg,et al.  Shortest paths algorithms: Theory and experimental evaluation , 1994, SODA '94.

[3]  Mikkel Thorup,et al.  Approximate distance oracles , 2005, J. ACM.

[4]  Yang Bo,et al.  Complex Network Clustering Algorithms , 2009 .

[5]  Kazunari Ishida Extracting Latent Weblog Communities-A Partitioning Algorithm for Bipartite Graphs - , 2005 .

[6]  David D. Jensen,et al.  Using structure indices for efficient approximation of network properties , 2006, KDD '06.

[7]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[8]  Andrew McCallum,et al.  A Machine Learning Approach to Building Domain-Specific Search Engines , 1999, IJCAI.

[9]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[10]  G. Sabidussi The centrality of a graph. , 1966, Psychometrika.

[11]  Edith Cohen,et al.  All-pairs small-stretch paths , 1997, SODA '97.

[12]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[13]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[14]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[15]  Narsingh Deo,et al.  Shortest-path algorithms: Taxonomy and annotation , 1984, Networks.

[16]  David Eppstein,et al.  A steady state model for graph power laws , 2002, ArXiv.

[17]  F. Benjamin Zhan,et al.  Shortest Path Algorithms: An Evaluation Using Real Road Networks , 1998, Transp. Sci..

[18]  Jerome L. Myers,et al.  Research Design and Statistical Analysis , 1991 .

[19]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[20]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[21]  Sandeep Sen,et al.  All-Pairs Nearly 2-Approximate Shortest-Paths in O(n2 polylog n) Time , 2005, STACS.

[22]  Li Ning,et al.  Networked Data Mining Based on Social Network Visualizations , 2008 .

[23]  David D. Jensen,et al.  Graph clustering with network structure indices , 2007, ICML '07.

[24]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Aleksandrs Slivkins Distance estimation and object location via rings of neighbors , 2006, Distributed Computing.

[26]  Salvatore J. Stolfo,et al.  Behavior Profiling of Email , 2003, ISI.

[27]  Qiao Shao Mining Key Members of Crime Networks Based on Personality Trait Simulation Email Analysis System , 2008 .

[28]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  J Lu,et al.  Complex Dynamical Networks and Their Applications in Software Engineering , 2008 .

[30]  P. Bearman,et al.  Chains of Affection: The Structure of Adolescent Romantic and Sexual Networks1 , 2004, American Journal of Sociology.

[31]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[32]  Sandeep Sen,et al.  All-pairs nearly 2-approximate shortest paths in I time , 2009, Theor. Comput. Sci..

[33]  Robert E. Tarjan,et al.  Fibonacci heaps and their uses in improved network optimization algorithms , 1984, JACM.

[34]  Kristina Lerman,et al.  Social Browsing on Flickr , 2006, ICWSM.

[35]  Uri Zwick,et al.  Exact and Approximate Distances in Graphs - A Survey , 2001, ESA.

[36]  B. Bollobás The evolution of random graphs , 1984 .

[37]  Xu Ye Analysis on Traveling Diameter of Internet , 2006 .