CORE Analysis for Efficient Shortest Path Traversal Queries in Social Graphs

The shortest path traversal queries require to scan the entire graph. The repetitive scans make it hard to answer the traversal queries in a reasonable time. Many traversal algorithms imply precomputed information to speed up the run-time query process. However, computing effective auxiliary data at pre-processing stage is still an active problem. It is useful but non-trivial to determine the occurrence of vertices or edges during shortest path computations. In this paper, we empirically analyze the continuous overlapped regions (COREs) to articulate the behavior of shortest path traversals on different real life social graphs. First, we compute the shortest paths between a set of vertices. Each shortest path is considered as one transaction. Second, we utilize the pattern mining approach to identify the frequency of occurrence of the vertices appear together. Further, we evaluate the results in terms of network properties, i.e. Degree distribution, average shortest path, and clustering coefficient, along with the visual analysis.

[1]  Guy E. Blelloch,et al.  GraphChi: Large-Scale Graph Computation on Just a PC , 2012, OSDI.

[2]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[3]  Jeffrey Xu Yu,et al.  Relational Approach for Shortest Path Discovery over Large Graphs , 2011, Proc. VLDB Endow..

[4]  Waqas Nawaz,et al.  Shortest path analysis for efficient traversal queries in large networks , 2014 .

[5]  Peter Sanders,et al.  Engineering Route Planning Algorithms , 2009, Algorithmics of Large and Complex Networks.

[6]  Andrew V. Goldberg,et al.  Robust Exact Distance Queries on Massive Networks , 2014 .

[7]  Christian Sommer,et al.  Shortest-path queries in static networks , 2014, ACM Comput. Surv..

[8]  Takuya Akiba,et al.  Fast exact shortest-path distance queries on large networks by pruned landmark labeling , 2013, SIGMOD '13.

[9]  Shuigeng Zhou,et al.  Shortest path and distance queries on road networks: towards bridging theory and practice , 2013, SIGMOD '13.

[10]  Amos Fiat,et al.  Highway Dimension and Provably Efficient Shortest Path Algorithms , 2016, J. ACM.

[11]  Jeffrey Xu Yu,et al.  Shortest Path Computing in Relational DBMSs , 2014, IEEE Transactions on Knowledge and Data Engineering.

[12]  Andrew V. Goldberg,et al.  Implementation Challenge for Shortest Paths , 2008, Encyclopedia of Algorithms.

[13]  Young-Koo Lee,et al.  BiShard Parallel Processor: A Disk-Based Processing Engine for Billion-Scale Graphs , 2014, MUE 2014.