VRGQ: Evaluating a Stream of Iterative Graph Queries via Value Reuse

While much of the research on graph analytics over large power-law graphs has focused on developing algorithms for evaluating a single global graph query, in practice we may be faced with a stream of queries. We observe that, due to their global nature, vertex specific graph queries present an opportunity for sharing work across queries. To take advantage of this opportunity, we have developed the VRGQ framework that accelerates the evaluation of a stream of queries via coarsegrained value reuse. In particular, the results of queries for a small set of source vertices are reused to speedup all future queries. We present a two step algorithm that in its first step initializes the query result based upon value reuse and then in the second step iteratively evaluates the query to convergence. The reused results for a small number of queries are held in a reuse table. Our experiments with best reuse configurations on four power law graphs and thousands of graph queries of five kinds yielded average speedups of 143×, 13.2×, 6.89×, 1.43×, and 1.18×.

[1]  Zhijia Zhao,et al.  Tripoline: generalized incremental graph processing via graph triangle inequality , 2021, EuroSys.

[2]  Rajiv Gupta,et al.  BEAD: Batched Evaluation of Iterative Graph Queries with Evolving Analytics Demands , 2020, 2020 IEEE International Conference on Big Data (Big Data).

[3]  Rajiv Gupta,et al.  SimGQ: Simultaneously Evaluating Iterative Graph Queries , 2020, 2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC).

[4]  Xiaolin Jiang,et al.  MultiLyra: Scalable Distributed Evaluation of Batches of Iterative Graph Queries , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[5]  Rajiv Gupta,et al.  PnP: Pruning and Prediction for Point-To-Point Iterative Graph Analytics , 2019, ASPLOS.

[6]  Hai Jin,et al.  CGraph: A Correlations-aware Approach for Efficient Concurrent Iterative Graph Processing , 2018, USENIX Annual Technical Conference.

[7]  Kristi Kuljus,et al.  Estimation of Viterbi path in Bayesian hidden Markov models , 2018, METRON.

[8]  Chao Li,et al.  Congra: Towards Efficient Processing of Concurrent Graph Queries on Shared-Memory Machines , 2017, 2017 IEEE International Conference on Computer Design (ICCD).

[9]  Rajiv Gupta,et al.  CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing , 2017, ASPLOS.

[10]  Fan Yang,et al.  A General-Purpose Query-Centric Framework for Querying Big Graphs , 2016, Proc. VLDB Endow..

[11]  Huy T. Vo,et al.  The More the Merrier: Efficient Multi-Source Graph Traversal , 2014, Proc. VLDB Endow..

[12]  Rajiv Gupta,et al.  ASPIRE: exploiting asynchronous parallelism in iterative algorithms using a relaxed consistency based DSM , 2014, OOPSLA.

[13]  Yafei Dai,et al.  Seraph: an efficient, low-cost system for concurrent graph processing , 2014, HPDC '14.

[14]  Keshav Pingali,et al.  A lightweight infrastructure for graph analytics , 2013, SOSP.

[15]  Haixun Wang,et al.  Hub-Accelerator: Fast and Exact Shortest Path Computation in Large Social Networks , 2013, ArXiv.

[16]  Guy E. Blelloch,et al.  Ligra: a lightweight graph processing framework for shared memory , 2013, PPoPP '13.

[17]  Carlos Guestrin,et al.  Usenix Association 10th Usenix Symposium on Operating Systems Design and Implementation (osdi '12) 31 Graphchi: Large-scale Graph Computation on Just a Pc , 2022 .

[18]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[19]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[20]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[21]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[22]  Rizal Setya Perdana What is Twitter , 2013 .

[23]  L. Takac DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS , 2012 .

[24]  Carlos Guestrin,et al.  Distributed GraphLab : A Framework for Machine Learning and Data Mining in the Cloud , 2012 .

[25]  Carlos Guestrin,et al.  PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012 .