论文信息 - A Hybrid Update Strategy for I/O-Efficient Out-of-Core Graph Processing

A Hybrid Update Strategy for I/O-Efficient Out-of-Core Graph Processing

In recent years, a number of out-of-core graph processing systems have been proposed to process graphs with billions of edges on just one commodity computer, due to their high cost efficiency. To obtain a better performance, these systems adopt a full I/O model that scans all edges during the computation to avoid the inefficiency of random I/Os. Although this model ensures good I/O access locality, it leads to a large number of useless edges to be loaded when running graph algorithms that only access a small portion of edges in each iteration. An intuitive method to solve this I/O inefficiency problem is the on-demand I/O model that only accesses the active edges. However, this method only works well for the graph algorithms with very few active edges, since the I/O cost will grow rapidly as the number of active edges increases due to the increasing amount of random I/Os. In this article, we present HUS-Graph, an efficient out-of-core graph processing system to address the above I/O issues and achieve a good balance between I/O traffic and I/O access locality. HUS-Graph adopts a hybrid update strategy including two update models, Row-oriented Push (ROP) and Column-oriented Pull (COP). It supports switching between ROP and COP adaptively, for the graph algorithms that have different computation and I/O features. For traversal-based algorithms, HUS-Graph also provides an immediate propagation-based vertex update scheme to accelerate the vertex state propagation and convergence speed. Furthermore, HUS-Graph adopts a locality-optimized dual-block representation to organize graph data and an I/O-based performance prediction method to enable the system to dynamically select the optimal update model between ROP and COP. To save the disk space and further reduce I/O traffic, HUS-Graph implements a space-efficient storage format by combining several graph compression methods. Extensive experimental results show that HUS-Graph outperforms two existing out-of-core systems GraphChi and GridGraph by 1.2x-52.8x.

[1] Shoaib Kamil,et al. GraphIt: a high-performance graph DSL , 2018, Proc. ACM Program. Lang..

[2] Leslie G. Valiant,et al. A bridging model for parallel computation , 1990, CACM.

[3] Jon M. Kleinberg,et al. Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[4] Jianlong Zhong,et al. Medusa: Simplified Graph Processing on GPUs , 2014, IEEE Transactions on Parallel and Distributed Systems.

[5] Wenguang Chen,et al. Gemini: A Computation-Centric Distributed Graph Processing System , 2016, OSDI.

[6] Jimeng Sun,et al. GBASE: a scalable and general graph management system , 2011, KDD.

[7] Zhenguo Li,et al. VENUS: Vertex-centric streamlined graph computation on a single PC , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[8] Alexander S. Szalay,et al. FlashGraph: Processing Billion-Node Graphs on an Array of Commodity SSDs , 2014, FAST.

[9] H. Howie Huang,et al. Graphene: Fine-Grained IO Management for Graph Computing , 2017, FAST.

[10] Guy E. Blelloch,et al. Smaller and Faster: Parallel Processing of Compressed Graphs with Ligra+ , 2015, 2015 Data Compression Conference.

[11] Wenguang Chen,et al. GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning , 2015, USENIX ATC.

[12] Hong Jiang,et al. A highly cost-effective task scheduling strategy for very large graph computation , 2018, Future Gener. Comput. Syst..

[13] Guy E. Blelloch,et al. GraphChi: Large-Scale Graph Computation on Just a PC , 2012, OSDI.

[14] Sebastiano Vigna,et al. The webgraph framework I: compression techniques , 2004, WWW '04.

[15] Hong Jiang,et al. A communication-reduced and computation-balanced framework for fast graph computation , 2018, Frontiers of Computer Science.

[16] Hosung Park,et al. What is Twitter, a social network or a news media? , 2010, WWW '10.

[17] Sebastiano Vigna,et al. A large time-aware web graph , 2008, SIGF.

[18] Rajiv Gupta,et al. Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing , 2016, USENIX Annual Technical Conference.

[19] Carlos Guestrin,et al. Distributed GraphLab : A Framework for Machine Learning and Data Mining in the Cloud , 2012 .

[20] Keval Vora,et al. LUMOS: Dependency-Driven Disk-based Graph Processing , 2019, USENIX ATC.

[21] Guy E. Blelloch,et al. Ligra: a lightweight graph processing framework for shared memory , 2013, PPoPP '13.

[22] Hong Jiang,et al. HUS-Graph: I/O-Efficient Out-of-Core Graph Processing with Hybrid Update Strategy , 2018, ICPP.

[23] Aart J. C. Bik,et al. Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[24] Kai Wang,et al. GraphQ: Graph Query Processing with Abstraction Refinement - Scalable and Programmable Analytics over Very Large Graphs on a Single PC , 2015, USENIX Annual Technical Conference.

[25] Kang Chen,et al. Wonderland: A Novel Abstraction-Based Out-Of-Core Graph Processing System , 2018, ASPLOS.

[26] Yu Wang,et al. NXgraph: An efficient graph processing system on a single machine , 2015, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[27] Jinha Kim,et al. TurboGraph: a fast parallel graph engine handling billion-scale graphs in a single PC , 2013, KDD.

[28] David A. Patterson,et al. Direction-optimizing Breadth-First Search , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[29] Yonggang Wen,et al. GraphMP: I/O-Efficient Big Graph Analytics on a Single Commodity Machine , 2018, IEEE Transactions on Big Data.

[30] Joseph Gonzalez,et al. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012, OSDI.

[31] Yafei Dai,et al. Garaph: Efficient GPU-accelerated Graph Processing on a Single Machine with Balanced Replication , 2017, USENIX Annual Technical Conference.

[32] Sizhuo Zhang,et al. GraFBoost: Using Accelerated Flash Storage for External Graph Analytics , 2018, 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA).

[33] Christos Faloutsos,et al. PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[34] Hong Jiang,et al. Using High-Bandwidth Networks Efficiently for Fast Graph Computation , 2019, IEEE Transactions on Parallel and Distributed Systems.

[35] Haibo Chen,et al. NUMA-aware graph-structured analytics , 2015, PPoPP.

[36] Willy Zwaenepoel,et al. X-Stream: edge-centric graph processing using streaming partitions , 2013, SOSP.

[37] Alex Brooks,et al. Gluon: a communication-optimizing substrate for distributed heterogeneous graph analytics , 2018, PLDI.

[38] Hai Jin,et al. TripleBit: a Fast and Compact System for Large Scale RDF Data , 2013, Proc. VLDB Endow..

[39] Anand Sivasubramaniam,et al. Large-Scale Graph Processing on Emerging Storage Devices , 2019, FAST.

[40] Willy Zwaenepoel,et al. Chaos: scale-out graph processing from secondary storage , 2015, SOSP.

[41] Ge Yu,et al. Hybrid Pulling/Pushing for I/O-Efficient Distributed and Iterative Graph Computing , 2016, SIGMOD Conference.

[42] Reynold Xin,et al. GraphX: Graph Processing in a Distributed Dataflow Framework , 2014, OSDI.