Betweenness Centrality in an HSA-enabled System

This paper studies different approaches to implementing betweenness centrality on a heterogeneous system. Betweenness centrality is an important kernel in graph processing. It exposes multiple levels of parallelism and offers many opportunities for optimization. We implement several versions of betweenness centrality on an AMD accelerated processing unit (APU). These include GPU-only implementations with two edge-distribution methods, GPU-side load balancing, and CPU-GPU load balancing using both a master-worker model with queue monitoring and a work-stealing model. We take advantage of recent developments in the Heterogeneous System Architecture (HSA), such as its unified virtual address space and support for a variety of atomic operations. We also apply different memory-scope and memory-ordering options to different synchronization scenarios. We compare the implementations, analyze their performance, and discuss important directions for future research.
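For reference, the sketch below shows a minimal sequential formulation of betweenness centrality based on Brandes' algorithm for unweighted graphs. It is illustrative only and is not the paper's APU implementation: the adjacency-list representation and the function name `betweenness` are our own assumptions. The forward (breadth-first search) and backward (dependency accumulation) phases are the sources of the multiple levels of parallelism mentioned above.

```cpp
#include <vector>
#include <queue>
#include <stack>

// Sequential Brandes' betweenness centrality on an unweighted,
// directed graph given as adjacency lists (illustrative sketch only).
std::vector<double> betweenness(const std::vector<std::vector<int>>& adj) {
    const int n = static_cast<int>(adj.size());
    std::vector<double> bc(n, 0.0);

    for (int s = 0; s < n; ++s) {
        // Forward phase: BFS from s, counting shortest paths (sigma)
        // and recording predecessors along shortest paths.
        std::vector<std::vector<int>> pred(n);
        std::vector<long long> sigma(n, 0);
        std::vector<int> dist(n, -1);
        std::stack<int> order;   // vertices in non-decreasing distance from s
        std::queue<int> q;

        sigma[s] = 1;
        dist[s] = 0;
        q.push(s);
        while (!q.empty()) {
            int v = q.front(); q.pop();
            order.push(v);
            for (int w : adj[v]) {
                if (dist[w] < 0) {            // first visit of w
                    dist[w] = dist[v] + 1;
                    q.push(w);
                }
                if (dist[w] == dist[v] + 1) { // shortest path to w via v
                    sigma[w] += sigma[v];
                    pred[w].push_back(v);
                }
            }
        }

        // Backward phase: accumulate dependencies in reverse BFS order.
        std::vector<double> delta(n, 0.0);
        while (!order.empty()) {
            int w = order.top(); order.pop();
            for (int v : pred[w])
                delta[v] += static_cast<double>(sigma[v]) / sigma[w] * (1.0 + delta[w]);
            if (w != s)
                bc[w] += delta[w];
        }
    }
    return bc;
}
```

In GPU-oriented variants of this scheme, the per-source searches and the edge traversals within each BFS level are the levels of parallelism that edge-distribution and load-balancing strategies, such as those studied in this paper, aim to exploit.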