LightHouse: An Automatic Code Generator for Graph Algorithms on GPUs
暂无分享,去创建一个
[1] Keshav Pingali,et al. Optimistic parallelism benefits from data partitioning , 2008, ASPLOS.
[2] Edmond Chow,et al. A Scalable Distributed Parallel Breadth-First Search Algorithm on BlueGene/L , 2005, ACM/IEEE SC 2005 Conference (SC'05).
[3] Kunle Olukotun,et al. Green-Marl: a DSL for easy and efficient graph analysis , 2012, ASPLOS XVII.
[4] Keshav Pingali,et al. A quantitative study of irregular programs on GPUs , 2012, 2012 IEEE International Symposium on Workload Characterization (IISWC).
[5] Nancy M. Amato,et al. Multithreaded Asynchronous Graph Traversal for In-Memory and Semi-External Memory , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
[6] Matei Ripeanu,et al. A yoke of oxen and a thousand chickens for heavy lifting graph processing , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).
[7] David A. Bader,et al. Designing Multithreaded Algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2 , 2006, 2006 International Conference on Parallel Processing (ICPP'06).
[8] Wu-chun Feng,et al. Inter-block GPU communication via fast barrier synchronization , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).
[9] Jianlong Zhong,et al. Medusa: Simplified Graph Processing on GPUs , 2014, IEEE Transactions on Parallel and Distributed Systems.
[10] Keshav Pingali,et al. Elixir: a system for synthesizing concurrent graph programs , 2012, OOPSLA '12.
[11] Fabio Checconi,et al. Breaking the speed and scalability Barriers for Graph exploration on distributed-memory machines , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.
[12] Keshav Pingali,et al. How much parallelism is there in irregular applications? , 2009, PPoPP '09.
[13] Keshav Pingali,et al. Optimistic parallelism requires abstractions , 2009, CACM.
[14] Xipeng Shen,et al. On-the-fly elimination of dynamic irregularities for GPU computing , 2011, ASPLOS XVI.
[15] Kamesh Madduri,et al. Parallel breadth-first search on distributed memory systems , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
[16] Feng Liu,et al. Dynamically managed data for CPU-GPU architectures , 2012, CGO '12.
[17] Mary W. Hall,et al. Non-affine Extensions to Polyhedral Code Generation , 2014, CGO '14.
[18] Keshav Pingali,et al. The tao of parallelism in algorithms , 2011, PLDI '11.
[19] David A. Bader,et al. An Experimental Study of A Parallel Shortest Path Algorithm for Solving Large-Scale Graph Instances , 2007, ALENEX.
[20] Keshav Pingali,et al. Synthesizing parallel graph programs via automated planning , 2015, PLDI.
[21] Keshav Pingali,et al. Morph algorithms on GPUs , 2013, PPoPP '13.
[22] Frédo Durand,et al. Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines , 2013, PLDI 2013.
[23] Rok Sosic,et al. SNAP , 2016, ACM Trans. Intell. Syst. Technol..
[24] Guy E. Blelloch,et al. Ligra: a lightweight graph processing framework for shared memory , 2013, PPoPP '13.