Alleviating Irregularity in Graph Analytics Acceleration: a Hardware/Software Co-Design Approach
暂无分享,去创建一个
Zhimin Zhang | Dongrui Fan | Xin Ma | Lei Deng | Yuan Xie | Shuangchen Li | Han Li | Yujing Feng | Abanti Basak | Mingyu Yan | Xing Hu | Itir Akgun | Peng Gu | Xiaochun Ye | Lei Deng | Shuangchen Li | P. Gu | Yuan Xie | Dongrui Fan | Xin Ma | Mingyu Yan | Xiaochun Ye | Xing Hu | Abanti Basak | Yujing Feng | Zhimin Zhang | Han Li | Itir Akgun
[1] Krishna P. Gummadi,et al. Measurement and analysis of online social networks , 2007, IMC '07.
[2] Phillip H. Jones,et al. CyGraph: A Reconfigurable Architecture for Parallel Breadth-First Search , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.
[3] Keshav Pingali,et al. How much parallelism is there in irregular applications? , 2009, PPoPP '09.
[4] Margaret Martonosi,et al. Graphicionado: A high-performance and energy-efficient accelerator for graph analytics , 2016, 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).
[5] Kevin Skadron,et al. Pannotia: Understanding irregular GPGPU graph applications , 2013, 2013 IEEE International Symposium on Workload Characterization (IISWC).
[6] Aart J. C. Bik,et al. Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.
[7] Yu Wang,et al. A Reconfigurable Computing Approach for Efficient and Scalable Parallel Graph Exploration , 2012, 2012 IEEE 23rd International Conference on Application-Specific Systems, Architectures and Processors.
[8] Pradeep Dubey,et al. GraphMat: High performance graph analytics made productive , 2015, Proc. VLDB Endow..
[9] Brian W. Barrett,et al. Introducing the Graph 500 , 2010 .
[10] Paolo Faraboschi,et al. Parallel Graph Processing: Prejudice and State of the Art , 2016, ICPE.
[11] Sizhuo Zhang,et al. GraFBoost: Using Accelerated Flash Storage for External Graph Analytics , 2018, 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA).
[12] Onur Mutlu,et al. Ramulator: A Fast and Extensible DRAM Simulator , 2016, IEEE Computer Architecture Letters.
[13] Joseph Gonzalez,et al. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012, OSDI.
[14] Yu Wang,et al. ForeGraph: Exploring Large-scale Graph Processing on Multi-FPGA Architecture , 2017, FPGA.
[15] Jimmy J. Lin,et al. Information network or social network?: the structure of the twitter follow graph , 2014, WWW.
[16] Yiran Chen,et al. GraphR: Accelerating Graph Processing Using ReRAM , 2017, 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA).
[17] Emilio Frazzoli,et al. A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles , 2016, IEEE Transactions on Intelligent Vehicles.
[18] Yu Wang,et al. GraphH: A Processing-in-Memory Architecture for Large-Scale Graph Processing , 2019, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.
[19] Zhisong Fu,et al. MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs , 2014, GRADES.
[20] Reynold Xin,et al. GraphX: Graph Processing in a Distributed Dataflow Framework , 2014, OSDI.
[21] Keval Vora,et al. CuSha: vertex-centric graph processing on GPUs , 2014, HPDC '14.
[22] David A. Patterson,et al. Locality Exists in Graph Processing: Workload Characterization on an Ivy Bridge Server , 2015, 2015 IEEE International Symposium on Workload Characterization.
[23] John D. Owens,et al. Gunrock: a high-performance graph processing library on the GPU , 2015, PPoPP.
[24] Guy E. Blelloch,et al. Ligra: a lightweight graph processing framework for shared memory , 2013, PPoPP '13.
[25] Ching-Yung Lin,et al. GraphBIG: understanding graph computing in the context of industrial solutions , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.
[26] O. Sporns,et al. Complex brain networks: graph theoretical analysis of structural and functional systems , 2009, Nature Reviews Neuroscience.
[27] Tajana Simunic,et al. GRAM: graph processing in a ReRAM-based computational memory , 2019, ASP-DAC.
[28] Ramyad Hadidi,et al. GraphPIM: Enabling Instruction-Level PIM Offloading in Graph Computing Frameworks , 2017, 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA).
[29] Kiyoung Choi,et al. A scalable processing-in-memory accelerator for parallel graph processing , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).
[30] Jonathan W. Berry,et al. Challenges in Parallel Graph Processing , 2007, Parallel Process. Lett..
[31] Zhijia Zhao,et al. Tigr: Transforming Irregular Graphs for GPU-Friendly Graph Processing , 2018, ASPLOS.
[32] Pengcheng Yao,et al. An efficient graph accelerator with parallel data conflict management , 2018, PACT.
[33] Hosung Park,et al. What is Twitter, a social network or a news media? , 2010, WWW '10.
[34] Keshav Pingali,et al. Atomic-free irregular computations on GPUs , 2013, GPGPU@ASPLOS.
[35] Ron Ho,et al. Modeling and Design of High-Radix On-Chip Crossbar Switches , 2015, NOCS.
[36] Sachin Katti,et al. Parallel Graph Processing on Modern Multi-core Servers: New Findings and Remaining Challenges , 2016, 2016 IEEE 24th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS).
[37] Christoforos E. Kozyrakis,et al. GraphP: Reducing Communication for PIM-Based Graph Processing with Efficient Data Partition , 2018, 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA).
[38] Norman P. Jouppi,et al. CACTI: an enhanced cache access and cycle time model , 1996, IEEE J. Solid State Circuits.
[39] Yu Wang,et al. FPGP: Graph Processing Framework on FPGA A Case Study of Breadth-First Search , 2016, FPGA.
[40] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.
[41] Laxmi N. Bhuyan,et al. Scalable SIMD-Efficient Graph Processing on GPUs , 2015, 2015 International Conference on Parallel Architecture and Compilation (PACT).
[42] Uday Bondhugula,et al. Parallel FPGA-based all-pairs shortest-paths in a directed graph , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.
[43] Timothy A. Davis,et al. The university of Florida sparse matrix collection , 2011, TOMS.
[44] Hyeran Jeon,et al. Graph processing on GPUs: Where are the bottlenecks? , 2014, 2014 IEEE International Symposium on Workload Characterization (IISWC).
[45] Stijn Eyerman,et al. Many-Core Graph Workload Analysis , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.
[46] Ozcan Ozturk,et al. Energy Efficient Architecture for Graph Analytics Accelerators , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).
[47] Christoforos E. Kozyrakis,et al. Memory Hierarchy for Web Search , 2018, 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA).
[48] Yu Wang,et al. GraphSAR: a sparsity-aware processing-in-memory architecture for large-scale graph processing on ReRAMs , 2019, ASP-DAC.
[49] Javier González,et al. Hierarchical graph search for mobile robot path planning , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).
[50] Li Zhao,et al. Analysis and Optimization of the Memory Hierarchy for Graph Processing Workloads , 2019, 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA).
[51] Yafei Dai,et al. Garaph: Efficient GPU-accelerated Graph Processing on a Single Machine with Balanced Replication , 2017, USENIX Annual Technical Conference.
[52] Kunle Olukotun,et al. GraphOps: A Dataflow Library for Graph Analytics Acceleration , 2016, FPGA.