HAccRG: Hardware-Accelerated Data Race Detection in GPUs
暂无分享,去创建一个
[1] Guodong Li,et al. Scalable SMT-based verification of GPU kernel functions , 2010, FSE '10.
[2] Feng Qin,et al. GRace: a low-overhead mechanism for detecting data races in GPU programs , 2011, PPoPP '11.
[3] Jong-Deok Choi,et al. An efficient cache-based access anomaly detection scheme , 1991, ASPLOS IV.
[4] Peng Li,et al. GKLEE: concolic verification and test generation for GPUs , 2012, PPoPP '12.
[5] Sorin Lerner,et al. Verifying GPU kernels by test amplification , 2012, PLDI.
[6] Kun Zhou,et al. Debugging GPU stream programs through automatic dataflow recording and visualization , 2009, SIGGRAPH 2009.
[7] Josep Torrellas,et al. Pacman: Tolerating asymmetric data races with unintrusive hardware , 2012, IEEE International Symposium on High-Performance Comp Architecture.
[8] Josep Torrellas,et al. ReEnact: using thread-level speculation mechanisms to debug data races in multithreaded codes , 2003, ISCA '03.
[9] Rudolf Eigenmann,et al. OpenMP to GPGPU: a compiler framework for automatic translation and optimization , 2009, PPoPP '09.
[10] Josep Torrellas,et al. SigRace: signature-based data race detection , 2009, ISCA '09.
[11] Pin Zhou,et al. HARD: Hardware-Assisted Lockset-based Race Detection , 2007, 2007 IEEE 13th International Symposium on High Performance Computer Architecture.
[12] Henry Wong,et al. Analyzing CUDA workloads using a detailed GPU simulator , 2009, 2009 IEEE International Symposium on Performance Analysis of Systems and Software.
[13] Adam Betts,et al. GPUVerify: a verifier for GPU kernels , 2012, OOPSLA '12.
[14] Burton H. Bloom,et al. Space/time trade-offs in hash coding with allowable errors , 1970, CACM.
[15] Kun Zhou,et al. Debugging GPU stream programs through automatic dataflow recording and visualization , 2009, ACM Trans. Graph..
[16] Milos Prvulovic,et al. CORD: cost-effective (and nearly overhead-free) order-recording and data race detection , 2006, The Twelfth International Symposium on High-Performance Computer Architecture, 2006..
[17] J. Xu. OpenCL – The Open Standard for Parallel Programming of Heterogeneous Systems , 2009 .
[18] Tor M. Aamodt,et al. Thread block compaction for efficient SIMT control flow , 2011, 2011 IEEE 17th International Symposium on High Performance Computer Architecture.
[19] Feng Qin,et al. GMRace: Detecting Data Races in GPU Programs via a Low-Overhead Scheme , 2014, IEEE Transactions on Parallel and Distributed Systems.
[20] Michael Boyer. Automated Dynamic Analysis of CUDA Programs , 2008 .
[21] Michael Burrows,et al. Eraser: a dynamic data race detector for multithreaded programs , 1997, TOCS.