Concurrent Kernel Execution on Xeon Phi within Parallel Heterogeneous Workloads
暂无分享,去创建一个
[1] T. Steinke,et al. On Improving the Performance of Multi-threaded CUDA Applications with Concurrent Kernel Execution by Kernel Reordering , 2012, 2012 Symposium on Application Accelerators in High Performance Computing.
[2] Antonio González,et al. A Performance and Area Efficient Architecture for Intrusion Detection Systems , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.
[3] Tarek El-Ghazawi,et al. Towards efficient GPU sharing on multicore processors , 2011, PMBS '11.
[4] Ioan Raicu,et al. Understanding the Costs of Many-Task Computing Workloads on Intel Xeon Phi Coprocessors , 2013 .
[5] Ravi Narayanaswamy,et al. Offload Compiler Runtime for the Intel® Xeon Phi Coprocessor , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.
[6] James Reinders,et al. Intel Xeon Phi Coprocessor High Performance Programming , 2013 .
[7] Thomas Steinke,et al. Multi-threaded Kernel Offloading to GPGPU Using Hyper-Q on Kepler Architecture , 2014 .
[8] Wen-mei W. Hwu,et al. GPU Computing Gems Jade Edition , 2011 .
[9] Stephen A. Jarvis,et al. Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.