VOCL: An optimized environment for transparent virtualization of graphics processing units
暂无分享,去创建一个
Wu-chun Feng | Gaojin Wen | Qian Zhu | Heshan Lin | Jue Hong | Shucai Xiao | P. Balaji | R. Thakur | S. Coghlan | R. Thakur | Heshan Lin | P. Balaji | S. Coghlan | Wu-chun Feng | Jue Hong | Qian Zhu | Gaojin Wen | S. Xiao
[1] Rajeev Thakur,et al. Toward Efficient Support for Multithreaded MPI Communication , 2008, PVM/MPI.
[2] Federico Silla,et al. An Efficient Implementation of GPU Virtualization in High Performance Clusters , 2009, Euro-Par Workshops.
[3] Jason N. Dale,et al. Cell Broadband Engine Architecture and its first implementation - A performance view , 2007, IBM J. Res. Dev..
[4] M S Waterman,et al. Identification of common molecular subsequences. , 1981, Journal of molecular biology.
[5] A. Barak,et al. The MOSIX Cluster Operating System for High-Performance Computing on Linux Clusters, Multi-Clusters, GPU Clusters and Clouds , 2011 .
[6] Wu-chun Feng,et al. On the Robust Mapping of Dynamic Programming onto a Graphics Processing Unit , 2009, 2009 15th International Conference on Parallel and Distributed Systems.
[7] Aaftab Munshi,et al. The OpenCL specification , 2009, 2009 IEEE Hot Chips 21 Symposium (HCS).
[8] Bronis R. de Supinski,et al. Minimizing MPI Resource Contention in Multithreaded Multicore Environments , 2010, 2010 IEEE International Conference on Cluster Computing.
[9] Fumihiko Ino,et al. Design and implementation of the Smith-Waterman algorithm on the CUDA-compatible GPU , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.
[10] Rajeev Thakur,et al. Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems , 2010, EuroMPI.
[11] Ali Akoglu,et al. Sequence alignment with GPU: Performance and design challenges , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[12] Collin McCurdy,et al. The Scalable Heterogeneous Computing (SHOC) benchmark suite , 2010, GPGPU-3.
[13] ChenT.,et al. Cell Broadband Engine Architecture and its first implementation—A view , 2007 .
[14] Klaus Schulten,et al. GPU acceleration of cutoff pair potentials for molecular modeling applications , 2008, CF '08.
[15] Federico Silla,et al. Performance of CUDA Virtualized Remote GPUs in High Performance Clusters , 2011, 2011 International Conference on Parallel Processing.
[16] Giorgio Valle,et al. CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment , 2008, BMC Bioinformatics.
[17] Rajeev Thakur,et al. Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming , 2010, Int. J. High Perform. Comput. Appl..
[18] Amnon Barak,et al. The MOSIX Virtual OpenCL (VCL) Cluster Platform , 2011 .
[19] Kim M. Hazelwood,et al. Where is the data? Why you cannot debate CPU vs. GPU performance without the answer , 2011, (IEEE ISPASS) IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE.
[20] Kenli Li,et al. vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines , 2012, IEEE Trans. Computers.
[21] Federico Silla,et al. rCUDA: Reducing the number of GPU-based accelerators in high performance clusters , 2010, 2010 International Conference on High Performance Computing & Simulation.
[22] Orion S. Lawlor,et al. Message passing for GPGPU clusters: CudaMPI , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.