Performance of CUDA Virtualized Remote GPUs in High Performance Clusters
Federico Silla | Enrique S. Quintana-Ortí | Antonio J. Peña | José Duato | Rafael Mayo
[1] Jeremy Sugerman, et al. GPU virtualization on VMware's hosted I/O architecture, 2008, OPSR.
[2] James Demmel, et al. Benchmarking GPUs to tune dense linear algebra, 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[3] John Nagle, et al. Congestion control in IP/TCP internetworks, 1984, CCRV.
[4] Enrique S. Quintana-Ortí, et al. Exploiting the capabilities of modern GPUs for dense matrix computations, 2009, Concurr. Comput. Pract. Exp.
[5] Ahmad Afsahi, et al. 10-Gigabit iWARP Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G, 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.
[6] Giulio Giunta, et al. A GPGPU Transparent Virtualization Component for High Performance Computing Clouds, 2010, Euro-Par.
[7] William R. Mark, et al. Cg: a system for programming graphics hardware in a C-like language, 2003, ACM Trans. Graph.
[8] Vanish Talwar, et al. GViM: GPU-accelerated virtual machines, 2009, HPCVirt '09.
[9] Weiguo Liu, et al. Bio-sequence database scanning on a GPU, 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.
[10] Eyal de Lara, et al. VMM-independent graphics acceleration, 2007, VEE '07.
[11] Venkata Krishnan. Towards an integrated IO and clustering solution using PCI express, 2007, 2007 IEEE International Conference on Cluster Computing.
[12] Rafael Mayo, et al. Attaining High Performance in General-Purpose Computations on Current Graphics Processors, 2008, VECPAR.
[13] Lin Shi, et al. vCUDA: GPU accelerated high performance computing in virtual machines, 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[14] Federico Silla, et al. An Efficient Implementation of GPU Virtualization in High Performance Clusters, 2009, Euro-Par Workshops.
[15] Sudhakar Yalamanchili, et al. Extending HyperTransport Protocol for Improved Scalability, 2009.
[16] David Blythe. The Direct3D 10 system, 2006, ACM Trans. Graph.
[17] Roger L. Davis, et al. Rapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units, 2009.
[18] Amnon Barak, et al. A package for OpenCL based heterogeneous computing on clusters with many GPU devices, 2010, 2010 IEEE International Conference On Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS).