A Common GPU n-Dimensional Array for Python and C
暂无分享,去创建一个
Currently there are multiple incompatible array/matrix/n-dimensional base object implementations for GPUs. This hinders the sharing of GPU code and causes duplicate development work. This paper proposes and presents a first version of a common GPU n-dimensional array (tensor) named GpuNdArray [1] that works with both CUDA and OpenCL. It will be usable from Python, C, and possibly other programming languages.
[1] Nicolas Pinto,et al. PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation , 2009, Parallel Comput..
[2] Volodymyr Mnih,et al. CUDAMat: a CUDA-based matrix class for Python , 2009 .
[3] Travis E. Oliphant,et al. Python for Scientific Computing , 2007, Computing in Science & Engineering.