Improving Memory Access Efficiency for Heterogeneous Multiprocessor Systems

Recent trends show that using mobile GPUs to support 3D graphics and H.264 decoding based on OpenGL ES on smart phones are prevalent. To enhance the performance of both CPU and GPU, Heterogeneous System Architecture (HSA) is suggested which is one of methods accessing different kinds of memory using virtual memory structures. In this study, we suggest a method called Fusion Processor Architecture for accessing memory from CPU and GPU without copying contexts between them. We introduce control mechanisms including Job Manager, Re-mapper, and Pre-fetcher to control and distribute data between the two processors. Using these methods help reducing message exchanges and page-table requests between CPU and GPU, which in turn increasing efficiency of the two devices. Evaluations compared to Compute Unified Device Architecture (CUDA), OpenMP, and OpenCL prove the suggested mechanism works 198% faster on a QEMU emulation environment.

[1]  Yarsun Hsu,et al.  A Reliable and Secure GPU-Assisted File System , 2014, ICA3PP.

[2]  Daniel J. Sorin,et al.  Evaluating cache coherent shared virtual memory for heterogeneous multicore chips , 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[3]  Gustavo Alonso,et al.  Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware , 2012, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[4]  Dhabaleswar K. Panda,et al.  Optimizing MPI Communication on Multi-GPU Systems Using CUDA Inter-Process Communication , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[5]  David A. Wood,et al.  Heterogeneous system coherence for integrated CPU-GPU systems , 2013, 2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[6]  Abhishek Bhattacharjee,et al.  Architectural support for address translation on GPUs: designing memory management units for CPU/GPUs with unified address spaces , 2014, ASPLOS.