vGASA: Adaptive Scheduling Algorithm of Virtualized GPU Resource in Cloud Gaming

As the virtualization technology for GPUs matures, cloud gaming has become an emerging application among cloud services. In addition to the poor default mechanisms of GPU resource sharing, the performance of cloud games is inevitably undermined by various runtime uncertainties such as rendering complex game scenarios. The question of how to handle the runtime uncertainties for GPU resource sharing remains unanswered. To address this challenge, we propose vGASA, a virtualized GPU resource adaptive scheduling algorithm in cloud gaming. vGASA interposes scheduling algorithms in the graphics API of the operating system, and hence the host graphic driver or the guest operating system remains unmodified. To fulfill the service level agreement as well as maximize GPU usage, we propose three adaptive scheduling algorithms featuring feedback control that mitigates the impact of the runtime uncertainties on the system performance. The experimental results demonstrate that vGASA is able to maintain frames per second of various workloads at the desired level with the performance overhead limited to 5-12 percent.

[1]  Kai Ma,et al.  Adaptive Power Control with Online Model Estimation for Chip Multiprocessors , 2011, IEEE Transactions on Parallel and Distributed Systems.

[2]  Federico Silla,et al.  rCUDA: Reducing the number of GPU-based accelerators in high performance clusters , 2010, 2010 International Conference on High Performance Computing & Simulation.

[3]  Chao Zhang,et al.  VGRIS: Virtualized GPU Resource Isolation and Scheduling in Cloud Gaming , 2014, ACM Trans. Archit. Code Optim..

[4]  Vanish Talwar,et al.  GViM: GPU-accelerated virtual machines , 2009, HPCVirt '09.

[5]  Lawrence M. Murray,et al.  GPU Acceleration of Runge-Kutta Integrators , 2012, IEEE Transactions on Parallel and Distributed Systems.

[6]  Yin Wang,et al.  VGRIS: Virtualized GPU Resource Isolation and Scheduling in Cloud Gaming , 2013, TACO.

[7]  Vanish Talwar,et al.  Pegasus: Coordinated Scheduling for Virtualized Accelerator-based Systems , 2011, USENIX Annual Technical Conference.

[8]  Lin Shi,et al.  vCUDA: GPU accelerated high performance computing in virtual machines , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[9]  Yuting Zhang,et al.  Friendly virtual machines: leveraging a feedback-control model for application adaptation , 2005, VEE '05.

[10]  Frank Mueller,et al.  Autogeneration and Autotuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters , 2013, IEEE Transactions on Parallel and Distributed Systems.

[11]  Filip De Turck,et al.  Thin Client Computing Solutions in Low- and High-Motion Scenarios , 2007, International Conference on Networking and Services (ICNS '07).

[12]  Limin Xiao,et al.  Dynamic Resource Allocation Based on User experience in Virtualized Servers , 2011 .

[13]  Filip De Turck,et al.  A hybrid thin-client protocol for multimedia streaming and interactive gaming applications , 2006, NOSSDAV '06.

[14]  صبحي حمدان أبو جلالة,et al.  فعالية استخدام خريطة المفاهيم للشكل "Vee" في الدراسة العلمية لمادة الفيزياء في التحصيل واكتساب عمليات العلم لدى طالبات السنة الثالثة بكلية التربية بعبري - سلطنة عمان , 2001 .

[15]  Ivan Porres,et al.  Feedback Control Algorithms to Deploy and Scale Multiple Web Applications per Virtual Machine , 2012, 2012 38th Euromicro Conference on Software Engineering and Advanced Applications.

[16]  Kenli Li,et al.  vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines , 2012, IEEE Trans. Computers.

[17]  Fumihiko Ino,et al.  Sequence Homology Search Using Fine Grained Cycle Sharing of Idle GPUs , 2012, IEEE Transactions on Parallel and Distributed Systems.

[18]  Tarek S. Abdelrahman,et al.  hiCUDA: High-Level GPGPU Programming , 2011, IEEE Transactions on Parallel and Distributed Systems.

[19]  Shinpei Kato,et al.  TimeGraph: GPU Scheduling for Real-Time Multi-Tasking Environments , 2011, USENIX Annual Technical Conference.

[20]  Vishakha Gupta,et al.  Shadowfax: scaling in heterogeneous cluster systems via GPGPU assemblies , 2011, VTDC '11.

[21]  Andrew Warfield,et al.  Xen and the art of virtualization , 2003, SOSP '03.

[22]  Federico Silla,et al.  An Efficient Implementation of GPU Virtualization in High Performance Clusters , 2009, Euro-Par Workshops.

[23]  Sang-Min Park,et al.  Predictable High-Performance Computing Using Feedback Control and Admission Control , 2011, IEEE Transactions on Parallel and Distributed Systems.

[24]  Jeremy Sugerman,et al.  GPU virtualization on VMware's hosted I/O architecture , 2008, OPSR.

[25]  Kang G. Shin,et al.  Adaptive control of virtualized resources in utility computing environments , 2007, EuroSys '07.

[26]  Alessandro De Gloria,et al.  Platform for Distributed 3D Gaming , 2009, Int. J. Comput. Games Technol..

[27]  Katsuhiko Ogata,et al.  Discrete-time control systems , 1987 .