Integrated QoS-aware Resource Provisioning for Parallel and Distributed Applications

With more parallel and distributed applications moving to Cloud and data centers, it is challenging to provide predictable and controllable resources to multiple tenants, and thus guarantee application performance. In this paper, we propose an integrated QoS-aware resource provisioning platform based on virtualization technology for computing, storage and network resources. Coarse-grained CPU mapping and fine-grained CPU scheduling mechanisms are proposed to enable adjustable computing power. A hierarchical distributed scheduling mechanism is implemented on a scalable storage system to guarantee I/O throughput for individual tenants and applications. A network manager has also been developed to guarantee the data transmission rate. Web-based interface enables users to monitor real time resource utilization and to adjust resource QoS levels on the fly. According to our experimental results, the resource cost can be saved up to 45% without degrading the performance of a distributed data processing benchmark, and the performance of a parallel agent-based simulation can be improved by 91% using the same amount of resources.

[1]  Arif Merchant,et al.  Proportional-Share Scheduling for Distributed Storage Systems , 2007, FAST.

[2]  Xiaohui Gu,et al.  CloudScale: elastic resource scaling for multi-tenant cloud systems , 2011, SoCC.

[3]  Gabriele D'Angelo,et al.  Parallel and distributed simulation from many cores to the public cloud , 2011, 2011 International Conference on High Performance Computing & Simulation.

[4]  Bingsheng He,et al.  A Declarative Optimization Engine for Resource Provisioning of Scientific Workflows in IaaS Clouds , 2015, HPDC.

[5]  Prashant J. Shenoy,et al.  Empirical evaluation of latency-sensitive application performance in the cloud , 2010, MMSys '10.

[6]  Dinil Mon Divakaran,et al.  Towards Flexible Guarantees in Clouds: Adaptive Bandwidth Allocation and Pricing , 2015, IEEE Transactions on Parallel and Distributed Systems.

[7]  Saeed Sharifian,et al.  A new model for virtual machine migration in virtualized cluster server based on Fuzzy Decision Making , 2010, ArXiv.

[8]  Alexandru Iosup,et al.  Efficient management of data center resources for Massively Multiplayer Online Games , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[9]  Gabriele D'Angelo,et al.  Parallel and Distributed Simulation from Many Cores to the Public Cloud (Extended Version) , 2011, ArXiv.

[10]  Dhananjai M. Rao,et al.  Modeling and analysis of global epidemiology of avian influenza , 2009, Environ. Model. Softw..

[11]  Guilherme Piegas Koslovski,et al.  Joint Elastic Cloud and Virtual Network Framework for Application Performance-cost Optimization , 2010, Journal of Grid Computing.

[12]  Tadashi Yamazaki,et al.  Simulation Platform: A cloud-based online simulation environment , 2011, Neural Networks.

[13]  Yu Zhang,et al.  Two-Level Storage QoS to Manage Performance for Multiple Tenants with Multiple Workloads , 2014, 2014 IEEE 6th International Conference on Cloud Computing Technology and Science.

[14]  Bhakti S. S. Onggo,et al.  A User Interface for Large-Scale Demographic Simulation , 2014, 2014 IEEE 6th International Conference on Cloud Computing Technology and Science.

[15]  H. Casanova,et al.  Accuracy and Responsiveness of CPU Sharing Using Xen's Cap Values , 2008 .

[16]  Andrzej M. Goscinski,et al.  Execution of Compute Intensive Applications on Hybrid Clouds (Case Study with mpiBLAST) , 2012, 2012 Sixth International Conference on Complex, Intelligent, and Software Intensive Systems.

[17]  Judy Qiu,et al.  Cloud Computing for Data-Intensive Applications , 2014, Springer New York.

[18]  David R. Jefferson,et al.  Virtual time , 1985, ICPP.

[19]  Srikanth B. Yoginath,et al.  Optimized hypervisor scheduler for parallel discrete event simulations on virtual machine platforms , 2013, SimuTools.

[20]  Evgenia Smirni,et al.  Predictive VM consolidation on multiple resources: Beyond load balancing , 2013, 2013 IEEE/ACM 21st International Symposium on Quality of Service (IWQoS).

[21]  Stephen John Turner,et al.  Accelerating optimistic HLA-based simulations in virtual execution environments , 2013, SIGSIM PADS '13.

[22]  Dhananjai M. Rao Study of Dynamic Component Substitutions , 2003 .

[23]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[24]  IEEE Standard for Modeling and Simulation (M&S) High Level Architecture (HLA) — Framework and Rules , 2001 .

[25]  Wentong Cai,et al.  QoS-Aware Revenue-Cost Optimization for Latency-Sensitive Services in IaaS Clouds , 2012, 2012 IEEE/ACM 16th International Symposium on Distributed Simulation and Real Time Applications.

[26]  Dhananjai Madhava Rao Accelerating parallel agent-based epidemiological simulations , 2014, SIGSIM PADS '14.

[27]  Brian J. Henz,et al.  Taming Wild Horses: The Need for Virtual Time-Based Scheduling of VMs in Network Simulations , 2012, 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[28]  José Manuel Viegas,et al.  An agent‐based simulation model to assess the impacts of introducing a shared‐taxi system: an application to Lisbon (Portugal) , 2015 .