Big Data in the Background: Maximizing Productivity while Minimizing Virtual Machine Interference

Despite the use of virtualization to share individual servers, data centers are still often only lightly loaded, leaving large amounts of spare capacity idle. In some ways, Big Data applications are an ideal fit for using this excess capacity to perform meaningful work, yet the high level of interference between interactive and batch processing workloads currently prevents this from being a practical solution in virtualized environments. In this paper we study both the amount of spare capacity that could potentially be used for Big Data processing, and the limitations of current interference isolation techniques. Within our university data center we find that an average of 90%, 53%, and 89% of CPU, memory, and disk IO capacity are left idle; an analysis of Wikipedia’s 137 application servers shows similar results. We then evaluate the impact of colocating a Hadoop VM with an interactive website, and find that median web server response time can increase by over seven times. Finally, we propose how scheduling mechanisms in both the virtualization layer and the Hadoop job tracker could be enhanced to improve this situation.

[1]  Jennifer Rexford,et al.  NoHype: virtualized cloud infrastructure without the virtualization , 2010, ISCA.

[2]  Jian Li,et al.  Performance Enhancement for Network I/O Virtualization with Efficient Interrupt Coalescing and Virtual Receive-Side Scaling , 2013, IEEE Transactions on Parallel and Distributed Systems.

[3]  Prashant J. Shenoy,et al.  BenchLab: An Open Testbed for Realistic Benchmarking of Web Applications , 2011, WebApps.

[4]  Boon Thau Loo,et al.  Automated profiling and resource management of pig programs for meeting service level objectives , 2012, ICAC '12.

[5]  Kun Wang,et al.  Optimizing virtual machine scheduling in NUMA multicore systems , 2013, 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA).

[6]  Hyong S. Kim,et al.  DPack: Disk scheduler for highly consolidated cloud , 2013, 2013 Proceedings IEEE INFOCOM.

[7]  Radu Sion,et al.  Enhancement of Xen's scheduler for MapReduce workloads , 2011, HPDC '11.

[8]  Cho-Li Wang,et al.  Network performance isolation for latency-sensitive cloud applications , 2013, Future Gener. Comput. Syst..

[9]  Albert G. Greenberg,et al.  The cost of a cloud: research problems in data center networks , 2008, CCRV.

[10]  Chita R. Das,et al.  HybridMR: A Hierarchical MapReduce Scheduler for Hybrid Data Centers , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems.

[11]  Carl A. Waldspurger,et al.  Memory resource management in VMware ESX server , 2002, OSDI '02.

[12]  Chenyang Lu,et al.  RT-Xen: Towards real-time hypervisor scheduling in Xen , 2011, 2011 Proceedings of the Ninth ACM International Conference on Embedded Software (EMSOFT).

[13]  Timothy Wood,et al.  Adaptive dynamic priority scheduling for virtual desktop infrastructures , 2012, 2012 IEEE 20th International Workshop on Quality of Service.

[14]  Brian D. Noble,et al.  Bobtail: Avoiding Long Tails in the Cloud , 2013, NSDI.