An Experiment on Bare-Metal BigData Provisioning

Many BigData customers use on-demand platforms in the cloud, where they can get a dedicated virtual cluster in a couple of minutes and pay only for the time they use. Increasingly, there is a demand for bare-metal bigdata solutions for applications that cannot tolerate the unpredictability and performance degradation of virtualized systems. Existing bare-metal solutions can introduce delays of 10s of minutes to provision a cluster by installing operating systems and applications on the local disks of servers. This has motivated recent research developing sophisticated mechanisms to optimize this installation. These approaches assume that using network mounted boot disks incur unacceptable run-time overhead. Our analysis suggest that while this assumption is true for application data, it is incorrect for operating systems and applications, and network mounting the boot disk and applications result in negligible run-time impact while leading to faster provisioning time.

[1]  Peter Desnoyers,et al.  Hardware as a service - enabling dynamic, user-level bare metal provisioning of pools of data center resources. , 2014 .

[2]  Alexandru Iosup,et al.  Performance Analysis of Cloud Computing Services for Many-Tasks Scientific Computing , 2011, IEEE Transactions on Parallel and Distributed Systems.

[3]  Larry Kaplan,et al.  The Gemini System Interconnect , 2010, 2010 18th IEEE Symposium on High Performance Interconnects.

[4]  Riccardo Gusella The Analysis of Diskless Workstation Traffic on an Ethernet , 1987 .

[5]  Hovav Shacham,et al.  Hey, you, get off of my cloud: exploring information leakage in third-party compute clouds , 2009, CCS.

[6]  Phil Simon Big Data Solutions , 2012 .

[7]  Geoffrey C. Fox,et al.  High Performance Parallel Computing with Clouds and Cloud Technologies , 2009, CloudComp.

[8]  Christian Engelmann,et al.  Evaluating the Shared Root File System Approach for Diskless High-Performance Computing Systems , 2009 .

[9]  Kazuhiko Kato,et al.  Improving Agility and Elasticity in Bare-metal Clouds , 2015, ASPLOS.

[10]  Mike Hibler,et al.  Automatic Online Validation of Network Configuration in the Emulab Network Testbed , 2006, 2006 IEEE International Conference on Autonomic Computing.

[11]  Garth A. Gibson,et al.  A Comparative Study of Baremetal Provisioning Frameworks , 2015 .

[12]  Eva Hocks,et al.  Gordon: design, performance, and experiences deploying and supporting a data intensive supercomputer , 2012, XSEDE '12.