Systematic Mapping Study on Performance Scalability in Big Data on Cloud Using VM and Container

In recent years, big data and cloud computing have gained importance in IT and business. These two technologies are becoming complementing in a way that the former requires large amount of storage and computation power, which are the key enabler technologies of Big Data; the latter, cloud computing, brings the opportunity to scale on-demand computation power and provides massive quantities of storage space. Until recently, the only technique used in computation resource utilization was based on the hypervisor, which is used to create the virtual machine. Nowadays, another technique, which claims better resource utilization, called “container” is becoming popular. This technique is otherwise known as “lightweight virtualization” since it creates completely isolated virtual environments on top of underlying operating systems. The main objective of this study is to clarify the research area concerned with performance issues using VM and container in big data on cloud, and to give a direction for future research.

[1]  Miika Komu,et al.  Hypervisors vs. Lightweight Virtualization: A Performance Comparison , 2015, 2015 IEEE International Conference on Cloud Engineering.

[2]  Xiaohong Jiang,et al.  Scalability Analysis and Improvement of Hadoop Virtual Cluster with Cost Consideration , 2014, 2014 IEEE 7th International Conference on Cloud Computing.

[3]  Ramakrishnan Rajamony,et al.  An updated performance comparison of virtual machines and Linux containers , 2015, 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[4]  Verena Kantere,et al.  I/O Performance Modeling for Big Data Applications over Cloud Infrastructures , 2015, 2015 IEEE International Conference on Cloud Engineering.

[5]  Yang Yang,et al.  Impacts of Virtualization Technologies on Hadoop , 2013, 2013 Third International Conference on Intelligent System Design and Engineering Applications.

[6]  Kai Petersen,et al.  Guidelines for conducting systematic mapping studies in software engineering: An update , 2015, Inf. Softw. Technol..

[7]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[8]  Pedro Roger Magalhães Vasconcelos,et al.  Performance analysis of Hadoop MapReduce on an OpenNebula cloud with KVM and OpenVZ virtualizations , 2014, The 9th International Conference for Internet Technology and Secured Transactions (ICITST-2014).