A Systematic Review of Cloud Computing, Big Data and Databases on the Cloud

Cloud computing has emerged as an initiative which offers great promise in improving access to computational resources that would otherwise be unattainable due to sheer cost. While some cloud computing concepts date back to the 1950s, it is recent new cloud architecture and platforms that shape the way that resources are leased using service-based models. However, some confusion exists regarding the relationship between cloud-based models and challenges in managing big data. Some attempt to solve the problem by replacing and upgrading physical infrastructures, while others look to intelligent software to improve the scalability of data analytics. What also remains unclear is the definition and positioning of cloud-orientated paradigms. This is important to establish as it gets to the heart of where the underlying challenges exist in terms of availability, virtualisation, partitioning and distribution, scalability and elasticity, and performance bottlenecks when managing data. The goal of this systematic review is to provide insight into the current state of cloud computing and big data research. We find that challenges have been gaining momentum in this area from 2008 to 2013. In this study, using a systematic review framework, 129 publications are evaluated. We conclude that the current cloud-computing based frameworks are potentially neglecting fundamental database properties regarding atomicity and durability issues.

[1]  Richard T. Watson,et al.  Analyzing the Past to Prepare for the Future: Writing a Literature Review , 2002, MIS Q..

[2]  Michael Stonebraker,et al.  The Future of Scientific Data Bases , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[3]  Gordon Bell,et al.  Beyond the Data Deluge , 2009, Science.

[4]  Adam Barker,et al.  A Cloud Computing Survey: Developments and Future Trends in Infrastructure as a Service Computing , 2013, ArXiv.

[5]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..

[6]  Bao Rong Chang,et al.  Assessment of Hypervisor and Shared Storage for Cloud Computing Server , 2012, 2012 Third International Conference on Innovations in Bio-Inspired Computing and Applications.

[7]  Carlo Curino,et al.  DBSeer: Resource and Performance Prediction for Building a Next Generation Database Cloud , 2013, CIDR.

[8]  Carlo Curino,et al.  Schism , 2010, Proc. VLDB Endow..

[9]  Rajesh Ingle,et al.  Scalable transactions in cloud data stores , 2015, 2013 3rd IEEE International Advance Computing Conference (IACC).

[10]  Helen D. Karatza,et al.  Evaluation of gang scheduling performance and cost in a cloud computing system , 2010, The Journal of Supercomputing.

[11]  L. Youseff,et al.  Toward a Unified Ontology of Cloud Computing , 2008, 2008 Grid Computing Environments Workshop.

[12]  Jacky W. Keung,et al.  Cloud Deployment Model Selection Assessment for SMEs: Renting or Buying a Cloud , 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[13]  Paula Kotzé,et al.  Secure cloud computing: Benefits, risks and controls , 2011, 2011 Information Security for South Africa.

[14]  J. Gregory Morrisett,et al.  Toward a verified relational database management system , 2010, POPL '10.

[15]  Michael Stonebraker,et al.  MapReduce and parallel DBMSs: friends or foes? , 2010, CACM.

[16]  Yun Chi,et al.  CloudDB: One Size Fits All Revived , 2010, 2010 6th World Congress on Services.

[17]  Divyakant Agrawal,et al.  Big data and cloud computing: current state and future opportunities , 2011, EDBT/ICDT '11.

[18]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[19]  Divyakant Agrawal,et al.  Database Scalability, Elasticity, and Autonomy in the Cloud - (Extended Abstract) , 2011, DASFAA.

[20]  Barbara Kitchenham,et al.  Procedures for Performing Systematic Reviews , 2004 .

[21]  Roman Beck,et al.  Does Cloud Computing Matter? An Analysis of the Cloud Model Software-as-a-Service and Its Impact on Operational Agility , 2013, 2013 46th Hawaii International Conference on System Sciences.

[22]  Dan Suciu Big Data Begets Big Database Theory , 2013, BNCOD.

[23]  Divyakant Agrawal,et al.  ElasTraS: An elastic, scalable, and self-managing transactional database for the cloud , 2013, TODS.

[24]  Mark Woodman,et al.  Success Dimensions in Selecting Cloud Software Services , 2011, 2011 37th EUROMICRO Conference on Software Engineering and Advanced Applications.

[25]  Nick Roussopoulos,et al.  An Adaptable Methodology for Database Design , 1984, Computer.

[26]  Roger King,et al.  A database design methodology and tool for information systems , 1985, TOIS.

[27]  Xing Xu,et al.  Cloud Task and Virtual Machine Allocation Strategy in Cloud Computing Environment , 2012 .

[28]  Katarzyna Musial,et al.  Next challenges for adaptive learning systems , 2012, SKDD.

[29]  Ness B. Shroff,et al.  On distributed computation rate optimization for deploying cloud computing programming frameworks , 2013, PERV.