Analytics in/for cloud-an interdependence: A review

Abstract Cloud computing has brought a paradigmatic shift in providing data storage as well as computing resources. With the ever-increasing demand for cloud computing, the number of cloud providers is also increasing evidently, which poses challenges as well as opportunities for consumers and providers. From a consumer point of view, efficient selection of cloud resources at a minimum cost is a big challenge. On the other hand, a provider has to meet consumers’ requirements with sufficient profit in the fiercely competitive market. The relationship between cloud computing is truly symbiotic in the sense that cloud computing makes the practice of analytics more pervasive while analytics makes cloud computing more efficient and optimal in a lot of ways. In addressing these issues, analytics plays an important role. In this paper, we reviewed some important research articles, which focus on cloud computing from the viewpoint of analytics. Analytics and cloud computing are found to be quite interdependent. From analytics perspective, cloud computing makes available high-end computing resources even to an individual customer at an affordable price. We call this thread “Analytics in Cloud”. From the point of view of cloud computing, efficient management, allocation, and demand prediction can be performed using analytics. We call this thread “Analytics for Cloud”. This review paper is mainly based on these two threads of thought process. In this regard, we reviewed eighty-eight research articles published during 2003–2017 related to the formidable duo of cloud computing and analytics.

[1]  Minhaj Ahmad Khan,et al.  A survey of security issues for cloud computing , 2016, J. Netw. Comput. Appl..

[2]  Franck Le,et al.  Optimizing Resource Allocation for Virtualized Network Functions in a Cloud Center Using Genetic Algorithms , 2017, IEEE Transactions on Network and Service Management.

[3]  Jie Lu,et al.  Optimal Cloud Resource Auto-Scaling for Web Applications , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[4]  Naixue Xiong,et al.  A game-theoretic method of fair resource allocation for cloud computing services , 2010, The Journal of Supercomputing.

[5]  Kevin Lee,et al.  Empirical prediction models for adaptive resource provisioning in the cloud , 2012, Future Gener. Comput. Syst..

[6]  Sujeet Kumar Sharma,et al.  Predicting motivators of cloud computing adoption: A developing country perspective , 2016, Comput. Hum. Behav..

[7]  Azzedine Boukerche,et al.  Predictive Dynamic Load Balancing for Large-Scale HLA-based Simulations , 2011, 2011 IEEE/ACM 15th International Symposium on Distributed Simulation and Real Time Applications.

[8]  Chen-Khong Tham,et al.  Evolutionary Optimal Virtual Machine Placement and Demand Forecaster for Cloud Computing , 2011, 2011 IEEE International Conference on Advanced Information Networking and Applications.

[9]  Rajkumar Buyya,et al.  A Particle Swarm Optimization-Based Heuristic for Scheduling Workflow Applications in Cloud Computing Environments , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[10]  Wei Zhong,et al.  The cloud computing load forecasting algorithm based on wavelet support vector machine , 2017, ACSW.

[11]  Rajkumar Buyya,et al.  Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications’ QoS , 2015, IEEE Transactions on Cloud Computing.

[12]  Vadlamani Ravi,et al.  Evolutionary computing applied to customer relationship management: A survey , 2016, Eng. Appl. Artif. Intell..

[13]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[14]  Ashraf A. Shahin Automatic Cloud Resource Scaling Algorithm based on Long Short-Term Memory Recurrent Neural Network , 2017, ArXiv.

[15]  Hejiao Huang,et al.  Clustering based virtual machines placement in distributed cloud computing , 2017, Future Gener. Comput. Syst..

[16]  Abdulmotaleb El-Saddik,et al.  Classification of the state-of-the-art dynamic web services composition techniques , 2006, Int. J. Web Grid Serv..

[17]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[18]  Bernd Freisleben,et al.  Tackling uncertainty in long-term predictions for host overload and underload detection in cloud computing , 2017, Journal of Cloud Computing.

[19]  Luiz Fernando Bittencourt,et al.  Using Time Discretization to Schedule Scientific Workflows in Multiple Cloud Providers , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[20]  Hamed S. Al-Raweshidy,et al.  Taxonomy of Grid Systems , 2010 .

[21]  James A. Thom,et al.  Cloud Computing Security: From Single to Multi-clouds , 2012, 2012 45th Hawaii International Conference on System Sciences.

[22]  Nanjangud C. Narendra,et al.  Resource Demand Prediction in Multi-Tenant Service Clouds , 2013, 2013 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM).

[23]  Badie Farah,et al.  A Model for Managing Uncertainty on the Cloud , 2013 .

[24]  Benjamin Fabian,et al.  Secret Sharing for Health Data in Multi-provider Clouds , 2013, 2013 IEEE 15th Conference on Business Informatics.

[25]  Putchong Uthayopas,et al.  Multi-provider cloud computing network infrastructure optimization , 2016, Future Gener. Comput. Syst..

[26]  Virgílio A. F. Almeida,et al.  Performance by Design - Computer Capacity Planning By Example , 2004 .

[27]  Kaushik Dutta,et al.  Modeling virtualized applications using machine learning techniques , 2012, VEE '12.

[28]  Lúcia Maria de A. Drummond,et al.  Optimization of a Cloud Resource Management Problem from a Consumer Perspective , 2013, Euro-Par Workshops.

[29]  Ainuddin Wahid Abdul Wahab,et al.  Cloud Log Forensics , 2016, ACM Comput. Surv..

[30]  Jan Broeckhove,et al.  Cost-Optimal Scheduling in Hybrid IaaS Clouds for Deadline Constrained Workloads , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[31]  Aniruddha S. Gokhale,et al.  Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[32]  Maryam Amiri,et al.  Survey on prediction models of applications for resources provisioning in cloud , 2017, J. Netw. Comput. Appl..

[33]  Amir Masoud Rahmani,et al.  Load-balancing algorithms in cloud computing: A survey , 2017, J. Netw. Comput. Appl..

[34]  Xiaohui Gu,et al.  UBL: unsupervised behavior learning for predicting performance anomalies in virtualized cloud systems , 2012, ICAC '12.

[35]  Laiping Zhao,et al.  Online Virtual Machine Placement for Increasing Cloud Provider’s Revenue , 2017, IEEE Transactions on Services Computing.

[36]  Carlo Curino,et al.  Performance and resource modeling in highly-concurrent OLTP workloads , 2013, SIGMOD '13.

[37]  Zongpeng Li,et al.  An Online Auction Mechanism for Dynamic Virtual Cluster Provisioning in Geo-Distributed Clouds , 2017, IEEE Transactions on Parallel and Distributed Systems.

[38]  Muthu Ramachandran,et al.  A resiliency framework for an enterprise cloud , 2016, Int. J. Inf. Manag..

[39]  Zhenlong Li,et al.  Big Data and cloud computing: innovation opportunities and challenges , 2017, Int. J. Digit. Earth.

[40]  Tao Li,et al.  ASAP: A Self-Adaptive Prediction System for Instant Cloud Resource Demand Provisioning , 2011, 2011 IEEE 11th International Conference on Data Mining.

[41]  Rahat Iqbal,et al.  Cloud enabled data analytics and visualization framework for health-shocks prediction , 2016, Future Gener. Comput. Syst..

[42]  Isis Truck,et al.  Using Reinforcement Learning for Autonomic Resource Allocation in Clouds: towards a fully automated workflow , 2011 .

[43]  Eddy Caron,et al.  Forecasting for Grid and Cloud Computing On-Demand Resources Based on Pattern Matching , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[44]  Rajkumar Buyya,et al.  Big Data computing and clouds: Trends and future directions , 2013, J. Parallel Distributed Comput..

[45]  G. Bruce Berriman,et al.  On the Use of Cloud Computing for Scientific Workflows , 2008, 2008 IEEE Fourth International Conference on eScience.

[46]  Vadlamani Ravi,et al.  Auto associative Extreme Learning Machine based non-linear principal component regression for big data applications , 2015, 2015 Tenth International Conference on Digital Information Management (ICDIM).

[47]  Anthony K. H. Tung,et al.  A new approach to dynamic self-tuning of database buffers , 2008, TOS.

[48]  Jyh-Horng Chou,et al.  Optimized task scheduling and resource allocation on cloud computing environment using improved differential evolution algorithm , 2013, Comput. Oper. Res..

[49]  Eduardo Lalla-Ruiz,et al.  A cloud brokerage approach for solving the resource management problem in multi-cloud environments , 2016, Comput. Ind. Eng..

[50]  Domenico Talia,et al.  Clouds for Scalable Big Data Analytics , 2013, Computer.

[51]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[52]  Xiaorong Li,et al.  Hybrid Heuristic for Scheduling Data Analytics Workflow Applications in Hybrid Cloud Environment , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[53]  Amir Masoud Rahmani,et al.  Dynamic VMs placement for energy efficiency by PSO in cloud computing , 2016, J. Exp. Theor. Artif. Intell..

[54]  Philippe O. A. Navaux,et al.  A lightweight plug-and-play elasticity service for self-organizing resource provisioning on parallel applications , 2018, Future Gener. Comput. Syst..

[55]  Srinath Perera,et al.  Solution Recommender for System Failure Recovery via Log Event Pattern Matching on a Knowledge Graph: Demo , 2017, DEBS.

[56]  Timo Aho,et al.  Designing IDE as a Service , 2013 .

[57]  D. Milojicic,et al.  A mechanism to measure quality-of-service in a federated cloud environment , 2012, FederatedClouds '12.

[58]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[59]  Dana Petcu,et al.  Multi-Cloud: expectations and current approaches , 2013, MultiCloud '13.

[60]  Bo Cheng,et al.  An adaptive prediction approach based on workload pattern discrimination in the cloud , 2017, J. Netw. Comput. Appl..

[61]  Javier García,et al.  Optimal allocation of virtual machines in multi-cloud environments with reserved and on-demand pricing , 2017, Future Gener. Comput. Syst..

[62]  Kalyanmoy Deb,et al.  Financial time series prediction using hybrids of chaos theory, multi-layer perceptron and multi-objective evolutionary algorithms , 2017, Swarm Evol. Comput..

[63]  Vadlamani Ravi,et al.  Credit Card Fraud Detection using Big Data Analytics: Use of PSOAANN based One-Class Classification , 2016, ICIA.

[64]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[65]  Bu-Sung Lee,et al.  Optimization of Resource Provisioning Cost in Cloud Computing , 2012, IEEE Transactions on Services Computing.

[66]  Peter Stone,et al.  CARVE: A Cognitive Agent for Resource Value Estimation , 2008, 2008 International Conference on Autonomic Computing.

[67]  Dave Cliff,et al.  Forecasting Demand for Cloud Computing Resources - An Agent-based Simulation of a Two Tiered Approach , 2012, ICAART.

[68]  Kashi Venkatesh Vishwanath,et al.  Characterizing cloud computing hardware reliability , 2010, SoCC '10.

[69]  Tao Li,et al.  Cloud Analytics for Capacity Planning and Instant VM Provisioning , 2013, IEEE Transactions on Network and Service Management.

[70]  Lúcia Maria de A. Drummond,et al.  Optimizing virtual machine allocation for parallel scientific workflows in federated clouds , 2015, Future Gener. Comput. Syst..

[71]  Rajkumar Buyya,et al.  SLA-Based Resource Scheduling for Big Data Analytics as a Service in Cloud Computing Environments , 2015, 2015 44th International Conference on Parallel Processing.

[72]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[73]  John Hawkins,et al.  A Commodity-Focused Multi-cloud Marketplace Exemplar Application , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[74]  Vadlamani Ravi,et al.  Forecasting financial time series volatility using Particle Swarm Optimization trained Quantile Regression Neural Network , 2017, Appl. Soft Comput..

[75]  Ivan Porres,et al.  Prediction-based VM provisioning and admission control for multi-tier web applications , 2016, Journal of Cloud Computing.

[76]  Erik-Jan van Baaren,et al.  WikiBench: A distributed, Wikipedia based web application benchmark , 2009 .

[77]  Dursun Delen,et al.  Leveraging the capabilities of service-oriented decision support systems: Putting analytics and big data in cloud , 2013, Decis. Support Syst..

[78]  Willy Zwaenepoel,et al.  Performance and scalability of EJB applications , 2002, OOPSLA '02.

[79]  Vadlamani Ravi,et al.  A survey on opinion mining and sentiment analysis: Tasks, approaches and applications , 2015, Knowl. Based Syst..

[80]  Carlo Curino,et al.  DBSeer: Resource and Performance Prediction for Building a Next Generation Database Cloud , 2013, CIDR.

[81]  Wensheng Tang,et al.  Multi-valued collaborative QoS prediction for cloud service via time series analysis , 2017, Future Gener. Comput. Syst..

[82]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[83]  Prasad Saripalli,et al.  Load Prediction and Hot Spot Detection Models for Autonomic Cloud Computing , 2011, 2011 Fourth IEEE International Conference on Utility and Cloud Computing.

[84]  Mohammad Masdari,et al.  A Survey of PSO-Based Scheduling Algorithms in Cloud Computing , 2016, Journal of Network and Systems Management.

[85]  Isis Truck,et al.  From Data Center Resource Allocation to Control Theory and Back , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[86]  Reynold Xin,et al.  Apache Spark , 2016 .

[87]  Timothy Wood,et al.  Predicting Application Resource Requirements in Virtual Environments , 2008 .

[88]  Robert L. Grossman,et al.  Data mining using high performance data clouds: experimental studies using sector and sphere , 2008, KDD.

[89]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[90]  Arindam Banerjee,et al.  Data Analytics: Hyped Up Aspirations or True Potential? , 2013 .

[91]  M. Ashraful Amin,et al.  Neural network and regression based processor load prediction for efficient scaling of Grid and Cloud resources , 2011, 14th International Conference on Computer and Information Technology (ICCIT 2011).

[92]  Zibin Zheng,et al.  WSPred: A Time-Aware Personalized QoS Prediction Framework for Web Services , 2011, 2011 IEEE 22nd International Symposium on Software Reliability Engineering.

[93]  Dimosthenis Kyriazis,et al.  4CaaSt marketplace: An advanced business environment for trading cloud services , 2014, Future Gener. Comput. Syst..

[94]  Yao-Jen Chang,et al.  DPRA: Dynamic Power-Saving Resource Allocation for Cloud Data Center Using Particle Swarm Optimization , 2018, IEEE Systems Journal.

[95]  Athman Bouguettaya,et al.  Genetic Algorithm Based QoS-Aware Service Compositions in Cloud Computing , 2011, DASFAA.