Adaptive, scalable and reliable monitoring of big data on clouds

Real-time monitoring of cloud resources is crucial for a variety of tasks such as performance analysis, workload management, capacity planning and fault detection. Applications producing big data make the monitoring task very difficult at high sampling frequencies because of high computational and communication overheads in collecting, storing, and managing information. We present an adaptive algorithm for monitoring big data applications that adapts the intervals of sampling and frequency of updates to data characteristics and administrator needs. Adaptivity allows us to limit computational and communication costs and to guarantee high reliability in capturing relevant load changes. Experimental evaluations performed on a large testbed show the ability of the proposed adaptive algorithm to reduce resource utilization and communication overhead of big data monitoring without penalizing the quality of data, and demonstrate our improvements to the state of the art. Real time monitoring of cloud resources is crucial for system management.We propose an adaptive algorithm for scalable and reliable cloud monitoring.Our algorithm dynamically balances amount and quality of monitored time series.We reduce monitoring costs significantly without penalizing data quality.

[1]  Alberto O. Mendelzon,et al.  Efficient Retrieval of Similar Time Sequences Using DFT , 1998, FODO.

[2]  Kenjiro Taura,et al.  Scalable Data Gathering for Real-Time Monitoring Systems on Distributed Computing , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).

[3]  Amit Kumar,et al.  Deterministic wavelet thresholding for maximum-error metrics , 2004, PODS.

[4]  Graham Cormode,et al.  Communication-efficient distributed monitoring of thresholded counts , 2006, SIGMOD Conference.

[5]  Stefania Tosi,et al.  Monitoring Large Cloud-Based Systems , 2014, CLOSER.

[6]  Francesco Lo Presti,et al.  An adaptive model for online detection of relevant state changes in Internet-based systems , 2012, Perform. Evaluation.

[7]  Ambuj K. Singh,et al.  Dimensionality Reduction for Similarity Searching in Dynamic Databases , 1999, Comput. Vis. Image Underst..

[8]  J. Fienup Invariant error metrics for image reconstruction. , 1997, Applied optics.

[9]  Shicong Meng,et al.  Enhanced Monitoring-as-a-Service for Effective Cloud Management , 2013, IEEE Transactions on Computers.

[10]  Michele Colajanni,et al.  Real-time adaptive algorithm for resource monitoring , 2013, Proceedings of the 9th International Conference on Network and Service Management (CNSM 2013).

[11]  Christian Engelmann,et al.  Aggregation of Real-Time System Monitoring Data for Analyzing Large-Scale Parallel and Distributed Computing Environments , 2010, 2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC).

[12]  Yogesh L. Simmhan,et al.  Cloud-Based Software Platform for Big Data Analytics in Smart Grids , 2013, Computing in Science & Engineering.

[13]  Michele Colajanni,et al.  A Scalable Architecture for Real-Time Monitoring of Large Information Systems , 2012, 2012 Second Symposium on Network Cloud Computing and Applications.

[14]  Ulrich Appel,et al.  Adaptive sequential segmentation of piecewise stationary time series , 1983, Inf. Sci..

[15]  Kotagiri Ramamohanarao,et al.  An adaptive algorithm for online time series segmentation with error bound guarantee , 2012, EDBT '12.

[16]  Eamonn J. Keogh,et al.  A Simple Dimensionality Reduction Technique for Fast Similarity Search in Large Time Series Databases , 2000, PAKDD.

[17]  Donald E. Knuth Art of Computer Programming, Volumes 1-4A Boxed Set , 2011 .

[18]  Qiang Wang,et al.  A symbolic representation of time series , 2005, Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005..

[19]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[20]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[21]  Vanish Talwar,et al.  A flexible architecture integrating monitoring and analytics for managing large-scale data centers , 2011, ICAC '11.

[22]  Michele Colajanni,et al.  A Software Architecture for the Analysis of Large Sets of Data Streams in Cloud Infrastructures , 2011, 2011 IEEE 11th International Conference on Computer and Information Technology.

[23]  Clement T. Yu,et al.  Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping , 2003, IEEE Trans. Knowl. Data Eng..

[24]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[25]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[26]  Donald E. Knuth,et al.  The Art of Computer Programming, Volume 4, Fascicle 2: Generating All Tuples and Permutations (Art of Computer Programming) , 2005 .

[27]  Randy H. Katz,et al.  Chukwa: A System for Reliable Large-Scale Log Collection , 2010, LISA.

[28]  A. G. Ramakrishnan,et al.  ECG coding by wavelet-based linear prediction , 1997, IEEE Transactions on Biomedical Engineering.

[29]  Eamonn J. Keogh,et al.  Locally adaptive dimensionality reduction for indexing large time series databases , 2001, SIGMOD '01.