Elastic monitoring framework for cloud infrastructures

This study presents a scalable and elastic distributed system for monitoring cloud infrastructure based on a pure peer-to-peer architecture. Its distributed nature enables deploying long-living queries across the network to monitor a diverse set of entities and metrics, spanning across all layers of a cloud stack that can change rapidly. This allows for aggregating low-level metrics from operating systems, to higher-level application-specific metrics derived from services, databases or application log files. The observed metrics and information can be evaluated and used to reliably trigger policies to automate complex management tasks within a cloud environment. The architecture incorporates a query framework for obtaining high-level information and a policy framework to provide self-management capabilities to monitored cloud infrastructure. The system has been implemented as a proof of concept. Details and statistical results are provided to validate the scalability of the underlying architecture.

[1]  Iosif Legrand,et al.  MonALISA : A Distributed Monitoring Service Architecture , 2003, ArXiv.

[2]  Chin-Feng Lai,et al.  A personalized mobile IPTV system with seamless video reconstruction algorithm in cloud networks , 2011, Int. J. Commun. Syst..

[3]  Jose M. Alcaraz Calero,et al.  Towards an architecture for deploying elastic services in the cloud , 2012, Softw. Pract. Exp..

[4]  Wolfgang Barth,et al.  Nagios: System and Network Monitoring , 2006 .

[5]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[6]  Kuo-Qin Yan,et al.  Reaching Fast Agreement in a Generalized Cloud Computing Environment , 2010 .

[7]  Rizos Sakellariou,et al.  A taxonomy of grid monitoring systems , 2005, Future Gener. Comput. Syst..

[8]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[9]  Chin-Feng Lai,et al.  CPRS: A cloud-based program recommendation system for digital TV platforms , 2010, Future Gener. Comput. Syst..

[10]  D. Bonacorsi,et al.  Scalability tests of R-GMA-based grid job monitoring system for CMS Monte Carlo data production , 2004, IEEE Transactions on Nuclear Science.

[11]  Luca Cardelli,et al.  Abstract Types and the Dot Notation , 1990, Programming Concepts and Methods.

[12]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[13]  Judith Hurwitz,et al.  Cloud Computing for Dummies , 2009 .

[14]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[15]  Jose M. Alcaraz Calero,et al.  Toward an architecture for the automated provisioning of cloud services , 2010, IEEE Commun. Mag..

[16]  Weimin Zheng,et al.  Online Video Playing on Smartphones: A Context-Aware Approach Based on Cloud Computing , 2010 .

[17]  James Turnbull Pulling Strings With Puppet , 2007 .

[18]  Liang Zhou,et al.  Multimedia traffic security architecture for the internet of things , 2011, IEEE Network.

[19]  Carl Kesselman,et al.  Monitoring the grid with the Globus Toolkit MDS4 , 2006 .

[20]  Alistair N. Coles,et al.  The SmartFrog configuration management framework , 2009, OPSR.