CloudXplor: a tool for configuration planning in clouds based on empirical data

Configuration planning for modern information systems is a highly challenging task due to the implications of various factors such as the cloud paradigm, multi-bottleneck workloads, and Green IT efforts. Nonetheless, there is currently little or no support to help decision makers find sustainable configurations that are systematically designed according to economic principles (e.g., profit maximization). This paper explicitly addresses this shortcoming and presents a novel approach to configuration planning in clouds based on empirical data. The main contribution of this paper is our unique approach to configuration planning based on an iterative and interactive data refinement process. More concretely, our methodology correlates economic goals with sound technical data to derive intuitive domain insights. We have implemented our methodology as the CloudXplor Tool to provide a proof of concept and exemplify a concrete use case. CloudXplor, which can be modularly embedded in generic resource management frameworks, illustrates the benefits of empirical configuration planning. In general, this paper is a working example on how to navigate large quantities of technical data to provide a solid foundation for economical decisions.

[1]  T. Abdelzaher,et al.  Achieving Scalable Automated Diagnosis of Distributed Systems Performance Problems , 2007 .

[2]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[3]  Toby Velte,et al.  Green IT: Reduce Your Information System's Environmental Impact While Adding to the Bottom Line , 2008 .

[4]  Calton Pu,et al.  Experimental evaluation of N-tier systems: Observation and analysis of multi-bottlenecks , 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).

[5]  Rajkumar Buyya,et al.  Evaluating the cost-benefit of using cloud computing to extend the capacity of clusters , 2009, HPDC '09.

[6]  David J. Lilja,et al.  Measuring computer performance : A practitioner's guide , 2000 .

[7]  Jeffrey S. Chase,et al.  Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[8]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[9]  Christopher Stewart,et al.  Exploiting nonstationarity for performance prediction , 2007, EuroSys '07.

[10]  Marin Litoiu,et al.  A performance analysis method for autonomic computing systems , 2007, TAAS.

[11]  Jörn Altmann,et al.  Taxonomy of Grid Business Models , 2007, GECON.

[12]  Markus Hedwig,et al.  Datacenter Investment Support System (DAISY) , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[13]  Christopher Stewart,et al.  Performance modeling and system management for multi-component online services , 2005, NSDI.

[14]  Dirk Neumann,et al.  Taming Energy Costs of Large Enterprise Systems Through Adaptive Provisioning , 2009, ICIS.

[15]  Marcos K. Aguilera,et al.  Performance debugging for distributed systems of black boxes , 2003, SOSP '03.

[16]  Dirk Neumann,et al.  Situated Decision Support for Managing Service Level Agreement Negotiations , 2008, Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008).

[17]  Greg Schulz The Green and Virtual Data Center , 2009 .

[18]  Calton Pu,et al.  Bottleneck Detection Using Statistical Intervention Analysis , 2007, DSOM.

[19]  Ray Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[20]  Raj Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.