Cloud Workload Characterization

Abstract This paper describes various computer system workloads and relates them to their underlying resource utilization. Specifically, the paper concentrates on Cloud workload characterization based upon issues, capabilities, and technologies surrounding the categories from the multiple points of view of the various players involved in Cloud Computing. The relationship is established between the categories and key limiting underlying technologies, and the dynamic and measurable low-level metrics and measurements that are used to detect and reduce resource contention, and identify category changes during run-time. Research questions are posed on dynamic low-level measurements and a usage case example with high performance computing (HPC) clusters. The Cloud workload categories can provide a basis for common communication for various viewpoints from players, including facility managers, Cloud IT or service providers, Cloud users, consumers, IT managers, and hardware vendors. This common communication tool will facilitate better service-level agreements (SLAs), capital purchase decisions, and future computer architecture design-decisions.

[1]  Alexandra Fedorova,et al.  Managing Contention for Shared Resources on Multicore Processors , 2010 .

[2]  Wenguang Chen,et al.  Cloud versus in-house cluster: Evaluating Amazon cluster compute instances for running MPI applications , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[3]  Jano I. van Hemert,et al.  Resource management of enterprise cloud systems using layered queuing and historical performance models , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).

[4]  Giuseppe Serazzi,et al.  A Characterization of the Variation in Time of Workload Arrival Patterns , 1985, IEEE Transactions on Computers.

[5]  Gabriel H. Loh,et al.  Dynamic Classification of Program Memory Behaviors in CMPs , 2008 .

[6]  Chita R. Das,et al.  Characterizing Network Traffic in a Cluster-based, Multi-tier Data Center , 2007, 27th International Conference on Distributed Computing Systems (ICDCS '07).

[7]  Bruce Schneier,et al.  Schneier on security , 2008 .

[8]  Philip S. Yu,et al.  On Workload Characterization of Relational Database Environments , 1992, IEEE Trans. Software Eng..

[9]  Xifeng Yan,et al.  Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.

[10]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[11]  Alec Wolman,et al.  Measurement and Analysis of a Streaming Media Workload , 2001, USITS.

[12]  Richard B. Bunt,et al.  A synthetic workload model for a distributed system file server , 1991, SIGMETRICS '91.

[13]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[14]  Preston M. Smith,et al.  Cost-Effective HPC: The Community or the Cloud? , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[15]  Danilo Ardagna,et al.  SLA based resource allocation policies in autonomic environments , 2007, J. Parallel Distributed Comput..

[16]  Sadaf R. Alam,et al.  Characterization of Scientific Workloads on Systems with Multi-Core Processors , 2006, 2006 IEEE International Symposium on Workload Characterization.

[17]  P. Mell,et al.  SP 800-145. The NIST Definition of Cloud Computing , 2011 .

[18]  Lothar Thiele,et al.  Workload characterization model for tasks with variable execution demand , 2004, Proceedings Design, Automation and Test in Europe Conference and Exhibition.

[19]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[20]  Schahram Dustdar,et al.  Low level Metrics to High level SLAs - LoM2HiS framework: Bridging the gap between monitored metrics and SLA parameters in cloud environments , 2010, 2010 International Conference on High Performance Computing & Simulation.

[21]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[22]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[23]  Calton Pu,et al.  An Analysis of Performance Interference Effects in Virtual Environments , 2007, 2007 IEEE International Symposium on Performance Analysis of Systems & Software.

[24]  Francisco J. Cazorla,et al.  Multicore Resource Management , 2008, IEEE Micro.

[25]  M. R. Chapman,et al.  In Search of Stupidity: Over Twenty Years of High Tech Marketing Disasters, Second Edition , 2006 .

[26]  Kai Li,et al.  The PARSEC benchmark suite: Characterization and architectural implications , 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).

[27]  John Shalf,et al.  Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[28]  Michael D. Smith,et al.  Improving Performance Isolation on Chip Multiprocessors via an Operating System Scheduler , 2007, 16th International Conference on Parallel Architecture and Compilation Techniques (PACT 2007).

[29]  Constantinos Evangelinos,et al.  Cloud Computing for parallel Scientific HPC Applications: Feasibility of Running Coupled Atmosphere- , 2008 .

[30]  Tong Li,et al.  Using OS Observations to Improve Performance in Multicore Systems , 2008, IEEE Micro.

[31]  Raouf Boutaba,et al.  Characterizing Task Usage Shapes in Google Compute Clusters , 2011 .