Dependability Differentiation in Cloud Services

As cloud computing is becoming more mature and pervasive, almost all types of services are being deployed in clouds. This has also widened the spectrum of cloud users which encompasses from domestic users to large companies. One of the main concerns of large companies outsourcing their IT functions to clouds is the availability of their functions. On the other hand, availability requirements for domestic users are not very strict. This requires the cloud service providers to guarantee different dependability levels for different users and services. This thesis is based upon this requirement of dependability differentiation of cloud services depending upon the nature of services and target users. In this thesis, different types of services are identified and grouped together both according to their deployment nature and their target users. Also a range of techniques for guaranteeing dependability in the cloud environment are identified and classified. In order to quantify dependability provided by different techniques, a cloud system is modeled. Two different levels of dependability differentiation are considered, namely; differentiation depending upon the state of standby replica and differentiation depending upon the spatial separation of active and standby replicas. These two levels are separately modeled by using Markov state diagrams and reliability block diagrams respectively. Due to the limitations imposed by Markov models, the former differentiation level is also studied by using a simulation. Finally, numerical analysis is conducted and different techniques are compared. Also the best technique for each user and service class is identified depending upon the results obtained. The most crucial components for guaranteeing dependability in cloud environment are also identified. This will direct the future prospects of study and also provide an idea to cloud service providers about the cloud components that are worth investing in, for enhancing service availability.

[1]  John H. Seader,et al.  Tier Classifications Define Site Infrastructure Performance , 2006 .

[2]  Akshat Verma,et al.  pMapper: Power and Migration Cost Aware Application Placement in Virtualized Systems , 2008, Middleware.

[3]  Graham Birtwistle A system for discrete event modelling on SIMULA , 1979 .

[4]  Surajit Chaudhuri,et al.  Proceedings of the 11th ACM Symposium on Cloud Computing , 2010 .

[5]  Bjarne E. Helvik,et al.  A survey of resilience differentiation frameworks in communication networks , 2007, IEEE Communications Surveys & Tutorials.

[6]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[7]  Erik Elmroth,et al.  Interfaces for Placement, Migration, and Monitoring of Virtual Machines in Federated Clouds , 2009, 2009 Eighth International Conference on Grid and Cooperative Computing.

[8]  Moshe Zukerman,et al.  Introduction to Queueing Theory and Stochastic Teletraffic Models , 2013, ArXiv.

[9]  W. Stewart,et al.  Supporting Differentiated Service Classes in Large IP Networks , 1997 .

[10]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[11]  Karl Aberer,et al.  Dynamic cost-efficient replication in data clouds , 2009, ACDC '09.

[12]  Hein Meling,et al.  Ant system for service deployment in private and public clouds , 2010, BADS '10.

[13]  Thomas Sandholm,et al.  What's inside the Cloud? An architectural map of the Cloud landscape , 2009, 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing.

[14]  P. Heegaard Dependability Differentiation in Cloud Services , 2011 .

[15]  Rajkumar Buyya,et al.  Modeling and simulation of scalable Cloud computing environments and the CloudSim toolkit: Challenges and opportunities , 2009, 2009 International Conference on High Performance Computing & Simulation.

[16]  Karl Aberer,et al.  Cost-efficient and differentiated data availability guarantees in data clouds , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[17]  Archana Ganapathi,et al.  Why Do Internet Services Fail, and What Can Be Done About It? , 2002, USENIX Symposium on Internet Technologies and Systems.

[18]  W. H I T E P A P,et al.  Protecting Mission-Critical Workloads with VMware Fault Tolerance , 2009 .

[19]  Steve Keckler,et al.  Proceedings of the 36th annual international symposium on Computer architecture , 2009, ISCA 2009.

[20]  Antti Toskala,et al.  WCDMA for UMTS: HSPA Evolution and LTE , 2010 .

[21]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[22]  L. Youseff,et al.  Toward a Unified Ontology of Cloud Computing , 2008, 2008 Grid Computing Environments Workshop.

[23]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[24]  Kishor S. Trivedi,et al.  Fighting bugs: remove, retry, replicate, and rejuvenate , 2007, Computer.

[25]  Graham M. Birtwistle,et al.  DEMOS A System for Discrete Event Modelling on Simula , 1979, Springer New York.

[26]  Krishna Kant,et al.  Data center evolution: A tutorial on state of the art, issues, and challenges , 2009, Comput. Networks.

[27]  Bjarne E. Helvik,et al.  Application of the RESTART/Splitting technique to network resilience studies in NS2 , 2008 .

[28]  Pablo Rodriguez,et al.  Proceedings of the ACM SIGCOMM 2009 conference on Data communication , 2009, SIGCOMM 2009.

[29]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[30]  Eduardo Pinheiro,et al.  Failure Trends in a Large Disk Drive Population , 2007, FAST.

[31]  Michael Dahlin,et al.  End-to-end WAN service availability , 2001, TNET.

[32]  Karl Aberer,et al.  A self-organized, fault-tolerant and scalable replication scheme for cloud storage , 2010, SoCC '10.

[33]  Albert G. Greenberg,et al.  VL2: a scalable and flexible data center network , 2009, SIGCOMM '09.

[34]  Chris Rose,et al.  A Break in the Clouds: Towards a Cloud Definition , 2011 .

[35]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[36]  A. Moorsel Metrics for the Internet Age: Quality of Experience and Quality of Business , 2001 .

[37]  Hai Jin,et al.  Live migration of virtual machine based on full system trace and replay , 2009, HPDC '09.

[38]  Daniel A. Menascé Performance and availability of Internet data centers , 2004, IEEE Internet Computing.

[39]  Kashi Venkatesh Vishwanath,et al.  Characterizing cloud computing hardware reliability , 2010, SoCC '10.

[40]  Arun Venkataramani,et al.  Black-box and Gray-box Strategies for Virtual Machine Migration , 2007, NSDI.

[41]  Jeffrey S. Chase,et al.  Proceedings of the 1st workshop on Automated control for datacenters and clouds , 2009, ICAC 2009.

[42]  Eugene Ciurana,et al.  Google App Engine , 2009 .

[43]  Borja Sotomayor,et al.  Virtual Infrastructure Management in Private and Hybrid Clouds , 2009, IEEE Internet Computing.

[44]  Luis Rodero-Merino,et al.  A break in the clouds: towards a cloud definition , 2008, CCRV.

[45]  William Hunt What Is Google Apps for Business , 2013 .

[46]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[47]  Jin B. Hong,et al.  Availability Modeling and Analysis of a Virtualized System , 2009, 2009 15th IEEE Pacific Rim International Symposium on Dependable Computing.