Bottleneck Detection in Cloud Computing Performance and Dependability: Sensitivity Rankings for Hierarchical Models

Cloud computing became widespread on IT industry, saving costs of acquisition and maintenance for companies of all sizes, and enabling fair management of resources according to the demand. Stochastic models can enable performance and dependability evaluation of cloud computing systems efficiently, what is needed for proper capacity planning. Distinct models may be combined in a hierarchy to address the huge number of components and levels of interaction among the system parts. Identification of bottlenecks in such composite models might be hard yet, due to the huge amount of input factors and variables which may interfere with the results. This paper proposes a method for bottleneck detection of computational systems represented with hierarchical models, that is remarkably applied in cloud computing systems. This is achieved through the composition of indices computed from lower level models in equations and solution methods of the top level model, for computing the sensitivity indices of all parameters with respect to a global system measure. A unified sensitivity ranking, comprising the composite indices, indicates the parameters with highest impact on output metrics. A case study supports the demonstration of accuracy and utility of our methodology. The study addresses a web service running on a private cloud with auto scaling mechanisms. The methods and algorithms presented here are helpful for decision-making when designing and managing cloud computing infrastructures, regarding incremental and architectural improvements.

[1]  S. Wittevrongel,et al.  Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[2]  William S. Griffith,et al.  Optimal Reliability Modeling: Principles and Applications , 2004, Technometrics.

[3]  Maria Clara Bezerra,et al.  Sensitivity Analysis Techniques Applied in Video Streaming Service on Eucalyptus Cloud Environments , 2018 .

[4]  Kishor S. Trivedi,et al.  Power-hierarchy of dependability-model types , 1994 .

[5]  Paulo Romero Martins Maciel,et al.  QoS-driven optimisation of composite web services: an approach based on GRASP and analytical models , 2013, Int. J. Web Grid Serv..

[6]  A. Kolmogoroff Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung , 1931 .

[7]  Marco Ajmone Marsan,et al.  A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems , 1984, TOCS.

[8]  James J. Filliben,et al.  An Efficient Sensitivity Analysis Method for Large Cloud Simulations , 2011, IEEE CLOUD.

[9]  Jamilson Dantas,et al.  Mercury: An Integrated Environment for Performance and Dependability Evaluation of General Systems , 2015 .

[10]  Rubens de Souza Matos Júnior Identification of Availability and Performance Bottlenecks in Cloud Computing Systems: an approach based on hierarchical models and sensitivity analysis , 2016 .

[11]  Reinhard German,et al.  Performance analysis of communication systems - modelling with non-Markovian stochastic Petri nets , 2000, Wiley-Interscience series in systems and optimization.

[12]  J.R. Watson,et al.  Applying generalized stochastic Petri nets to manufacturing systems containing nonexponential transition functions , 1991, IEEE Trans. Syst. Man Cybern..

[13]  Monika Saini,et al.  Reliability, maintainability and sensitivity analysis of physical processing unit of sewage treatment plant , 2019, SN Applied Sciences.

[14]  Kishor S. Trivedi,et al.  GSPM models: sensitivity analysis and applications , 1990, ACM-SE 28.

[15]  Michael K. Molloy Performance Analysis Using Stochastic Petri Nets , 1982, IEEE Transactions on Computers.

[16]  Ricardo J. Rodríguez,et al.  Model-based sensitivity analysis of IaaS cloud availability , 2018, Future Gener. Comput. Syst..

[17]  Jong Sou Park,et al.  Modeling and Analysis of Cloud Computing Availability Based on Eucalyptus Platform for E-Government Data Center , 2011, 2011 Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing.

[18]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[19]  P. Maciel,et al.  Models for Dependability Analysis of Cloud Computing Architectures for Eucalyptus Platform , 2013 .

[20]  M. Zuo,et al.  Optimal Reliability Modeling: Principles and Applications , 2002 .

[21]  Gustavo Rau de Almeida Callou,et al.  Estimating sustainability impact of high dependable data centers: a comparative study between Brazilian and US energy mixes , 2013, Computing.

[22]  P. O'Connor,et al.  Practical Reliability Engineering , 1981 .

[23]  Joost-Pieter Katoen,et al.  Model checking Markov reward models with impulse rewards , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[24]  Tadao Murata,et al.  Petri nets: Properties, analysis and applications , 1989, Proc. IEEE.

[25]  Dong Seong Kim,et al.  Sensitivity Analysis of Server Virtualized System Availability , 2012, IEEE Transactions on Reliability.

[26]  Kishor S. Trivedi,et al.  Redundant Eucalyptus Private Clouds: Availability Modeling and Sensitivity Analysis , 2017, Journal of Grid Computing.

[27]  Rajkumar Buyya,et al.  Cost of Virtual Machine Live Migration in Clouds: A Performance Evaluation , 2009, CloudCom.

[28]  Reinhard German,et al.  Transient Analysis of Deterministic and Stochastic Petri Nets with TimeNET , 1995, MMB.

[29]  Kishor S. Trivedi,et al.  GSPN Models: Sensitivity Analysis and Applications , 1990 .

[30]  Kishor S. Trivedi,et al.  Sensitivity analysis of reliability and performability measures for multiprocessor systems , 1988, SIGMETRICS '88.

[31]  Kishor S. Trivedi,et al.  Sensitivity analysis of a hierarchical model of mobile cloud computing , 2015, Simul. Model. Pract. Theory.

[32]  Virgílio A. F. Almeida,et al.  Performance by Design - Computer Capacity Planning By Example , 2004 .

[33]  Myron Hlynka,et al.  Queueing Networks and Markov Chains (Modeling and Performance Evaluation With Computer Science Applications) , 2007, Technometrics.

[34]  Jinlei Qin,et al.  Reliability and Sensitivity Analysis Method for a Multistate System with Common Cause Failure , 2019, Complex..

[35]  Kishor S. Trivedi Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .

[36]  Swapna S. Gokhale,et al.  Reliability prediction and sensitivity analysis based on software architecture , 2002, 13th International Symposium on Software Reliability Engineering, 2002. Proceedings..

[37]  Daniel R. Eno Practical Reliability Engineering, 4th Ed. , 2003 .

[38]  Raj Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[39]  Paulo Romero Martins Maciel,et al.  Performance Evaluation of Virtual Machines Instantiation in a Private Cloud , 2015, 2015 IEEE World Congress on Services.

[40]  Kishor S. Trivedi,et al.  Modeling and performance analysis of large scale IaaS Clouds , 2013, Future Gener. Comput. Syst..

[41]  Kishor S. Trivedi,et al.  A scalable availability model for Infrastructure-as-a-Service cloud , 2011, 2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN).

[42]  Jamilson Dantas,et al.  Eucalyptus-based private clouds: availability modeling and comparison to the cost of a public cloud , 2015, Computing.

[43]  Jamilson Dantas,et al.  An availability model for eucalyptus platform: An analysis of warm-standy replication mechanism , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[44]  Gunter Bolch,et al.  Queueing Networks and Markov Chains - Modeling and Performance Evaluation with Computer Science Applications, Second Edition , 1998 .

[45]  Sara Bouchenak,et al.  Performance, Availability and Cost of Self-Adaptive Internet Services Chapter of Performance and Dependability in Service Computing: Concepts, Techniques and Research Directions , 2011 .

[46]  Chuang Lin,et al.  Dependability Modeling and Analysis for the Virtual Data Center of Cloud Computing , 2011, 2011 IEEE International Conference on High Performance Computing and Communications.

[47]  Tuan Anh Nguyen,et al.  Reliability and Availability Evaluation for Cloud Data Center Networks Using Hierarchical Models , 2019, IEEE Access.