Formulating Criticality-Based Cost-Effective Fault Tolerance Strategies for Multi-Tenant Service-Based Systems

The proliferation of cloud computing has fueled the rapid growth of multi-tenant service-based systems (SBSs), which serve multiple tenants simultaneously by composing existing services in the form of business processes. In a distributed and volatile operating environment, runtime anomalies may occur to the component services of an SBS and cause end-to-end quality violations. Engineering multi-tenant SBSs that can quickly handle runtime anomalies cost effectively has become a significant challenge. Different approaches have been proposed to formulate fault tolerance strategies for engineering SBSs. However, none of the existing approaches has sufficiently considered the service criticality based on multi-tenancy where multiple tenants share the same SBS instance with different multi-dimensional quality preferences. In this paper, we propose Criticality-based Fault Tolerance for Multi-Tenant SBSs (CFT4MTS), a novel approach that formulates cost-effective fault tolerance strategies for multi-tenant SBSs by providing redundancy for the critical component services. First, the criticality of each component service is evaluated based on its multi-dimensional quality and multiple tenants sharing the component service with differentiated quality preferences. Then, the fault tolerance problem is modelled as an Integer Programming problem to identify the optimal fault tolerance strategy. The experimental results show that, compared with three existing representative approaches, CFT4MTS can alleviate degradation in the quality of multi-tenant SBSs in a much more effective and efficient way.

[1]  Eyhab Al-Masri,et al.  Discovering the best web service , 2007, WWW '07.

[2]  David F. McAllister,et al.  An Experimental Evaluation of Software Redundancy as a Strategy For Improving Reliability , 1991, IEEE Trans. Software Eng..

[3]  Algirdas Avizienis,et al.  The N-Version Approach to Fault-Tolerant Software , 1985, IEEE Transactions on Software Engineering.

[4]  Rami Bahsoon,et al.  A decentralized self-adaptation mechanism for service-based applications in the cloud , 2013, IEEE Transactions on Software Engineering.

[5]  Ian Lumb,et al.  A Taxonomy and Survey of Cloud Computing Systems , 2009, 2009 Fifth International Joint Conference on INC, IMS and IDC.

[6]  Hai Jin,et al.  QoS-Aware Service Selection for Customisable Multi-tenant Service-Based Systems: Maturity and Approaches , 2015, 2015 IEEE 8th International Conference on Cloud Computing.

[7]  Piero A. Bonatti,et al.  On optimal service selection , 2005, WWW '05.

[8]  Hai Jin,et al.  Formulating Cost-Effective Monitoring Strategies for Service-Based Systems , 2014, IEEE Transactions on Software Engineering.

[9]  David W. Coit,et al.  Component Reliability Criticality or Importance Measures for Systems With Degrading Components , 2012, IEEE Transactions on Reliability.

[10]  Anne H. H. Ngu,et al.  QoS-aware middleware for Web services composition , 2004, IEEE Transactions on Software Engineering.

[11]  Vyacheslav S. Kharchenko,et al.  Using Inherent Service Redundancy and Diversity to Ensure Web Services Dependability , 2009, Methods, Models and Tools for Fault Tolerance.

[12]  Josep Freixas,et al.  Identifying Optimal Components in a Reliability System , 2008, IEEE Transactions on Reliability.

[13]  Qiang He,et al.  QoS-Aware Service Recommendation for Multi-tenant SaaS on the Cloud , 2015, 2015 IEEE International Conference on Services Computing.

[14]  Nancy G. Leveson,et al.  An Empirical Comparison of Software Fault Tolerance and Fault Elimination , 1991, IEEE Trans. Software Eng..

[15]  Mark Harman,et al.  Exact scalable sensitivity analysis for the next release problem , 2014, ACM Trans. Softw. Eng. Methodol..

[16]  David S. Rosenblum,et al.  Sensitivity analysis for a scenario-based reliability prediction model , 2005, WADS@ICSE.

[17]  Tao Yu,et al.  Efficient algorithms for Web services selection with end-to-end QoS constraints , 2007, TWEB.

[18]  Zibin Zheng,et al.  Component Ranking for Fault-Tolerant Cloud Applications , 2012, IEEE Transactions on Services Computing.

[19]  Ladan Tahvildari,et al.  Self-adaptive software: Landscape and research challenges , 2009, TAAS.

[20]  Zibin Zheng,et al.  A Distributed Replication Strategy Evaluation and Selection Framework for Fault Tolerant Web Services , 2008, 2008 IEEE International Conference on Web Services.

[21]  Giordano Tamburrelli,et al.  QoS-Aware Adaptive Service Orchestrations , 2012, 2012 IEEE 19th International Conference on Web Services.

[22]  Saltelli Andrea,et al.  Global Sensitivity Analysis: The Primer , 2008 .

[23]  Milan Zeleny,et al.  Multiple Criteria Decision Making , 1973 .

[24]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[25]  Hai Jin,et al.  Probabilistic Critical Path Identification for Cost-Effective Monitoring of Service-Based Systems , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[26]  Danilo Ardagna,et al.  Adaptive Service Composition in Flexible Processes , 2007, IEEE Transactions on Software Engineering.

[27]  Eduard Ayguadé,et al.  Non-intrusive Estimation of QoS Degradation Impact on E-Commerce User Satisfaction , 2011, 2011 IEEE 10th International Symposium on Network Computing and Applications.

[28]  Luciano Baresi,et al.  Self-Supervising BPEL Processes , 2011, IEEE Transactions on Software Engineering.

[29]  Thomas Risse,et al.  Combining global optimization with local selection for efficient QoS-aware service composition , 2009, WWW '09.

[30]  Thomas Risse,et al.  Selecting skyline services for QoS-based web service composition , 2010, WWW '10.

[31]  Hai Jin,et al.  Quality-Aware Service Selection for Service-Based Systems Based on Iterative Multi-Attribute Combinatorial Auction , 2014, IEEE Transactions on Software Engineering.

[32]  Zibin Zheng,et al.  Selecting an Optimal Fault Tolerance Strategy for Reliable Service-Oriented Systems with Local and Global Constraints , 2015, IEEE Transactions on Computers.

[33]  F. C. Meng Comparing the importance of system components by some structural characteristics , 1996, IEEE Trans. Reliab..

[34]  Paul T. Boggs,et al.  Sequential Quadratic Programming , 1995, Acta Numerica.

[35]  Vincenzo Grassi,et al.  MOSES: A Framework for QoS Driven Runtime Adaptation of Service-Oriented Systems , 2012, IEEE Transactions on Software Engineering.

[36]  Bradley R. Schmerl,et al.  Proactive self-adaptation under uncertainty: a probabilistic model checking approach , 2015, ESEC/SIGSOFT FSE.

[37]  Robbert van Renesse,et al.  Adding high availability and autonomic behavior to Web services , 2004, Proceedings. 26th International Conference on Software Engineering.

[38]  Brian Randell,et al.  System structure for software fault tolerance , 1975, IEEE Transactions on Software Engineering.

[39]  Louise E. Moser,et al.  Fault Tolerance Middleware for Cloud Computing , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[40]  Radu Calinescu,et al.  Dynamic QoS Management and Optimization in Service-Based Systems , 2011, IEEE Transactions on Software Engineering.

[41]  Hai Jin,et al.  QoS-Driven Service Selection for Multi-tenant SaaS , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[42]  Carlo Ghezzi,et al.  Self-adaptive software needs quantitative verification at runtime , 2012, CACM.