A taxonomy of software architecture-based reliability efforts

Due to the complexity of the current software systems and the diversity of their architectural styles and component models, architecture-based reliability is becoming a more important quality requirement than ever before. Architecture-based reliability efforts depend on the behavior of individual components and their interactions with respect to their influences on the system reliability. Depending on different viewpoints and assumptions, a component takes various definitions and forms. As a result, numerous reliability works that involve varieties of the underlying strategies, objectives, and parameters are proposed for software architectures. Classifying these efforts is important for creating and selecting potential solutions that handle the reliability of software applications. In this paper, we provide a taxonomy of architecture-based reliability efforts. We classify these efforts according to the reliability goals, component abstraction, and level of granularity. We explain the existing techniques considering their assumptions with respect to these classification parameters and provide detailed description about the specific issues and considerations of each class.

[1]  Pascal Felber Lightweight Fault Tolerance in CORBA , 2001, DOA.

[2]  Swapna S. Gokhale,et al.  Analytical Models for Architecture-Based Software Reliability Prediction: A Unification Framework , 2006, IEEE Transactions on Reliability.

[3]  Roger C. Cheung,et al.  A User-Oriented Software Reliability Model , 1978, IEEE Transactions on Software Engineering.

[4]  Mohammad Zulkernine,et al.  Improving Reliability and Safety by Trading off Software Failure Criticalities , 2007 .

[5]  W. W. Everett,et al.  Software component reliability analysis , 1999, Proceedings 1999 IEEE Symposium on Application-Specific Systems and Software Engineering and Technology. ASSET'99 (Cat. No.PR00122).

[6]  Valérie Issarny,et al.  Architecture-based exception handling , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[7]  Yi-Min Wang,et al.  Reliability and availability issues in distributed component object model (DCOM) , 1997, 1997 Fourth International Workshop on Community Networking Processing.

[8]  Josep Freixas,et al.  Identifying Optimal Components in a Reliability System , 2008, IEEE Transactions on Reliability.

[9]  Kenneth P. Birman,et al.  The Maestro Approach to Building Reliable Interoperable Distributed Applications with Multiple Execution Styles , 1998, Theory Pract. Object Syst..

[10]  Priya Narasimhan,et al.  Experiences, strategies, and challenges in building fault-tolerant CORBA systems , 2004, IEEE Transactions on Computers.

[11]  Budi Arief,et al.  On Rigorous Design and Implementation of Fault Tolerant Ambient Systems , 2007, 10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07).

[12]  하수철,et al.  [서평]「Component Software」 - Beyond Object-Oriented Programming - , 2000 .

[13]  Mohammad Zulkernine,et al.  On Failure Propagation in Component-Based Software Systems , 2008, 2008 The Eighth International Conference on Quality Software.

[14]  Meng-Lai Yin,et al.  A design tool for large scale fault-tolerant software systems , 2004, Annual Symposium Reliability and Maintainability, 2004 - RAMS.

[15]  Jean Arlat,et al.  Dependability of CORBA systems: service characterization by fault injection , 2002, 21st IEEE Symposium on Reliable Distributed Systems, 2002. Proceedings..

[16]  Aamer Nadeem,et al.  An Antecedence Graph Approach for Fault Tolerance in a Multi-Agent , 2006, 7th International Conference on Mobile Data Management (MDM'06).

[17]  Mohammad Zulkernine,et al.  At What Level of Granularity Should We be Componentizing for Software Reliability? , 2008, 2008 11th IEEE High Assurance Systems Engineering Symposium.

[18]  Katerina Goseva-Popstojanova,et al.  Large empirical case study of architecture-based software reliability , 2005, 16th IEEE International Symposium on Software Reliability Engineering (ISSRE'05).

[19]  Pierre Sens,et al.  DARX - a framework for the fault-tolerant support of agent software , 2003, 14th International Symposium on Software Reliability Engineering, 2003. ISSRE 2003..

[20]  Jonathan E. Cook,et al.  Highly reliable upgrading of components , 1999, Proceedings of the 1999 International Conference on Software Engineering (IEEE Cat. No.99CB37002).

[21]  Cecília M. F. Rubira,et al.  A Fault-Tolerant Software Architecture for Component-Based Systems , 2002, WADS.

[22]  Roberto Baldoni,et al.  An Interoperable Replication Logic for CORBA systems , 2000, Proceedings DOA'00. International Symposium on Distributed Objects and Applications.

[23]  Joni da Silva Fraga,et al.  An Adaptive Fault-Tolerant Component Model , 2003, 2003 The Ninth IEEE International Workshop on Object-Oriented Real-Time Dependable Systems.

[24]  Rogério de Lemos,et al.  Integrating COTS software components into dependable software architectures , 2003, Sixth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing, 2003..

[25]  Hoang Pham Recent advances in reliability and quality in design , 2008 .

[26]  Laura L. Pullum,et al.  Object-oriented executives and components for fault tolerance , 2001, 2001 IEEE Aerospace Conference Proceedings (Cat. No.01TH8542).

[27]  Aditya P. Mathur,et al.  On the estimation of reliability of a software system using reliabilities of its components , 1997, Proceedings The Eighth International Symposium on Software Reliability Engineering.

[28]  Ivica Crnkovic,et al.  Building Reliable Component-Based Software Systems , 2002 .

[29]  Richard G. Hamlet,et al.  Theory of software reliability based on components , 2001, Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001.

[30]  Michael R. Lyu,et al.  A Survey of Fault Tolerant CORBA Systems , 2007, OTM Conferences.

[31]  Cecília M. F. Rubira,et al.  A fault-tolerant architectural approach for dependable systems , 2006, IEEE Software.

[32]  Jean-Pierre Briot,et al.  On fault tolerance in law-governed multi-agent systems , 2006, SELMAS '06.

[33]  Roy Friedman,et al.  FTS: a high-performance CORBA fault-tolerance service , 2002, Proceedings of the Seventh IEEE International Workshop on Object-Oriented Real-Time Dependable Systems. (WORDS 2002).

[34]  Samir Aknine,et al.  A Predictive Method for Providing Fault Tolerance in Multi-agent Systems , 2006, 2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology.

[35]  Swapna S. Gokhale,et al.  A simulation approach to structure-based software reliability analysis , 2005, IEEE Transactions on Software Engineering.

[36]  Willy Zwaenepoel,et al.  Performance and scalability of EJB applications , 2002, OOPSLA '02.

[37]  Nenad Medvidovic,et al.  Early prediction of software component reliability , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[38]  Katerina Goseva-Popstojanova,et al.  Architecture-based approach to reliability assessment of software systems , 2001, Perform. Evaluation.

[39]  Aniruddha S. Gokhale,et al.  DOORS: towards high-performance fault tolerant CORBA , 2000, Proceedings DOA'00. International Symposium on Distributed Objects and Applications.

[40]  Vincenzo Grassi,et al.  A Modeling Approach to Analyze the Impact of Error Propagation on Reliability of Component-Based Systems , 2007, CBSE.

[41]  Erale De Lausanne The CORBA Object Group Service A Service Approach to Object Groups in CORBA , 1998 .

[42]  Dahlia Malkhi,et al.  Aquarius: A Data-Centric Approach to CORBA Fault-Tolerance , 2003, OTM Workshops.

[43]  Fabien Dagnat,et al.  Dependability in Software Component Deployment , 2007, 2nd International Conference on Dependability of Computer Systems (DepCoS-RELCOMEX '07).

[44]  Alexander Romanovsky,et al.  CAMA: Structured Coordination Space and Exception Propagation Mechanism for Mobile Agents , 2005 .

[45]  Karl-Erwin Großpietsch Optimizing the reliability of component-based n-version approaches , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[46]  Priya Narasimhan,et al.  Transparent fault tolerance for corba , 1999 .

[47]  Bojan Cukic,et al.  Error propagation in the reliability analysis of component based systems , 2005, 16th IEEE International Symposium on Software Reliability Engineering (ISSRE'05).

[48]  Hany H. Ammar,et al.  Scenario-based reliability analysis of component-based software , 1999, Proceedings 10th International Symposium on Software Reliability Engineering (Cat. No.PR00443).

[49]  Cecilia Mary Fischer Rubira,et al.  An Idealized Fault-Tolerant Architectural Component , 2002 .

[50]  Jean Arlat,et al.  Definition and analysis of hardware- and software-fault-tolerant architectures , 1990, Computer.

[51]  Olivier Marin,et al.  DimaX: a fault-tolerant multi-agent platform , 2006, SELMAS '06.

[52]  Drasko M. Sotirovski Towards fault-tolerant software architectures , 2001, Proceedings Working IEEE/IFIP Conference on Software Architecture.

[53]  William H. Sanders,et al.  Proteus: a flexible infrastructure to implement adaptive fault tolerance in AQuA , 1999, Dependable Computing for Critical Applications 7.

[54]  Cecília M. F. Rubira,et al.  An Architectural-Level Exception-Handling System for Component-Based Applications , 2003, LADC.

[55]  Hervé Grall,et al.  A Multi-stage Approach for Reliable Dynamic Reconfigurations of Component-Based Systems , 2008, DAIS.

[56]  Huanguo Zhang,et al.  An efficient fault-tolerant scheme for mobile agent execution , 2006, 2006 1st International Symposium on Systems and Control in Aerospace and Astronautics.

[57]  Alexander Romanovsky Exception handling in component-based system development , 2001, 25th Annual International Computer Software and Applications Conference. COMPSAC 2001.

[58]  Swapna S. Gokhale,et al.  Heuristic Component Placement for Maximizing Software Reliability , 2008 .

[59]  Anish Arora,et al.  Component Based Design of Multitolerant Systems , 1998, IEEE Trans. Software Eng..

[60]  Paul D. Ezhilchelvan,et al.  Design and implemantation of a CORBA fault-tolerant object group service , 1999, DAIS.

[61]  Samir Aknine,et al.  Plan-based replication for fault-tolerant multi-agent systems , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[62]  William H. Sanders,et al.  AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects , 2003, IEEE Trans. Computers.