Concurrency in Dependable Computing

Concurrency in Dependable Computing focuses on concurrency related issues in the area of dependable computing. Failures of system components, be hardware units or software modules, can be viewed as undesirable events occurring concurrently with a set of normal system events. Achieving dependability therefore is closely related to, and also benefits from, concurrency theory and formalisms. This beneficial relationship appears to manifest into three strands of work. Application level structuring of concurrent activities. Concepts such as atomic actions, conversations, exception handling, view synchrony, etc., are useful in structuring concurrent activities so as to facilitate attempts at coping with the effects of component failures. Replication induced concurrency management. Replication is a widely used technique for achieving reliability. Replica management essentially involves ensuring that replicas perceive concurrent events identically. Application of concurrency formalisms for dependability assurance. Fault-tolerant algorithms are harder to verify than their fault-free counterparts due to the fact that the impact of component faults at each state need to be considered in addition to valid state transitions. CSP, Petri nets, CCS are useful tools to specify and verify fault-tolerant designs and protocols. Concurrency in Dependable Computing explores many significant issues in all three strands. To this end, it is composed as a collection of papers written by authors well-known in their respective areas of research. To ensure quality, the papers are reviewed by a panel of at least three experts in the relevant area.

[1]  Sally C. Johnson Reliability analysis of large, complex systems using ASSIST , 1988 .

[2]  Rick Kazman,et al.  Toward a discipline of scenario‐based architectural engineering , 2000, Ann. Softw. Eng..

[3]  Jean Arlat,et al.  Definition and analysis of hardware- and software-fault-tolerant architectures , 1990, Computer.

[4]  Gustavo Alonso,et al.  Scalable Replication in Database Clusters , 2000, DISC.

[5]  J-C. Laprie,et al.  DEPENDABLE COMPUTING AND FAULT TOLERANCE : CONCEPTS AND TERMINOLOGY , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing, 1995, ' Highlights from Twenty-Five Years'..

[6]  Hisashi Kobayashi,et al.  Modeling and analysis , 1978 .

[7]  Edgar Nett,et al.  How to commit concurrent, non-isolated computations , 1995, Proceedings of the Fifth IEEE Computer Society Workshop on Future Trends of Distributed Computing Systems.

[8]  Mark Klein,et al.  Attribute-Based Architectural Styles , 1999 .

[9]  Flaviu Cristian,et al.  The Timed Asynchronous Distributed System Model , 1999, IEEE Trans. Parallel Distributed Syst..

[10]  Naranker Dulay,et al.  Specifying Distributed Software Architectures , 1995, ESEC.

[11]  Flaviu Cristian,et al.  A Highly Available Local Leader Election Service , 1999, IEEE Trans. Software Eng..

[12]  Edgar Nett,et al.  Nested dynamic actions: how to solve the fault containment problem in a cooperative action model , 1994, Proceedings of IEEE 13th Symposium on Reliable Distributed Systems.

[13]  Fred B. Schneider,et al.  The primary-backup approach , 1993 .

[14]  Angel Alvarez,et al.  Drago: An Ada Extension to Program Fault-Tolerant Distributed Applications , 1996, Ada-Europe.