An approach to modelling and analysis of coordinated atomic actions

Coordinated atomic (CA) action is a widely used approach to structuring concurrent activities in distributed embedded applications. Recovery from failures is one of its major concerns. Despite the intuitive simplicity, reasoning about failure recovery in CA actions is rather complex because of the interaction between various system components. This paper proposes an approach based on timed CSP for modelling CA actions and reasoning about different forms of interaction between system components, addressing issues such as failure modes, means of fault-tolerance and fail-safe mechanisms

[1]  Steve A. Schneider,et al.  Design and Verification of Distributed Recovery Blocks with CSP , 2003, Formal Methods Syst. Des..

[2]  Willem P. de Roever,et al.  Specifying fault tolerance within Stark's formalism , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[3]  Avelino Francisco Zorzo,et al.  Rigorous development of a safety-critical system based on coordinated atomic actions , 1999, Digest of Papers. Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing (Cat. No.99CB36352).

[4]  Mathai Joseph,et al.  Specification and verification of fault-tolerance, timing, and scheduling , 1999, TOPL.

[5]  Brian Randell,et al.  Error recovery in asynchronous systems , 1986, IEEE Transactions on Software Engineering.

[6]  C. A. R. Hoare,et al.  Communicating sequential processes , 1978, CACM.

[7]  Maciej Koutny,et al.  The ERT Model of Fault-Tolerant Computing and Its Application to a Formalisation of Coordinated Atomic Actions , 1998 .

[8]  Jan Peleska,et al.  Design and verification of fault tolerant systems with CSP , 1991, Distributed Computing.

[9]  Luca Simoncini,et al.  Formally Verifying Fault Tolerant System Designs , 2000, Comput. J..

[10]  Steve A. Schneider,et al.  Concurrent and Real-time Systems: The CSP Approach , 1999 .

[11]  Hermann Kopetz,et al.  Real-time systems , 2018, CSC '73.

[12]  Rogério de Lemos,et al.  Exception handling in a cooperative object-oriented approach , 1999, Proceedings 2nd IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC'99) (Cat. No.99-61702).

[13]  Simeon Veloudis A formal approach to modelling real-time critical systems , 2004 .

[14]  Steven A. Schneider,et al.  Correctness and communication in real-time systems (tcsp) , 1989 .

[15]  Jozef Hooman,et al.  A Trace-Based Compositional Proof Theory for Fault Tolerant Distributed Systems , 1994, Theor. Comput. Sci..

[16]  Mathai Joseph,et al.  Transformation of programs for fault-tolerance , 2005, Formal Aspects of Computing.

[17]  Brian Randell,et al.  Coordinated Atomic Actions: Formal Model, Case Study and System Implementation , 1998 .