Challenges in Fault-Tolerant Distributed Runtime Verification

Runtime Verification is a lightweight method for monitoring the formal specification of a system (usually in some form of temporal logics) at execution time. In a setting, where a set of distributed monitors have only a partial view of a large system and may be subject to different types of faults, the literature of runtime verification falls short in answering many fundamental questions. Examples include techniques to reason about the soundness and consistency of the collective set of verdicts computed by the set of distributed monitors. In this paper, we discuss open research problems on fault-tolerant distributed monitoring that stem from different design choices and implementation platforms.

[1]  Zohar Manna,et al.  Temporal Verification of Reactive Systems , 1995, Springer New York.

[2]  Yliès Falcone,et al.  Decentralised LTL monitoring , 2016, Formal Methods Syst. Des..

[3]  Maurice Herlihy,et al.  Distributed Computing Through Combinatorial Topology , 2013 .

[4]  Pierre Fraigniaud,et al.  Locality and Checkability in Wait-Free Computing , 2011, DISC.

[5]  Pierre Fraigniaud,et al.  On the Number of Opinions Needed for Fault-Tolerant Run-Time Monitoring in Distributed Systems , 2014, RV.

[6]  Hagit Attiya,et al.  Distributed Computing: Fundamentals, Simulations and Advanced Topics , 1998 .

[7]  Zohar Manna,et al.  Temporal verification of reactive systems - safety , 1995 .

[8]  Mennatallah Hasabelnaby,et al.  Decentralized Runtime Verification of LTL Specifications in Distributed Systems , 2016 .

[9]  Koushik Sen,et al.  Decentralized runtime analysis of multithreaded applications , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[10]  Maurice Herlihy,et al.  Wait-free synchronization , 1991, TOPL.

[11]  Martin Leucker,et al.  Runtime Verification for LTL and TLTL , 2011, TSEM.

[12]  Martin Leucker,et al.  Comparing LTL Semantics for Runtime Verification , 2010, J. Log. Comput..

[13]  Yliès Falcone,et al.  You Should Better Enforce Than Verify , 2010, RV.

[14]  Pierre Fraigniaud,et al.  Decentralized Asynchronous Crash-Resilient Runtime Verification , 2016, CONCUR.

[15]  Yliès Falcone,et al.  Efficient and Generalized Decentralized Monitoring of Regular Languages , 2014, FORTE.

[16]  Yoram Moses,et al.  A Layered Analysis of Consensus , 2002, SIAM J. Comput..

[17]  Pierre Fraigniaud,et al.  The Opinion Number of Set-Agreement , 2014, OPODIS.

[18]  Sebastian Fischmeister,et al.  Runtime verification with minimal intrusion through parallelism , 2015, Formal Methods Syst. Des..