Title of the Deliverable: Service Level Resilience Solutions for the Infrastructure Domain Version Information Table of Contents 3 Probabilistic Fault Detection Using Cross-layer Observations .............. 12 Contents of the Deliverable and Relation to Other Work Packages

The main objectives of WP2 are to define a resilient architecture and to develop a range of middleware solutions (i.e. algorithms, protocols, services) to address resilience requirements in the design of highly available, reliable and trustworthy distributed solutions. This deliverable presents research results concerning the development of middleware services within the HIDENETS environment, the objective of which is to facilitate the construction of resilient, dependable car2car applications operating in ad-hoc environments in cooperation with infrastructure based services. The deliverable complements the work presented in deliverable D2.3 (Service level resilience solutions for the ad-hoc domain), focusing on ideas that typically reflect improvements of the services presented in D2.3. These improvements can be achieved due to the possibility of operating over infrastructure environments, in addition to the exclusive operation over ad-hoc environments. The deliverable provides three contributions. Firstly, it addresses the problem of fault detection from the perspective of end-to-end services, which is addressed under a probabilistic scope and which is suitable for IP based communication systems in general. Then, it provides a study on the dependability/performance trade-off for replicated servers in the infrastructure domain. Finally, an extension to the Intrusion Tolerant Agreement service is introduced, which exploits the assumed availability of a reliable server in the infrastructure domain to improve the performance of the basic (non-extended) service, as presented in D2.3.

[1]  Michael Ben-Or,et al.  Another advantage of free choice (Extended Abstract): Completely asynchronous agreement protocols , 1983, PODC '83.

[2]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1983, PODS '83.

[3]  Gabriel Bracha,et al.  An asynchronous [(n - 1)/3]-resilient consensus protocol , 1984, PODC '84.

[4]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[5]  C.S. Hood,et al.  Probabilistic network fault detection , 1996, Proceedings of GLOBECOM'96. 1996 IEEE Global Telecommunications Conference.

[6]  Marcos K. Aguilera,et al.  Failure Detection and Randomization: A Hybrid Approach to Solve Consensus , 1998, SIAM J. Comput..

[7]  M. Handley,et al.  SIP: Session Initiation Protocol , 1999, RFC.

[8]  Raimundo José de Araújo Macêdo,et al.  Solving the consensus problem in a mobile environment , 1999, 1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305).

[9]  D. Thompson,et al.  Construction of Bayesian networks for diagnostics , 2000, 2000 IEEE Aerospace Conference. Proceedings (Cat. No.00TH8484).

[10]  Achour Mostéfaoui,et al.  The best of both worlds: A hybrid approach to solve consensus , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[11]  Steen Andreassen,et al.  A method for diagnosing multiple diseases in MUNIN , 2001, IEEE Transactions on Biomedical Engineering.

[12]  Malgorzata Steinder,et al.  The present and future of event correlation: A need for end-to-end service fault localization , 2001 .

[13]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[14]  Foster Provost,et al.  The effect of class distribution on classifier learning , 2001 .

[15]  Paulo Veríssimo,et al.  Distributed Systems for System Architects , 2001, Advances in Distributed Computing and Middleware.

[16]  Abdelmadjid Bouabdallah,et al.  Solving the consensus problem in a dynamic group: an approach suitable for a mobile environment , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[17]  Antonio Casimiro,et al.  The Timely Computing Base Model and Architecture , 2002, IEEE Trans. Computers.

[18]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[19]  John Loughney,et al.  Requirements for Reliable Server Pooling , 2002, RFC.

[20]  Miguel Correia,et al.  The Design of a COTSReal-Time Distributed Security Kernel , 2002, EDCC.

[21]  Ibrahim Matta,et al.  End-to-End Inference of Loss Nature in a Hybrid Wired/Wireless Environment , 2002 .

[22]  Ibrahim Matta,et al.  A Bayesian Approach for TCP to Distinguish Congestion from Wireless Losses , 2003 .

[23]  Malgorzata Steinder,et al.  Probabilistic fault localization in communication systems using belief networks , 2004, IEEE/ACM Transactions on Networking.

[24]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[25]  Wei-Pang Yang,et al.  Byzantine Agreement on Mobile Ad-Hoc Network , 2004, IEEE International Conference on Networking, Sensing and Control, 2004.

[26]  Victor Shoup,et al.  Random Oracles in Constantinople: Practical Asynchronous Byzantine Agreement Using Cryptography , 2000, Journal of Cryptology.

[27]  Roy Friedman,et al.  Efficient Byzantine broadcast in wireless ad-hoc networks , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[28]  Miguel Correia,et al.  From Consensus to Atomic Broadcast: Time-Free Byzantine-Resistant Protocols without Signatures , 2006, Comput. J..

[29]  Jiannong Cao,et al.  A hierarchical consensus protocol for mobile ad hoc networks , 2006, 14th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP'06).

[30]  Michael J. Fischer,et al.  Stabilizing Consensus in Mobile Networks , 2006, DCOSS.

[31]  Andrea Bondavalli,et al.  Application of the evaluation framework to the complete scenario(preliminary version) , 2007 .

[32]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[33]  Michael Tüxen,et al.  Endpoint Handlespace Redundancy Protocol (ENRP) , 2008, RFC.

[34]  Michael Tüxen,et al.  Aggregate Server Access Protocol (ASAP) , 2008, RFC.

[35]  Giorgio Ventre,et al.  Network Simulator NS2 , 2008 .