Towards a Framework for Self-Adaptive Reliable Network Services in Highly-Uncertain Environments

In future inhomogeneous, pervasive, and highly dynamic networks, end-nodes may often have only unreliable and uncertain observations from which to diagnose hidden network states and decide on possible remediation actions. Identifying good and timely decision strategies that improve the resilience of end-node services is inherently challenging. In this paper we present a framework, called ODDR (Observation, Diagnosis, Decision, Remediation), for improving the resilience of network-based services through the integration of self-adaptive monitoring services, network diagnosis, decision actions, and finally the execution (and monitoring) of remediation actions. We detail the motivations for the ODDR design, present its architecture, and describe our current activities towards the realization and assessment of the framework services, together with the main results achieved so far.
