Interactive Self-Healing for Black-Box Components in Distributed Embedded Environments

As self-management features of computer systems depend on self-knowledge we have to cope with the issue that most large systems are not entirely known to the self-management software. Especially when system parts are supplied by external companies which want to preserve their intellectual property we have to cope with black-box components. This paper presents a novel approach for self-healing in distributed embedded systems containing black-box application software. The interactive self-healing process is based on well defined system knowledge and enables the construction of a functional dependency graph which serves as basis for rule-based root cause analysis and self-healing.

[1]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[2]  Philip Koopman,et al.  Elements of the Self-Healing System Problem Space , 2003 .

[3]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.

[4]  L. Stein,et al.  OWL Web Ontology Language - Reference , 2004 .

[5]  George Candea,et al.  Recursive restartability: turning the reboot sledgehammer into a scalpel , 2001, Proceedings Eighth Workshop on Hot Topics in Operating Systems.

[6]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[7]  Michael Dinkel,et al.  Unified Communication in Heterogeneous Automotive Control Systems , 2006 .

[9]  Herbert Bos,et al.  Robust distributed systems achieving self-management through inference , 2005, Sixth IEEE International Symposium on a World of Wireless Mobile and Multimedia Networks.

[10]  Malgorzata Steinder,et al.  A survey of fault localization techniques in computer networks , 2004, Sci. Comput. Program..

[11]  Biplav Srivastava,et al.  Bringing planning to autonomic applications with ABLE , 2004, International Conference on Autonomic Computing, 2004. Proceedings..

[12]  Qun Yang,et al.  A framework for dynamic software architecture-based self-healing , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[13]  Bradley R. Schmerl,et al.  Model-based adaptation for self-healing systems , 2002, WOSS '02.

[14]  Peter E. H. Hofmann,et al.  Evolutionäre Elektronikarchitektur für Kraftfahrzeuge (Evolutionary Electronic Systems for Automobiles) , 2005, it Inf. Technol..

[15]  Flaviu Cristian,et al.  Understanding fault-tolerant distributed systems , 1991, CACM.