A Petri net approach to fault detection and diagnosis in distributed systems. I. Application to telecommunication networks, motivations, and modelling

This paper presents a new use of safe Petri nets in the field of distributed discrete event systems, with application to telecommunication network management. A long-range objective of this study is to provide a generic supervisor that can be easily distributed over a set of sensors. Petri nets are used to provide both a model and an algorithm in the fault management domain. Key features of our approach are: (1) we take advantage of the ability of Petri nets to model concurrency in distributed systems; (2) we avoid using the marking graph in our algorithms, in order to avoid state explosion, and rely instead on the so-called partial order semantics of Petri nets; and (3) our algorithms use net unfolding techniques and extend them to the probabilistic case by providing a generalized Viterbi algorithm. This paper concentrates on application, motivations, and modelling.
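The state explosion mentioned in feature (2) can be illustrated with a minimal sketch. The net below is a hypothetical toy example, not the model developed in the paper: it consists of n independent components, each with one `ok` place, one `faulty` place, and one transition between them. The names `make_net` and `reachable_markings` are assumptions introduced for illustration only. Because the net is safe, a marking is represented as a set of marked places.

```python
from collections import deque


def make_net(n):
    """Build a toy safe Petri net with n concurrent components (hypothetical example).

    Component i has places ("ok", i) and ("faulty", i) and a single
    transition "fail<i>" that moves the token from ok to faulty.
    Each transition is stored as a pair (preset, postset).
    """
    transitions = {
        f"fail{i}": (frozenset({("ok", i)}), frozenset({("faulty", i)}))
        for i in range(n)
    }
    initial = frozenset(("ok", i) for i in range(n))
    return transitions, initial


def reachable_markings(transitions, initial):
    """Enumerate the nodes of the marking graph by breadth-first search."""
    seen = {initial}
    queue = deque([initial])
    while queue:
        marking = queue.popleft()
        for pre, post in transitions.values():
            if pre <= marking:  # transition enabled: all input places marked
                nxt = (marking - pre) | post  # firing rule for a safe net
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
    return seen


if __name__ == "__main__":
    for n in (2, 5, 10):
        transitions, initial = make_net(n)
        markings = reachable_markings(transitions, initial)
        print(n, "transitions,", len(markings), "reachable markings")
```

In this toy net the n transitions are pairwise concurrent, so the marking graph has 2^n nodes, while a partial order representation only needs to record the n events and the absence of ordering between them. This is the gap that motivates working with partial order semantics rather than with the marking graph.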
