EVEREST: an event recognition testbed

Due to their fundamental nature, analyzing the behavior of distributed computations is a complex task. One approach is to monitor the system activity for occurrences of user defined events. As the computation executes, the monitoring system automatically collects and evaluates information pertaining to the defined events, and recognizes event occurrences. The need exists to study various approaches to event recognition in a context in which the strategies can be created and analyzed. This paper presents an EVEnt REcognition teSTbed (EVEREST) specifically designed to address these needs. Commands are provided by which the user can define the behavior to be recognized the configuration of the monitoring system and the strategies to be used in recognizing the event occurrences. Utilizing these directives, EVEREST recognizes occurrences of the defined events during the distributed program's execution. By providing flexible structuring and dynamic configuration of the monitoring system, multiple time view protocols and the separation of event and monitoring specification, EVEREST provides the capability of developing, testing and comparing various approaches to distributed event recognition.

[1]  Dan C. Marinescu,et al.  Specification and identification of events for debugging and performance monitoring of distributed multiprocessor systems , 1990, Proceedings.,10th International Conference on Distributed Computing Systems.

[2]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[3]  Richard J. LeBlanc,et al.  Event-Driven Monitoring of Distributed Programs , 1985, ICDCS.

[4]  Karsten Schwan,et al.  Application-Dependent Dynamic Monitoring of Distributed and Parallel Systems , 1993, IEEE Trans. Parallel Distributed Syst..

[5]  LamportLeslie Time, clocks, and the ordering of events in a distributed system , 1978 .

[6]  Thomas Kunz Process clustering for distributed debugging , 1993, PADD '93.

[7]  Dieter Haban DTM - A Method for Testing Distributed Systems , 1987, SRDS.

[8]  Roger King,et al.  IDD: An Interactive Distributed Debugger , 1985, ICDCS.

[9]  Madalene Spezialetti An approach to reducing delays in recognizing distributed event occurrences , 1991, PADD '91.

[10]  Colin J. Fidge,et al.  Timestamps in Message-Passing Systems That Preserve the Partial Ordering , 1988 .

[11]  Jack C. Wileden,et al.  High-level debugging of distributed systems: The behavioral abstraction approach , 1983, J. Syst. Softw..

[12]  Kuo-Chung Tai Kuo-Chung Tai Definitions and Detection of Deadlock, Livelock, and Starvation in Concurrent Programs , 1994, 1994 Internatonal Conference on Parallel Processing Vol. 2.

[13]  M. Spezialetti,et al.  A general approach to recognizing event occurrences in distributed computations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[14]  Kenneth P. Birman,et al.  Tools for distributed application management , 1991, Computer.