Monitoring distributed systems

The paper suggests that a functional model of monitoring, framed in terms of the generation, processing, dissemination, and presentation of information, can help determine the facilities needed to design and construct distributed systems. Implementation issues are also discussed, in particular the intrusiveness of monitoring systems and object-based implementation. The paper concludes that generic monitoring services are important tools for managing distributed systems and for debugging them during system development. Monitoring services may also be needed as part of the application itself, for example in process control and factory automation.
