Monitoring, Testing and Debugging of Distributed Real-Time Systems

Abstract: Testing is an important part of any software development project, and can typically exceed half of the development cost. For safety-critical computer-based systems, testing is ...
