A New Trace And Replay System For Shared Memory Programs Based On Lamport Clocks

One of the key pwblema iin debugging parallel pwgrama is that the behavior of a parallel program in response to a jbed input may be indeterminate. As a consequence of this non-repeatability, cyclic monitoring techniques for error isollation are not guaranteed to work. To tackle this problem, diferent trace and replay mechanwma have been proposed to support the debugging taak. In this paper we introduce a novel trace and replag mechanism for shared-memoty programa, baaed on Lamport cdocka. We compare the new scheme with Leblanc’s ‘Instant Replay’ and Netzer ’s ‘optimal tracing’. Our approach produces (much) smaller traces than both ezirting approaches. Moreover, in comparison with Ntstzer’s approach, our approach induces much less ovlerhead during the recording phase.

[1]  Seth Copen Goldstein,et al.  Hardware-assisted replay of multiprocessor programs , 1991, PADD '91.

[2]  Luk Levrouw,et al.  An efficient record-replay mechanism for shared memory programs , 1993, 1993 Euromicro Workshop on Parallel and Distributed Processing.

[3]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[4]  André Schiper,et al.  Efficient Execution Replay Technique for Distributed Memory Architectures , 1991, EDMCC.

[5]  Edith Schonberg,et al.  An empirical comparison of monitoring algorithms for access anomaly detection , 2011, PPOPP '90.

[6]  E. H. Debaere,et al.  A concurrent high-level language based real-time controller , 1988 .

[7]  Mark A. Linton,et al.  Supporting reverse execution for parallel programs , 1988, PADD '88.

[8]  Barton P. Miller,et al.  Optimal tracing and replay for debugging message-passing parallel programs , 1992, Proceedings Supercomputing '92.

[9]  Richard H. Carver,et al.  Reproducible Testing of Concurrent Programs Based on Shared Variables , 1986, ICDCS.

[10]  André Schiper,et al.  Execution replay on distributed memory architectures , 1990, Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing 1990.

[11]  Robert H. B. Netzer Optimal tracing and replay for debugging shared-memory parallel programs , 1993, PADD '93.

[12]  Thomas J. LeBlanc,et al.  Debugging Parallel Programs with Instant Replay , 1987, IEEE Transactions on Computers.

[13]  David Lorge Parnas,et al.  Concurrent control with “readers” and “writers” , 1971, CACM.