A Survey of Logging Uses

Logging is ubiquitous in fault-tolerant systems and its use is growing in recent research. Logs are used for many purposes: keeping histories, security, concurrency control, failure recovery, update atomicity, fast physical writes, etc. This paper surveys the uses of logging and points out the commonalities and differences of clients'' needs. It aims to provide the user and the designer of a log system with a global vision of logging.

[1]  Mahadev Satyanarayanan,et al.  Disconnected Operation in the Coda File System , 1999, Mobidata.

[2]  Alfred Z. Spector,et al.  Distributed logging for transaction processing , 1987, SIGMOD '87.

[3]  Joseph A. Fisher,et al.  Predicting conditional branch directions from previous runs of a program , 1992, ASPLOS V.

[4]  Alfred Z. Spector,et al.  Distributed Transaction Processing and The Camelot System , 1987 .

[5]  Alan L. Cox,et al.  Lazy release consistency for software distributed shared memory , 1992, ISCA '92.

[6]  Raymond A. Lorie,et al.  Physical integrity in a large segmented database , 1977, TODS.

[7]  Robert E. Strom,et al.  Optimistic recovery in distributed systems , 1985, TOCS.

[8]  Kenneth P. Kane Log-Based Recovery in Asychronous Distributed Systems , 1989 .

[9]  Philip A. Bernstein,et al.  Timestamp-Based Algorithms for Concurrency Control in Distributed Database Systems , 1980, VLDB.

[10]  R. S. Finlayson A log file service exploiting write-once storage , 1990 .

[11]  Butler W. Lampson,et al.  Crash Recovery in a Distributed Data Storage System , 1981 .

[12]  Patrick Valduriez,et al.  Relational Databases and Knowledge Bases , 1989 .

[13]  E. Nett,et al.  A generic log-service supporting fast recovery in distributed fault-tolerant systems , 1993, Proceedings 1993 IEEE Workshop on Advances in Parallel and Distributed Systems.

[14]  Charles T. Davies,et al.  Recovery semantics for a DB/DC system , 1973, ACM Annual Conference.

[15]  John H. Hartman,et al.  The Zebra striped network file system , 1995, TOCS.

[16]  Thomas J. LeBlanc,et al.  Debugging Parallel Programs with Instant Replay , 1987, IEEE Transactions on Computers.

[17]  Mahadev Satyanarayanan,et al.  The ITC distributed file system: principles and design , 1985, SOSP 1985.

[18]  Laura M. Haas,et al.  Computation and communication in R*: a distributed database manager , 1984, TOCS.

[19]  Kenneth P. Birman,et al.  Reliable communication in an unreliable environment , 1987 .

[20]  Andreas Reuter,et al.  Transaction Processing: Concepts and Techniques , 1992 .

[21]  Barbara Liskov,et al.  Reliable object storage to support atomic actions , 1983, SOSP 1985.

[22]  Edward T. Smith Debugging Tools for Message-Based, Communicating Processes , 1984, ICDCS.

[23]  Larry D. Wittie,et al.  BUGNET: A Debugging system for parallel programming environments , 1982, ICDCS.

[24]  Paul F. King,et al.  Database sharing: an efficient mechanism for supporting concurrent processes , 1973, AFIPS National Computer Conference.

[25]  David F. Bacon,et al.  Volatile logging in n-fault-tolerant distributed systems , 1988, [1988] The Eighteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[26]  David P. Reed,et al.  Naming and synchronization in a decentralized computer system , 1978 .

[27]  David L. Presotto,et al.  Publishing: a reliable broadcast communication mechanism , 1983, SOSP '83.

[28]  Barbara Liskov,et al.  Implementation of Argus , 1987, SOSP '87.

[29]  Richard C. Holt,et al.  Some deadlock properties of computer systems , 1971, SOSP '71.

[30]  Roger L. Haskin,et al.  Recovery management in QuickSilver , 1988, TOCS.

[31]  Richard J. LeBlanc,et al.  Event-Driven Monitoring of Distributed Programs , 1985, ICDCS.

[32]  Lawrence A. Bjork Recovery scenario for a DB/DC system , 1973, ACM Annual Conference.

[33]  David B. Johnson,et al.  Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing , 1988, J. Algorithms.

[34]  Willy Zwaenepoel,et al.  Manetho: Transparent Rollback-Recovery with Low Overhead, Limited Rollback, and Fast Output Commit , 1992, IEEE Trans. Computers.

[35]  David K. Gifford,et al.  Concurrent compacting garbage collection of a persistent heap , 1993, SOSP '93.

[36]  David J. DeWitt,et al.  Crash recovery in client-server EXODUS , 1992, SIGMOD '92.

[37]  Garret Swart,et al.  Granularity and semantic level of replication in the Echo distributed file system , 1990, [1990] Proceedings. Workshop on the Management of Replicated Data.

[38]  Alfred Z. Spector,et al.  Distributed transactions for reliable systems , 1985, SOSP '85.

[39]  Dean Daniels,et al.  Shared logging services for fault-tolerant distributed computing , 1990, EW 4.

[40]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[41]  Barbara Liskov,et al.  Reliable object storage to support atomic actions , 1983, SOSP '85.

[42]  Michel Ruun K I T L O G : a Generic Logging Service , 1992 .

[43]  Irving L. Traiger,et al.  The Recovery Manager of the System R Database Manager , 1981, CSUR.

[44]  Andreas Reuter,et al.  Principles of transaction-oriented database recovery , 1983, CSUR.

[45]  Kenneth P. Birman,et al.  Replication and fault-tolerance in the ISIS system , 1985, SOSP '85.

[46]  Mahadev Satyanarayanan,et al.  The ITC distributed file system: principles and design , 1985, SOSP '85.

[47]  David B. Johnson,et al.  Sender-Based Message Logging , 1987 .

[48]  Ii Richard George Guy,et al.  FICUS: a very large scale reliable distributed file system , 1992 .

[49]  Joost Verhofstad,et al.  Recovery Techniques for Database Systems , 1978, CSUR.

[50]  Hamid Pirahesh,et al.  ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging , 1998 .

[51]  Peter Bosch,et al.  Huygens File Service and Storage Architecture , 1993 .

[52]  R. Shapiro,et al.  Reliability and fault recovery in distributed processing , 1977 .

[53]  Irving L. Traiger,et al.  The notions of consistency and predicate locks in a database system , 1976, CACM.

[54]  Robert B. Hagmann,et al.  Reimplementing the Cedar file system using logging and group commit , 1987, SOSP '87.

[55]  Santosh K. Shrivastava,et al.  The Treatment of Persistent Objects in Arjuna , 1989, Comput. J..

[56]  Stephen Fox,et al.  The implementation of an integrated concurrency control and recovery scheme , 1982, SIGMOD '82.

[57]  Philip A. Bernstein,et al.  Concurrency Control in Distributed Database Systems , 1986, CSUR.

[58]  Mahadev Satyanarayanan,et al.  Log-based directory resolution in the Coda file system , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[59]  Michael Williams,et al.  Replication in the harp file system , 1991, SOSP '91.

[60]  Hector Garcia-Molina,et al.  Consistency in a partitioned network: a survey , 1985, CSUR.

[61]  Mahadev Satyanarayanan,et al.  Integrating security in a large distributed system , 1989, TOCS.

[62]  Mahadev Satyanarayanan,et al.  Lightweight recoverable virtual memory , 1993, SOSP '93.

[63]  Sailesh Chutani,et al.  DEcorum File System Architectural Overview , 1990, USENIX Summer.

[64]  Mahadev Satyanarayanan,et al.  Transparent logging as a technique for debugging complex distributed systems , 1992, EW 5.

[65]  Joseph Pasquale,et al.  A high performance multi-structured file system design , 1991, SOSP '91.

[66]  Daniel J. Rosenkrantz,et al.  System level concurrency control for distributed database systems , 1978, TODS.

[67]  Fred Douglis,et al.  Log-structured file systems , 1989, Digest of Papers. COMPCON Spring 89. Thirty-Fourth IEEE Computer Society International Conference: Intellectual Leverage.

[68]  Mahadev Satyanarayanan,et al.  Andrew: a distributed personal computing environment , 1986, CACM.

[69]  Anita Borg,et al.  A message system supporting fault tolerance , 1983, SOSP '83.

[70]  John H. Hartman,et al.  The Zebra striped network file system , 1993, SOSP '93.

[71]  Michel Ruffin Inria An Architecture for Fault Tolerant Storage Systems , 1993 .

[72]  Anupam Bhide,et al.  Implicit replication in a network file server , 1990, [1990] Proceedings. Workshop on the Management of Replicated Data.

[73]  Randy H. Katz,et al.  Introduction to redundant arrays of inexpensive disks (RAID) , 1989, Digest of Papers. COMPCON Spring 89. Thirty-Fourth IEEE Computer Society International Conference: Intellectual Leverage.

[74]  Mahadev Satyanarayanan,et al.  On the ubiquity of logging in distributed file systems , 1992, [1992] Proceedings Third Workshop on Workstation Operating Systems.

[75]  Gerhard Weikum,et al.  A Performance Evaluation of Multi-Level Transaction Management , 1991, VLDB.