Remote File Synchronization Single-Round Algorithms

Remote file synchronization has been studied extensively over the last decade, and the existing approaches can be divided into single-round and multi-round protocols. Single-round protocols are preferable in scenarios involving small files and large network latencies (e.g., web access over slow links) due protocol complexity and computing and I/O overheads. The best-known algorithms which are used for synchronization of file systems across machines are rsync, set reconciliation, Remote Differential Compression & RSYNC based on erasure codes. In this paper we will discuss the remote file synchronization protocols and compare the performance of all these protocols on different data sets. Index Terms — Remote files synchronization (RSYNC), Remote Differential Compression (RDC), Set Reconciliation (Recon), GCC, HTML, EMACS.

[1]  Alexandre V. Evfimievski A probabilistic algorithm for updating files over a communication link , 1998, SODA '98.

[2]  Paul Mackerras,et al.  The rsync algorithm , 1996 .

[3]  Hector Garcia-Molina,et al.  Exploiting symmetries for low-cost comparison of file copies , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[4]  Torsten Suel,et al.  Algorithms for Low-Latency Remote File Synchronization , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[5]  Sachin Agarwal,et al.  Bandwidth Efficient String Reconciliation Using Puzzles , 2006, IEEE Transactions on Parallel and Distributed Systems.

[6]  Graham Cormode,et al.  Sequence distance embeddings , 2003 .

[7]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[8]  Nikolaj Bjørner,et al.  Optimizing File Replication over Limited-Bandwidth Networks using Remote Differential Compression , 2006 .

[9]  Robert W. Bowdidge,et al.  Low cost comparisons of file copies , 1990, Proceedings.,10th International Conference on Distributed Computing Systems.

[10]  Torsten Suel,et al.  Improved single-round protocols for remote file synchronization , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[11]  Alon Orlitsky,et al.  Practical protocols for interactive communication , 2001, Proceedings. 2001 IEEE International Symposium on Information Theory (IEEE Cat. No.01CH37252).

[12]  Andrew Tridgell,et al.  Efficient Algorithms for Sorting and Synchronization , 1999 .

[13]  Torsten Suel,et al.  Improved file synchronization techniques for maintaining large replicated collections over slow networks , 2004, Proceedings. 20th International Conference on Data Engineering.