Synchronization from insertions and deletions under a non-binary, non-uniform source

We study the problem of synchronizing two files X and Y at two distant nodes A and B that are connected through a two-way communication channel. We assume that file Y at node B is obtained from file X at node A by inserting and deleting a small fraction of symbols in X. More specifically, we consider the case where X is a non-binary non-uniform string, and deletions and insertions happen uniformly with rates β<sub>d</sub> and β<sub>i</sub>, respectively. We propose a synchronization protocol between node A and node B that needs to transmit O(q/H<sub>2</sub>(β<sub>d</sub>+β<sub>i</sub>)n log 1/β<sub>d</sub>+β<sub>i</sub>) bits (where n is the length of X, q is the alphabet size and H<sub>2</sub> is the collision entropy of X) and reconstructs X at node B with error probability exponentially low in n. This protocol readily generalizes the recent result by Tabatabaei Yazdi and Dolecek that dealt with synchronization from binary uniform source and under only deletion errors.

[1]  G. Crooks On Measures of Entropy and Information , 2015 .

[2]  Tetsunao Matsuta,et al.  国際会議開催報告:2013 IEEE International Symposium on Information Theory , 2013 .

[3]  L. Dolecek,et al.  Synchronization from Insertions and Deletions , 2013 .

[4]  Lara Dolecek,et al.  Synchronization from deletions through interactive communication , 2012, 2012 7th International Symposium on Turbo Codes and Iterative Information Processing (ISTC).

[5]  Kannan Ramchandran,et al.  Efficient file synchronization: A distributed source coding approach , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[6]  Kannan Ramchandran,et al.  Interactive low-complexity codes for synchronization from deletions and insertions , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[7]  Alon Orlitsky,et al.  Interactive communication: balanced distributions, correlated files, and average-case complexity , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[8]  G. Tenengolts,et al.  Nonbinary codes, correcting single deletion or insertion , 1984, IEEE Trans. Inf. Theory.

[9]  Larry Carter,et al.  Universal Classes of Hash Functions , 1979, J. Comput. Syst. Sci..

[10]  W. Hoeffding Probability inequalities for sum of bounded random variables , 1963 .