On a Recurrence Arising in Graph Compression

In a recently proposed graphical compression algorithm by Choi and Szpankowski (2012), the following tree arose in the course of the analysis. The root contains $n$ balls that are consequently distributed between two subtrees according to a simple rule: In each step, all balls independently move down to the left subtree (say with probability $p$) or the right subtree (with probability $1-p$). A new node is created as long as there is at least one ball in that node. Furthermore, a nonnegative integer $d$ is given, and at level $d$ or greater one ball is removed from the leftmost node before the balls move down to the next level. These steps are repeated until all balls are removed (i.e., after $n+d$ steps). Observe that when $d=\infty$ the above tree can be modeled as a trie that stores $n$ independent sequences generated by a binary memoryless source with parameter $p$. Therefore, we coin the name $(n,d)$-tries for the tree just described, and to which we often refer simply as $d$-tries. We study here in detail the path length, and show how much the path length of such a $d$-trie differs from that of regular tries. We use methods of analytic algorithmics, from Mellin transforms to analytic poissonization.

[1]  B. Pittel Asymptotical Growth of a Class of Random Trees , 1985 .

[2]  Wojciech Szpankowski A Characterization of Digital Search Trees from the Successful Search Viewpoint , 1991, Theor. Comput. Sci..

[3]  Philippe Jacquet,et al.  Entropy Computations via Analytic Depoissonization , 1999, IEEE Trans. Inf. Theory.

[4]  Philippe Flajolet,et al.  Mellin Transforms and Asymptotics: Harmonic Sums , 1995, Theor. Comput. Sci..

[5]  B. Pittel Paths in a random digital tree: limiting distributions , 1986, Advances in Applied Probability.

[6]  W. Szpankowski Average Case Analysis of Algorithms on Sequences , 2001 .

[7]  Wojciech Szpankowski,et al.  Profile of Tries , 2008, LATIN.

[8]  Svante Janson,et al.  Analysis of an Asymmetric Leader Election Algorithm , 1997, Electron. J. Comb..

[9]  T. F. Móri On random trees , 2002 .

[10]  Philippe Flajolet,et al.  Analytic Combinatorics , 2009 .

[11]  Donald E. Knuth,et al.  The art of computer programming: sorting and searching (volume 3) , 1973 .

[12]  Hosam M. Mahmoud,et al.  Evolution of random search trees , 1991, Wiley-Interscience series in discrete mathematics and optimization.

[13]  Guy Louchard Exact and Asymptotic Distributions in Digital and Binary Search Trees , 1987, RAIRO Theor. Informatics Appl..

[14]  Wojciech Szpankowski,et al.  Compression of graphical structures , 2009, 2009 IEEE International Symposium on Information Theory.

[15]  Wojciech Szpankowski,et al.  Height in a digital search tree and the longest phrase of the Lempel-Ziv scheme , 2000, SODA '00.

[16]  Wojciech Szpankowski,et al.  Profiles of Tries , 2008, SIAM J. Comput..

[17]  L. Devroye A Study of Trie-Like Structures Under the Density Model , 1992 .

[18]  Philippe Jacquet,et al.  Analytical Depoissonization and its Applications , 1998, Theor. Comput. Sci..

[19]  Philippe Flajolet,et al.  Singularity Analysis and Asymptotics of Bernoulli Sums , 1999, Theor. Comput. Sci..

[20]  Michael Drmota,et al.  The expected profile of digital search trees , 2011, J. Comb. Theory, Ser. A.

[21]  Wojciech Szpankowski,et al.  Compression of Graphical Structures: Fundamental Limits, Algorithms, and Experiments , 2012, IEEE Transactions on Information Theory.

[22]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .