Improved Behaviour of Tries by Adaptive Branching

Abstract We introduce and analyze a method to reduce the search cost in tries. Traditional trie structures use branching factors at the nodes that are either fixed or a function of the number of elements. Instead, we let the distribution of the elements guide the choice of branching factors. This is accomplished in a strikingly simple way: in a binary trie, the i highest complete levels are replaced by a single node of degree 2 i ; the compression is repeated in the subtries. This structure, the level-compressed trie , inherits the good properties of binary tries with respect to neighbour and range searches, while the external path length is significantly decreased. It also has the advantage of being easy to implement. Our analysis shows that the expected depth of a stored element is Θ(log ∗ n ) for uniformly distributed data.

[1]  Gaston H. Gonnet,et al.  Handbook Of Algorithms And Data Structures , 1984 .

[2]  Helmut Prodinger,et al.  Some Further Results on Digital Search Trees , 1986, ICALP.

[3]  M. V. Wilkes,et al.  The Art of Computer Programming, Volume 3, Sorting and Searching , 1974 .

[4]  Robert E. Tarjan,et al.  Storing a sparse table , 1979, CACM.

[5]  L. Devroye Laws of the Iterated Logarithm for Order Statistics of Uniform Spacings , 1981 .

[6]  Edward Fredkin,et al.  Trie memory , 1960, Commun. ACM.

[7]  W. Jevons,et al.  Choice and Chance , 1870, Nature.

[8]  B. Pittel Paths in a random digital tree: limiting distributions , 1986, Advances in Applied Probability.

[9]  Dan E. Willard Log-Logarithmic Worst-Case Range Queries are Possible in Space Theta(N) , 1983, Inf. Process. Lett..

[10]  Philippe Flajolet,et al.  Digital Search Trees Revisited , 1986, SIAM J. Comput..

[11]  Torben Hagerup,et al.  A Guided Tour of Chernoff Bounds , 1990, Inf. Process. Lett..

[12]  Dan E. Willard,et al.  Log-logarithmic worst-case range queries are possible in space ⊕(N) , 1983 .

[13]  Brian Skyrms,et al.  Choice And Chance , 1966 .

[14]  Helmut Prodinger,et al.  On the Balance Property of Patricia Tries: External Path Length Viewpoint , 1989, Theor. Comput. Sci..

[15]  Dan E. Willard New Trie Data Structures Which Support Very Fast Search Operations , 1984, J. Comput. Syst. Sci..

[16]  Larry Carter,et al.  Universal Classes of Hash Functions , 1979, J. Comput. Syst. Sci..

[17]  E. Slud Entropy and maximal spacings for random partitions , 1978 .