Smoothed analysis of binary search trees

Binary search trees are one of the most fundamental data structures. While the height of such a tree may be linear in the worst case, the average height with respect to the uniform distribution is only logarithmic. The exact value is one of the best studied problems in average-case complexity. We investigate what happens in between by analysing the smoothed height of binary search trees: Randomly perturb a given (adversarial) sequence and then take the expected height of the binary search tree generated by the resulting sequence. As perturbation models, we consider partial permutations, partial alterations, and partial deletions. On the one hand, we prove tight lower and upper bounds of roughly @Q((1-p)@?n/p) for the expected height of binary search trees under partial permutations and partial alterations, where n is the number of elements and p is the smoothing parameter. This means that worst-case instances are rare and disappear under slight perturbations. On the other hand, we examine how much a perturbation can increase the height of a binary search tree, i.e. how much worse well balanced instances can become.

[1]  Luca Becchetti,et al.  Average case and smoothed competitive analysis of the multi-level feedback algorithm , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[2]  Luc Devroye,et al.  A note on the height of binary search trees , 1986, JACM.

[3]  Michael Drmota An analytic approach to the height of binary search trees II , 2003, JACM.

[4]  Kurt Mehlhorn,et al.  Smoothed Analysis of Three Combinatorial Problems , 2003, MFCS.

[5]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[6]  Naveen Sivadasan,et al.  Topology matters: Smoothed competitiveness of metrical task systems , 2004, Theor. Comput. Sci..

[7]  Friedhelm Meyer auf der Heide,et al.  Smoothed Motion Complexity , 2003, ESA.

[8]  Vijay V. Vazirani,et al.  Approximation Algorithms , 2001, Springer Berlin Heidelberg.

[9]  Joel H. Spencer,et al.  Coloring Random and Semi-Random k-Colorable Graphs , 1995, J. Algorithms.

[10]  René Beier,et al.  Typical Properties of Winners and Losers in Discrete Optimization , 2006, SIAM J. Comput..

[11]  Daniel A. Spielman,et al.  Improved smoothed analysis of the shadow vertex simplex method , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[12]  John Michael Robson,et al.  The Height of Binary Search Trees , 1979, Aust. Comput. J..

[13]  Bruce A. Reed,et al.  The height of a random binary search tree , 2003, JACM.

[14]  Rüdiger Reischuk,et al.  Smoothed Analysis of Binary Search Trees , 2005, ISAAC.

[15]  Alfred V. Aho,et al.  Data Structures and Algorithms , 1983 .

[16]  Shang-Hua Teng,et al.  Smoothed Analysis (Motivation and Discrete Models) , 2003, WADS.

[17]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[18]  Miklos Santha,et al.  Generating Quasi-random Sequences from Semi-random Sources , 1986, J. Comput. Syst. Sci..

[19]  Cecilia R. Aragon,et al.  Randomized search trees , 2005, Algorithmica.

[20]  Berthold Vöcking,et al.  Smoothed analysis of integer programming , 2007, Math. Program..

[21]  Abraham D. Flaxman,et al.  The diameter of randomly perturbed digraphs and some applications , 2007 .

[22]  John Michael Robson Constant bounds on the moments of the height of binary search trees , 2002, Theor. Comput. Sci..

[23]  René Beier,et al.  Typical properties of winners and losers in discrete optimization , 2004, STOC '04.

[24]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[25]  Shang-Hua Teng,et al.  Smoothed analysis of termination of linear programming algorithms , 2003, Math. Program..

[26]  Michael Drmota An analytic approach to the height of binary search trees , 2007, Algorithmica.

[27]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[28]  John Michael Robson On the Concentration of the Height of Binary Search Trees , 1997, ICALP.

[29]  D. Spielman,et al.  Smoothed analysis of algorithms: Why the simplex algorithm usually takes polynomial time , 2004 .

[30]  Bruce A. Reed,et al.  On the Variance of the Height of Random Binary Search Trees , 1995, SIAM J. Comput..

[31]  John Dunagan,et al.  Smoothed analysis of the perceptron algorithm for linear programming , 2002, SODA '02.

[32]  Clifford Stein,et al.  Introduction to Algorithms, 2nd edition. , 2001 .

[33]  Friedhelm Meyer auf der Heide,et al.  Labeling Smart Dust , 2004, ESA.

[34]  Shang-Hua Teng,et al.  Smoothed analysis of algorithms: why the simplex algorithm usually takes polynomial time , 2001, STOC '01.

[35]  Luca Becchetti,et al.  Average-Case and Smoothed Competitive Analysis of the Multilevel Feedback Algorithm , 2006, Math. Oper. Res..

[36]  Uriel Feige,et al.  Heuristics for Semirandom Graph Problems , 2001, J. Comput. Syst. Sci..

[37]  Daniel A. Spielman The Smoothed Analysis of Algorithms , 2005, FCT.

[38]  B. Pittel On growing random binary trees , 1984 .

[39]  Christian Sohler,et al.  Extreme Points Under Random Noise , 2004, ESA.