Paper Information - On the Most Likely Voronoi Diagram and Nearest Neighbor Searching


Let 𝒮 = {(s₁, π₁), (s₂, π₂), …, (sₙ, πₙ)} be a set of stochastic sites, where each site is a tuple (sᵢ, πᵢ) consisting of a point sᵢ in d-dimensional space and a probability πᵢ of existence. Given a query point q, we define its most likely nearest neighbor (LNN) as the site with the largest probability of being q's nearest neighbor. The Most Likely Voronoi Diagram (LVD) of 𝒮 is a partition of the space into regions with the same LNN. We investigate the complexity of the LVD in one dimension and show that it can have size Ω(n²) in the worst case. We then show that under non-adversarial conditions, the size of the 1-dimensional LVD is significantly smaller: (1) Θ(kn) if the input has only k distinct probability values, (2) O(n log n) on average, and (3) O(n√n) under smoothed analysis. We also describe a framework for LNN search using Pareto sets, which gives a linear-space data structure and sub-linear query time in 1D for the average and smoothed analysis models, as well as for the worst case with a bounded number of distinct probabilities. The Pareto-set framework is also applicable to multi-dimensional LNN search via a reduction to a sequence of nearest neighbor and spherical range queries.
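To make the LNN definition concrete, here is a minimal brute-force sketch for the 1-dimensional case. It is not the data structure from the paper. It assumes that sites exist independently of one another and that no two sites are equidistant from q; under those assumptions, the probability that site i is q's nearest neighbor is πᵢ multiplied by (1 − πⱼ) over all sites j closer to q. The function names `nn_probabilities` and `most_likely_nn` are illustrative only.

```python
from typing import List, Tuple

# A stochastic site in 1D: (position, probability of existence).
Site = Tuple[float, float]


def nn_probabilities(sites: List[Site], q: float) -> List[float]:
    """Return, for each site, the probability that it is q's nearest neighbor.

    Assumes independent existence and no distance ties (illustrative assumptions).
    """
    # Visit sites from closest to farthest from q.
    order = sorted(range(len(sites)), key=lambda i: abs(sites[i][0] - q))
    probs = [0.0] * len(sites)
    none_closer = 1.0  # probability that no closer site exists
    for i in order:
        pi = sites[i][1]
        # Site i is the nearest neighbor iff it exists and every closer site does not.
        probs[i] = pi * none_closer
        none_closer *= 1.0 - pi
    return probs


def most_likely_nn(sites: List[Site], q: float) -> int:
    """Return the index of the most likely nearest neighbor (LNN) of q."""
    probs = nn_probabilities(sites, q)
    return max(range(len(sites)), key=probs.__getitem__)


if __name__ == "__main__":
    sites = [(0.0, 0.3), (1.0, 0.9), (5.0, 0.5)]
    print(most_likely_nn(sites, 0.2))
```

In this example the site at position 0.0 is geometrically closest to q = 0.2, but the site at position 1.0 (index 1) is the LNN because it is far more likely to exist. Each query costs O(n log n) here because of the sort; the abstract's Pareto-set framework is what yields sub-linear query time.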

Subhash Suri | Kevin Verbeek
