On the streaming complexity of fundamental geometric problems

In this paper, we focus on lower bounds and algorithms for some basic geometric problems in the one-pass (insertion only) streaming model. The problems considered are grouped into three categories: (i) Klee's measure (ii) Convex body approximation, geometric query, and (iii) Discrepancy Klee's measure is the problem of finding the area of the union of hyperrectangles. Under convex body approximation, we consider the problems of convex hull, convex body approximation, linear programming in fixed dimensions. The results for convex body approximation implies a property testing type result to find if a query point lies inside a convex polyhedron. Under discrepancy, we consider both the geometric and combinatorial discrepancy. For all the problems considered, we present (randomized) lower bounds on space. Most of our lower bounds are in terms of approximating the solution with respect to an error parameter $\epsilon$. We provide approximation algorithms that closely match the lower bound on space for most of the problems.

[1]  Piotr Indyk,et al.  Better algorithms for high-dimensional proximity problems via asymmetric embeddings , 2003, SODA '03.

[2]  Jon Louis Bentley,et al.  Decomposable Searching Problems I: Static-to-Dynamic Transformation , 1980, J. Algorithms.

[3]  Sudipto Guha,et al.  Tight Lower Bounds for Multi-pass Stream Computation Via Pass Elimination , 2008, ICALP.

[4]  Hamid Zarrabi-Zadeh,et al.  A Streaming Algorithm for the Convex Hull. , 2015 .

[5]  Pankaj K. Agarwal,et al.  Approximating extent measures of points , 2004, JACM.

[6]  Joan Feigenbaum,et al.  Computing Diameter in the Streaming and Sliding-Window Models , 2002, Algorithmica.

[7]  J. Ian Munro,et al.  Selection and sorting with limited storage , 1978, 19th Annual Symposium on Foundations of Computer Science (sfcs 1978).

[8]  Pankaj K. Agarwal,et al.  Streaming Algorithms for Extent Problems in High Dimensions , 2010, SODA '10.

[9]  Philippe Flajolet,et al.  Probabilistic Counting Algorithms for Data Base Applications , 1985, J. Comput. Syst. Sci..

[10]  Tim Roughgarden,et al.  Communication Complexity (for Algorithm Designers) , 2015, Found. Trends Theor. Comput. Sci..

[11]  Eyal Kushilevitz,et al.  Communication Complexity , 1997, Adv. Comput..

[12]  Gokarna Sharma,et al.  Efficient transformations for Klee's measure problem in the streaming model , 2015, Comput. Geom..

[13]  A. Razborov Communication Complexity , 2011 .

[14]  Robert H. Morris,et al.  Counting large numbers of events in small registers , 1978, CACM.

[15]  Alexandr Andoni,et al.  Width of Points in the Streaming Model , 2016, TALG.

[16]  Prabhakar Raghavan,et al.  Computing on data streams , 1999, External Memory Algorithms.

[17]  Timothy M. Chan Dynamic Streaming Algorithms for Epsilon-Kernels , 2016, SoCG.

[18]  Adi Rosén,et al.  Space-Constrained Interval Selection , 2012, ICALP.

[19]  Pablo Pérez-Lantero,et al.  Interval Selection in the Streaming Model , 2015, WADS.

[20]  Hamid Zarrabi-Zadeh An Almost Space-Optimal Streaming Algorithm for Coresets in Fixed Dimensions , 2008, ESA.

[21]  Subhash Suri,et al.  Adaptive sampling for geometric problems over data streams , 2004, PODS.

[22]  Piotr Indyk,et al.  Streaming Algorithms for Geometric Problems , 2004, FSTTCS.

[23]  Timothy M. Chan,et al.  A Simple Streaming Algorithm for Minimum Enclosing Balls , 2006, CCCG.

[24]  Noga Alon,et al.  The Space Complexity of Approximating the Frequency Moments , 1999 .

[25]  Timothy M. Chan A (slightly) faster algorithm for klee's measure problem , 2008, SCG '08.

[26]  J. Matousek,et al.  Geometric Discrepancy: An Illustrated Guide , 2009 .

[27]  David P. Woodruff,et al.  Rectangle-efficient aggregation in spatial data streams , 2012, PODS '12.

[28]  Ashwin Lall,et al.  Randomized Multi-pass Streaming Skyline Algorithms , 2009, Proc. VLDB Endow..

[29]  Mark H. Overmars,et al.  New upper bounds in Klee's measure problem , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[30]  Timothy M. Chan,et al.  Streaming and Dynamic Algorithms for Minimum Enclosing Balls in High Dimensions , 2011, WADS.

[31]  David Eppstein,et al.  Deterministic sampling and range counting in geometric data streams , 2003, TALG.

[32]  Timothy M. Chan,et al.  Multi-Pass Geometric Algorithms , 2005, Discret. Comput. Geom..

[33]  S. Muthukrishnan,et al.  Data streams: algorithms and applications , 2005, SODA '03.

[34]  David P. Woodruff Optimal space lower bounds for all frequency moments , 2004, SODA '04.

[35]  Timothy M. Chan,et al.  Better ϵ-Dependencies for Offline Approximate Nearest Neighbor Search, Euclidean Minimum Spanning Trees, and ϵ-Kernels , 2014, Symposium on Computational Geometry.

[36]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[37]  Amit Chakrabarti,et al.  On Density, Threshold and Emptiness Queries for Intervals in the Streaming Model , 2015, FSTTCS.

[38]  Csaba D. Tóth,et al.  Range Counting over Multidimensional Data Streams , 2004, SCG '04.

[39]  Piotr Indyk,et al.  Sampling in dynamic data streams and applications , 2005, Int. J. Comput. Geom. Appl..

[40]  Sudipto Guha,et al.  Graph Synopses, Sketches, and Streams: A Survey , 2012, Proc. VLDB Endow..

[41]  Timothy M. Chan Dynamic Coresets , 2008, SCG '08.

[42]  Zhengyuan Zhu,et al.  Spatial scan statistics: approximations and performance study , 2006, KDD '06.

[43]  R. Dudley Metric Entropy of Some Classes of Sets with Differentiable Boundaries , 1974 .

[44]  Edo Liberty,et al.  Optimal Quantile Approximation in Streams , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[45]  David P. Woodruff,et al.  An optimal algorithm for the distinct elements problem , 2010, PODS '10.

[46]  Jan van Leeuwen,et al.  The Measure Problem for Rectangular Ranges in d-Space , 1981, J. Algorithms.

[47]  Timothy M. Chan Faster core-set constructions and data-stream algorithms in fixed dimensions , 2006, Comput. Geom..

[48]  Timothy M. Chan Klee's Measure Problem Made Easy , 2013, 2013 IEEE 54th Annual Symposium on Foundations of Computer Science.

[49]  Joan Feigenbaum,et al.  Massive data streams in graph theory and computational geometry , 2005 .

[50]  Piotr Indyk,et al.  Algorithms for dynamic geometric problems over data streams , 2004, STOC '04.

[51]  Lauwerens Kuipers,et al.  Uniform distribution of sequences , 1974 .

[52]  Sariel Har-Peled,et al.  On coresets for k-means and k-median clustering , 2004, STOC '04.