Optimal Algorithms for Range Searching over Multi-Armed Bandits

This paper studies a multi-armed bandit (MAB) version of the range-searching problem. In its basic form, range searching takes as input a set of points on the real line and a collection of real intervals. Each point has an associated weight, and the objective is to find a maximum-weight point within every given interval. The current work addresses range searching with stochastic weights: each point corresponds to an arm that admits sample access, and the point’s weight is the unknown mean of the arm’s underlying distribution. In this MAB setup, we develop sample-efficient algorithms that find, with high probability, near-optimal arms within the given intervals; that is, we obtain PAC (probably approximately correct) guarantees. We also provide an algorithm for a generalization in which the weight of each point is a multi-dimensional vector. The sample complexities of our algorithms depend, in particular, on the size of the optimal hitting set of the given intervals. Finally, we establish lower bounds proving that the obtained sample complexities are essentially tight. Our results highlight the significance of geometric constructs, specifically hitting sets, in our MAB setting.
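
To make the two ingredients of the abstract concrete, the following Python sketch may help. It is not the paper's algorithm. It shows (i) the classical greedy computation of a minimum hitting set for closed intervals on the line, where the hitting set is drawn from the given points, and (ii) a naive uniform-sampling baseline that attains a PAC guarantee through a Hoeffding bound and a union bound, under the assumption that rewards lie in [0, 1]. The function names, the `pull` callback, and the parameters `eps` and `delta` are hypothetical and chosen only for illustration.

```python
import bisect
import math


def greedy_hitting_set(points, intervals):
    """Return a minimum-size subset of `points` hitting every closed interval.

    Classical greedy: process intervals by increasing right endpoint and, for
    each interval not yet hit, pick the largest point not exceeding that
    endpoint.
    """
    pts = sorted(points)
    chosen = []
    for lo, hi in sorted(intervals, key=lambda iv: iv[1]):
        if chosen and chosen[-1] >= lo:
            continue  # the most recently chosen point already lies in [lo, hi]
        k = bisect.bisect_right(pts, hi)  # pts[k - 1] is the largest point <= hi
        if k == 0 or pts[k - 1] < lo:
            raise ValueError(f"interval [{lo}, {hi}] contains no point")
        chosen.append(pts[k - 1])
    return chosen


def samples_per_arm(num_arms, eps, delta):
    """Pulls per arm so that all empirical means are within eps/2 of the true
    means with probability at least 1 - delta (Hoeffding plus a union bound;
    assumes rewards in [0, 1])."""
    return math.ceil((2.0 / eps**2) * math.log(2.0 * num_arms / delta))


def naive_range_pac(points, intervals, pull, eps, delta):
    """Naive baseline (an assumption of this sketch, not the paper's method):
    sample every arm equally often, then report the empirically best arm
    index inside each interval, or None if the interval contains no point."""
    n = samples_per_arm(len(points), eps, delta)
    means = [sum(pull(i) for _ in range(n)) / n for i in range(len(points))]
    answers = []
    for lo, hi in intervals:
        inside = [i for i, x in enumerate(points) if lo <= x <= hi]
        answers.append(max(inside, key=lambda i: means[i]) if inside else None)
    return answers
```

For example, with points `[1, 2, 3, 4, 5]` and intervals `[(1, 3), (2, 4), (5, 5)]`, `greedy_hitting_set` returns `[3, 5]`. The baseline's sample count grows with the total number of arms, whereas the abstract states that the paper's sample complexities depend on the size of the optimal hitting set; the sketch only illustrates the objects involved.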
