Querying a Collection of Continuous Functions

We introduce a new query primitive called Function Query (FQ). An FQ operates on a set of math functions and retrieves the functions whose output with a given input satisfies a query condition (e.g., being among top k, within a given range). While FQ finds its natural uses in querying a database of math functions, it can also be applied on a database of discrete values. We show that by interpreting the database as a set of user-defined functions, FQ can achieve the same functionality as existing analytic queries such as top-k query and scalar product query. We address the challenge of efficient execution of FQ. The core of our solution is a novel data structure called Intersection-tree. Our research takes advantage of the fact that 1) the intersections of a set of continuous functions partition their domain into a number of subdomains, and 2) in each of these subdomains, the functions can be sorted based on their output. We evaluate the performance of the proposed techniques through analysis, prototyping, and experiments using both synthetic and real-world data. When querying a database of functions, our techniques scale well. When applied on a database of discrete values, our techniques are more versatile and outperform existing techniques in terms of various performance metrics.

[1]  Dimitrios Gunopulos,et al.  Ad-hoc Top-k Query Answering for Data Streams , 2007, VLDB.

[2]  Quentin F. Stout,et al.  Tree rebalancing in optimal time and space , 1986, CACM.

[3]  Zhao Zhang,et al.  Reverse k-Ranks Query , 2014, Proc. VLDB Endow..

[4]  D. Anderson,et al.  Algorithms for minimization without derivatives , 1974 .

[5]  Raymond Chi-Wing Wong,et al.  k-Hit Query: Top-k Query with Probabilistic Utility Function , 2015, SIGMOD Conference.

[6]  Ludwig Schl fli Theorie der vielfachen Kontinuit??t , 1950 .

[7]  Yufei Tao,et al.  Branch-and-bound processing of ranked queries , 2007, Inf. Syst..

[8]  Ying Cai,et al.  Querying Improvement Strategies , 2017, EDBT.

[9]  Peter Triantafillou,et al.  Efficient Scalable Accurate Regression Queries in In-DBMS Analytics , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[10]  Gang Chen,et al.  Answering Why-not Questions on Reverse Top-k Queries , 2015, Proc. VLDB Endow..

[11]  Arijit Khan,et al.  Towards indexing functions: answering scalar product queries , 2014, SIGMOD Conference.

[12]  Christos Doulkeridis,et al.  Monochromatic and Bichromatic Reverse Top-k Queries , 2011, IEEE Transactions on Knowledge and Data Engineering.

[13]  Franco P. Preparata,et al.  Plane-sweep algorithms for intersecting geometric figures , 1982, CACM.

[14]  Vagelis Hristidis,et al.  PREFER: a system for the efficient execution of multi-parametric ranked queries , 2001, SIGMOD '01.

[15]  Kyriakos Mouratidis,et al.  Determining the Impact Regions of Competing Options in Preference Space , 2017, SIGMOD Conference.

[16]  R. Winder Partitions of N-Space by Hyperplanes , 1966 .

[17]  A. Prasad Sistla,et al.  Modeling and querying moving objects , 1997, Proceedings 13th International Conference on Data Engineering.

[18]  Zhitao Shen,et al.  A Unified Framework for Efficiently Processing Ranking Related Queries , 2014, EDBT.

[19]  Kyriakos Mouratidis,et al.  Maximum Rank Query , 2015, Proc. VLDB Endow..

[20]  Dimitrios Gunopulos,et al.  Answering top-k queries using views , 2006, VLDB.

[21]  Kyriakos Mouratidis,et al.  Global immutable region computation , 2014, SIGMOD Conference.

[22]  Eric Lo,et al.  Answering Why-Not Questions on Top-K Queries , 2012, IEEE Transactions on Knowledge and Data Engineering.

[23]  Yannis Papakonstantinou,et al.  Combining Databases and Signal Processing in Plato , 2015, CIDR.

[24]  Luis Gravano,et al.  Evaluating Top-k Selection Queries , 1999, VLDB.

[25]  Samuel Madden,et al.  Querying continuous functions in a database system , 2008, SIGMOD Conference.

[26]  Karl Aberer,et al.  Model-view sensor data management in the cloud , 2013, 2013 IEEE International Conference on Big Data.

[27]  John R. Smith,et al.  The onion technique: indexing for linear optimization queries , 2000, SIGMOD '00.