Grid-Index Algorithm for Reverse Rank Queries

In Rank-aware query processing, reverse rank queries have already attracted significant interests. Reverse rank queries can find matching customers for a given product based on individual customers’ preference. The results are used in numerous real-life applications, such as market analysis and product placement. Efficient processing of reverse rank queries is challenging because it needs to consider the combination on the given data set of user preferences and the data set of products. Currently, there are two typical reverse rank queries: Reverse top-k and reverse k-ranks. Both prefer top-ranking products and the most efficient algorithms for them have a common methodology that indexes and prunes the data set using R-trees. This kind of tree-based algorithms suffers the problem that their performance in high-dimensional data declines sharply while high-dimensional data are significant for real-life applications. In this paper, we propose an efficient scan algorithm, named Grid-index algorithm (GIR), for processing reverse rank queries efficiently. GIR algorithm uses an approximate values index to save computations in scanning and only requires a little memory cost. Our theoretical analysis guarantees the efficiency and the experimental results confirm that GIR has superior performance compared to tree-based methods in high-dimensional applications. CCS Concepts •Theory of computation → Database query processing and optimization (theory); c ©2017, Copyright is with the authors. Published in Proc. 20th International Conference on Extending Database Technology (EDBT), March 21-24, 2017 Venice, Italy: ISBN 978-3-89318-073-8, on OpenProceedings.org. Distribution of this paper is permitted under the terms of the Creative Commons license CC-by-nc-nd 4.0 Figure 1: Example for RTK and RKR queries. (a): the top2 cell phones appreciated by users. (b): the RT-2 of each phone. (c): the rank list and the R1-R of each phone.

[1]  John R. Smith,et al.  The onion technique: indexing for linear optimization queries , 2000, SIGMOD '00.

[2]  Xiang Lian,et al.  Monochromatic and bichromatic reverse skyline search over uncertain databases , 2008, SIGMOD Conference.

[3]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[4]  Christos Doulkeridis,et al.  Identifying the most influential data objects with reverse top-k queries , 2010, Proc. VLDB Endow..

[5]  Alex Thomo,et al.  Indexing Reverse Top-k Queries in Two Dimensions , 2013, DASFAA.

[6]  Christos Doulkeridis,et al.  Monochromatic and Bichromatic Reverse Top-k Queries , 2011, IEEE Transactions on Knowledge and Data Engineering.

[7]  Hiroyuki Kitagawa,et al.  Aggregate Reverse Rank Queries , 2016, DEXA.

[8]  Feifei Li,et al.  Reverse Furthest Neighbors in Spatial Databases , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[9]  Vagelis Hristidis,et al.  PREFER: a system for the efficient execution of multi-parametric ranked queries , 2001, SIGMOD '01.

[10]  S. Muthukrishnan,et al.  Influence sets based on reverse nearest neighbor queries , 2000, SIGMOD '00.

[11]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[12]  Dongxi Liu,et al.  Efficiently computing reverse k furthest neighbors , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[13]  Muhammad Aamir Cheema,et al.  SLICE: Reviving regions-based pruning for reverse k nearest neighbors queries , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[14]  Bernhard Seeger,et al.  Efficient Computation of Reverse Skyline Queries , 2007, VLDB.

[15]  Christos Faloutsos,et al.  Hilbert R-tree: An Improved R-tree using Fractals , 1994, VLDB.

[16]  J. Uspensky,et al.  Introduction to Mathematical Probability , 1938, Nature.

[17]  Christos Doulkeridis,et al.  Reverse top-k queries , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[18]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[19]  Christos Doulkeridis,et al.  Branch-and-bound algorithm for reverse top-k queries , 2013, SIGMOD '13.

[20]  Jianquan Liu,et al.  Indexing expensive functions for efficient multi-dimensional similarity search , 2010, Knowledge and Information Systems.

[21]  Christos Doulkeridis,et al.  Monitoring reverse top-k queries over mobile devices , 2011, MobiDE '11.

[22]  Zhao Zhang,et al.  Reverse k-Ranks Query , 2014, Proc. VLDB Endow..