Answering why-not questions on KNN queries

Being decades of study, the usability of database systems have received more attention in recent years. Now it is especially able to explain missing objects in a query result, which is called “why-not” questions, and is the focus of concern. This paper studies the problem of answering why-not questions on KNN queries. In our real life, many users would like to use KNN queries to investigate the surrounding circumstances. Nevertheless, they often feel disappointed when finding the result not including their expected objects. In this paper, we use the query refinement approach to resolve the problem. Given the original KNN query and a set of missing objects as input, our algorithm offer a refined KNN query that includes the missing objects to the user. The experimental results demonstrate the efficiency of our proposed optimizations and algorithms.

[1]  Melanie Herschel,et al.  Explaining missing answers to SPJUA queries , 2010, Proc. VLDB Endow..

[2]  Chengfei Liu,et al.  On answering why-not questions in reverse skyline queries , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[3]  Adriane Chapman,et al.  Making database systems usable , 2007, SIGMOD '07.

[4]  Quoc Trung Tran,et al.  How to ConQueR why-not questions , 2010, SIGMOD Conference.

[5]  Eric Lo,et al.  Answering Why-Not Questions on Top-K Queries , 2012, IEEE Transactions on Knowledge and Data Engineering.

[6]  Gang Chen,et al.  Answering why-not questions on metric probabilistic range queries , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[7]  Gang Chen,et al.  Answering Why-not Questions on Reverse Top-k Queries , 2015, Proc. VLDB Endow..

[8]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[9]  Brad A. Myers,et al.  Designing the whyline: a debugging interface for asking questions about program behavior , 2004, CHI.

[10]  Brad A. Myers,et al.  Answering why and why not questions in user interfaces , 2006, CHI.

[11]  Christian Böhm,et al.  A cost model for query processing in high dimensional data spaces , 2000, TODS.

[12]  Jeffrey F. Naughton,et al.  On the provenance of non-answers to queries over extracted data , 2008, Proc. VLDB Endow..

[13]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.