Efficient User Preferences-Based Top-k Skyline Using MapReduce

As an important variant of skyline query, top-k skyline can find the best k points as the final results. In order to generate the results meeting the needs of users for massive data, we propose an efficient user preferences-based top-k skyline combining partially and totally ordered domains in MapReduce, named \(P/T\_SKY\_MR\). The whole course contains two main phases, partially ordered domains processing and totally ordered domains processing. In partially ordered domains processing, we propose the binary encoding and a pruning strategy to present the precedence relationship about the partially ordered domains and different user preferences. Meanwhile, in totally ordered domains processing, for finding the final results, a defined ranking criterion is also proposed in order to reduce the calculation cost and minimize the response time. A large number of experiments show that our method is effective, flexible and scalable.

[1]  Mohammad Anisuzzaman Siddique,et al.  k-Dominant Skyline Query Computation in MapReduce Environment , 2015, IEICE Trans. Inf. Syst..

[2]  Djamal Benslimane,et al.  Selecting Skyline Web Services for Multiple Users Preferences , 2012, 2012 IEEE 19th International Conference on Web Services.

[3]  Baoyan Song,et al.  Efficient Top-k Skyline Computation in MapReduce , 2015, 2015 12th Web Information System and Application Conference (WISA).

[4]  Arbee L. P. Chen,et al.  Determining Top-K Candidates by Reverse Constrained Skyline Queries , 2015, DATA.

[5]  Mohammad Anisuzzaman Siddique,et al.  Efficient Selection of Various k-Objects for a Keyword Query Based on MapReduce Skyline Algorithm , 2014, DNIS.

[6]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[7]  Djamal Benslimane,et al.  Selecting Skyline Web Services from Uncertain QoS , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[8]  Seung-won Hwang,et al.  Supporting personalized top-k skyline queries using partial compressed skycube , 2007, WIDM '07.

[9]  Jinli Cao,et al.  Preference-Based Top-k Representative Skyline Queries on Uncertain Databases , 2015, PAKDD.

[10]  Yunjun Gao,et al.  Finding the Most Desirable Skyline Objects , 2010, DASFAA.

[11]  Marie-Odile Cordier,et al.  Computing Skyline Incrementally in Response to Online Preference Modification , 2013, Trans. Large Scale Data Knowl. Centered Syst..

[12]  Bin Zhang,et al.  Incremental evaluation of top-k combinatorial metric skyline query , 2015, Knowledge-Based Systems.

[13]  Zhiyang Li,et al.  Skyline Query Based on User Preference with MapReduce , 2014, 2014 IEEE 12th International Conference on Dependable, Autonomic and Secure Computing.

[14]  Xiaoyong Du,et al.  Extract Interesting Skyline Points in High Dimension , 2010, DASFAA.

[15]  Seung-won Hwang,et al.  Personalized top-k skyline queries in high-dimensional space , 2009, Inf. Syst..

[16]  Mohammad Anisuzzaman Siddique,et al.  Selecting Representative Objects from Large Database by Using K-Skyband and Top-k Dominating Queries in MapReduce Environment , 2014, ADMA.