An efficient algorithm for skyline queries in cloud computing environments

Skyline query processing has recently received a lot of attention in database and data mining communities. However, most existing algorithms consider how to efficiently process skyline queries from base tables. Obviously, when the data size and the number of skyline queries increase, the time cost of skyline queries will increase exponentially, which will seriously influence the query efficiency. Motivated by the above, in this paper, we consider improving the query efficiency via skyline views and propose a cost-based algorithm (abbr. CA) to efficiently select the optimal set of skyline views for storage. The CA algorithm mainly includes two phases: (i) reduce the skyline views selection to the minimum steiner tree problem and obtain the approximate optimal set AOS of skyline views, and (ii) adjust AOS and produce the final optimal set FOS of skyline views based on the simulated annealing. Moreover, in order to improve the extendibility of the CA algorithm, we implement it based on the map/reduce distributed computation model in cloud computing environments. The detailed theoretical analyses and extensive experiments demonstrate that the CA algorithm is both efficient and effective.

[1]  Christos Doulkeridis,et al.  APSkyline: Improved Skyline Computation for Multicore Architectures , 2014, DASFAA.

[2]  Yong Tang,et al.  Efficient Processing of the SkyEXP Query Over Big Data , 2017, GSKI.

[3]  Wei Wang,et al.  Efficient mining of skyline objects in subspaces over data streams , 2010, Knowledge and Information Systems.

[4]  Carsten Thomassen The even cycle problem for directed graphs , 1992 .

[5]  De-Shuang Huang,et al.  Pupylation sites prediction with ensemble classification model , 2017, Int. J. Data Min. Bioinform..

[6]  Jianzhong Li,et al.  SEPT: an efficient skyline join algorithm on massive data , 2014, Knowledge and Information Systems.

[7]  Christos Doulkeridis,et al.  Efficient skyline query processing in SpatialHadoop , 2015, Inf. Syst..

[8]  Yang Xiang,et al.  A clustering based approach for skyline diversity , 2011, Expert Syst. Appl..

[9]  Md. Arifuzzaman,et al.  Moisture damage evaluation in SBS and lime modified asphalt using AFM and artificial intelligence , 2015, Neural Computing and Applications.

[10]  Asif Zaman,et al.  Friend Recommendation by Using Skyline Query and Location Information , 2016 .

[11]  Hamidah Ibrahim,et al.  A Model for Processing Skyline Queries over a Database with Missing Data , 2015 .

[12]  Yu Liu,et al.  Efficient ELM-Based Two Stages Query Processing Optimization for Big Data , 2015 .

[13]  Minghao Yin,et al.  GRASP for connected dominating set problems , 2017, Neural Computing and Applications.

[14]  Chiang Lee,et al.  Neural skyline filter for accelerating skyline search algorithms , 2015, Expert Syst. J. Knowl. Eng..