An Effective Framework for Skyline Queries Using Principal Component Analysis

Skyline operators are fascinating concepts that let the users extend and evolve a database system. SKY-MR+ algorithm is an efficient framework implemented for skyline operators and queries which uses quad-tree-based histogram, but faces serious limitations and provides inconsistent execution time especially for High datasets. In such cases, it also reports higher processing time with the increase of number of machines in the system. In this paper, an effective framework for skyline queries using principal component analysis (EFSQ-PCA) is proposed and developed which reduces the execution time for High datasets even in the cases of increase in number of machines in the system. The proposed mechanism finds the “Points in Region” using principal component analysis and this forms the base to increase the processing capabilities of skyline queries on various synthetic datasets. Experimental results show improvement in execution time of the proposed EFSQ-PCA as compared to current state of the art under different numbers of dimensions for dataset.

[1]  Hongjun Lu,et al.  Stabbing the sky: efficient skyline computation over sliding windows , 2005, 21st International Conference on Data Engineering (ICDE'05).

[2]  Jianliang Xu,et al.  Range-Based Skyline Queries in Mobile Environments , 2013, IEEE Transactions on Knowledge and Data Engineering.

[3]  Xiang Lian,et al.  Dynamic skyline queries in metric spaces , 2008, EDBT '08.

[4]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[5]  Bhupendra Verma,et al.  A Novel Approach to Classify High Dimensional Datasets Using Supervised Manifold Learning , 2011 .

[6]  Beng Chin Ooi,et al.  Efficient Progressive Skyline Computation , 2001, VLDB.

[7]  Bhupendra Verma,et al.  A New Hashing Scheme to Overcome the Problem of Overloading of Articles in Usenet , 2012 .

[8]  Donald Kossmann,et al.  Shooting Stars in the Sky: An Online Algorithm for Skyline Queries , 2002, VLDB.

[9]  Hector Garcia-Molina,et al.  One torus to rule them all: multi-dimensional queries in P2P systems , 2004, WebDB '04.

[10]  Ilaria Bartolini,et al.  Efficient sort-based skyline evaluation , 2008, TODS.

[11]  Heng Tao Shen,et al.  Multi-source Skyline Query Processing in Road Networks , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[12]  Beng Chin Ooi,et al.  Skyline Queries Against Mobile Lightweight Devices in MANETs , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[13]  Lei Zou,et al.  Dynamic Skyline Queries in Large Graphs , 2010, DASFAA.

[14]  Ke Gong,et al.  Energy-Efficient Skycube Query Processing in Wireless Sensor Networks , 2013 .

[15]  Gang Chen,et al.  Efficient Reverse Top-k Boolean Spatial Keyword Queries on Road Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[16]  Jan Chomicki,et al.  Skyline with presorting , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).