Efficient Skyline Maintenance over Frequently Updated Evidential Databases

In many recent applications, data are intrinsically uncertain, noisy and error-prone. That is why, uncertain database management has attracted the attention of several researchers. Data uncertainty can be modeled in the evidence theory setting. On the other hand, skyline analysis is a powerful tool in a wide spectrum of real applications involving multi-criteria optimal decision making. It relies on Pareto dominance relationship. However, the skyline maintenance is not an easy task when the queried database is updated. This paper addresses the problem of the maintenance of the skyline objects of frequently updated evidential databases. In particular, we propose algorithms for maintaining evidential skyline in the case of object insertion or deletion. Extensive experiments are conducted to demonstrate the efficiency and scalability of our proposal.

[1]  Bin Jiang,et al.  Probabilistic Skylines on Uncertain Data , 2007, VLDB.

[2]  Ömer Egecioglu,et al.  DeltaSky: Optimal Maintenance of Skyline Deletions without Exclusive Dominance Region Generation , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[3]  David A. Bell,et al.  Generalized Union and Project Operations for Pooling Uncertain and Imprecise Information , 1996, Data Knowl. Eng..

[4]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[5]  Seung-won Hwang,et al.  Skyline ranking for uncertain data with maybe confidence , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[6]  Bin Jiang,et al.  Probabilistic skylines on uncertain data: model and bounding-pruning-refining methods , 2010, Journal of Intelligent Information Systems.

[7]  Allel HadjAli,et al.  Computing Skyline from Evidential Data , 2014, SUM.

[8]  Arthur P. Dempster,et al.  A Generalization of Bayesian Inference , 1968, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[9]  Mohamed Anis Bach Tobji,et al.  Maintaining Evidential Frequent Itemsets in Case of Data Deletion , 2010, IPMU.

[10]  Yunhao Liu,et al.  Underground coal mine monitoring with wireless sensor networks , 2009, TOSN.

[11]  Yi Yang,et al.  Personal health indexing based on medical examinations: A data mining approach , 2016, Decis. Support Syst..

[12]  Patrick Bosc,et al.  Modeling and Querying Uncertain Relational Databases: a Survey of Approaches Based on the Possible Worlds Semantics , 2010, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[13]  Philip S. Yu,et al.  A Survey of Uncertain Data Algorithms and Applications , 2009, IEEE Transactions on Knowledge and Data Engineering.

[14]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[15]  Tian Xia,et al.  Refreshing the sky: the compressed skycube with efficient support for frequent updates , 2006, SIGMOD Conference.

[16]  Anthony K. H. Tung,et al.  Minimizing the communication cost for continuous skyline maintenance , 2009, SIGMOD Conference.

[17]  Suk Kyoon Lee,et al.  Imprecise and uncertain information in databases: an evidential approach , 1992, [1992] Eighth International Conference on Data Engineering.

[18]  Christian Böhm,et al.  The Gauss-Tree: Efficient Object Identification in Databases of Probabilistic Feature Vectors , 2006, 22nd International Conference on Data Engineering (ICDE'06).