Mining over a Reliable Evidential Database: Application on Amphiphilic Chemical Database

In recent years, the mining of frequent itemsets from uncertain databases has attracted much attention. Several researches have been conducted using different uncertain frameworks as probabilities, fuzzy sets and, most recently, evidence theory. There is very little study paid to mining pertinent knowledge from data where reliability is questionable. In this paper, we study and extend the evidential database framework in accounting data reliability. We propose new measures of support and confidence under uncertainty that consider the reliability and extend the state-of-the-art works. The proposed framework is thoroughly experimented on a real case problem for developing classification model from a chemical database.

[1]  Suk Kyoon Lee,et al.  Imprecise and uncertain information in databases: an evidential approach , 1992, [1992] Eighth International Conference on Data Engineering.

[2]  Mei-Ling Shyu,et al.  Rule Mining and Classification in a Situation Assessment Application: A Belief-Theoretic Approach for Handling Data Imperfections , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Arie Tzvieli Possibility theory: An approach to computerized processing of uncertainty , 1990, J. Am. Soc. Inf. Sci..

[4]  Arthur P. Dempster,et al.  Upper and Lower Probabilities Induced by a Multivalued Mapping , 1967, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[5]  Sadok Ben Yahia,et al.  Classification with Evidential Associative Rules , 2014, IPMU.

[6]  Thierry Denoeux,et al.  ECM: An evidential version of the fuzzy c , 2008, Pattern Recognit..

[7]  Anne Laurent,et al.  Extracting compact and information lossless sets of fuzzy association rules , 2011, Fuzzy Sets Syst..

[8]  Charu C. Aggarwal,et al.  Managing and Mining Uncertain Data , 2009, Advances in Database Systems.

[9]  Edward Hung,et al.  Mining Frequent Itemsets from Uncertain Data , 2007, PAKDD.

[10]  Philip S. Yu,et al.  Mining Frequent Itemsets over Uncertain Databases , 2012, Proc. VLDB Endow..

[11]  M. H. Margahny,et al.  FAST ALGORITHM FOR MINING ASSOCIATION RULES , 2014 .

[12]  Charu C. Aggarwal,et al.  Managing and Mining Graph Data , 2010, Managing and Mining Graph Data.

[13]  Sadok Ben Yahia,et al.  Evidential Database: A New Generalization of Databases? , 2014, Belief Functions.

[14]  Ahmed K. Elmagarmid,et al.  Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.