Mining fuzzy association rules from uncertain data

Association rule mining is an important data analysis method that can discover associations within data. There are numerous previous studies that focus on finding fuzzy association rules from precise and certain data. Unfortunately, real-world data tends to be uncertain due to human errors, instrument errors, recording errors, and so on. Therefore, a question arising immediately is how we can mine fuzzy association rules from uncertain data. To this end, this paper proposes a representation scheme to represent uncertain data. This representation is based on possibility distributions because the possibility theory establishes a close connection between the concepts of similarity and uncertainty, providing an excellent framework for handling uncertain data. Then, we develop an algorithm to mine fuzzy association rules from uncertain data represented by possibility distributions. Experimental results from the survey data show that the proposed approach can discover interesting and valuable patterns with high certainty.

[1]  Yen-Liang Chen,et al.  Discovering fuzzy time-interval sequential patterns in sequence databases , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  C. Ordonez,et al.  Constraining and summarizing association rules in medical data , 2006 .

[3]  Keun Ho Ryu,et al.  Mining association rules on significant rare data using relative support , 2003, J. Syst. Softw..

[4]  Yen-Liang Chen,et al.  Market basket analysis in a multiple store environment , 2005, Decis. Support Syst..

[5]  Francisco Herrera,et al.  Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms , 2009, Fuzzy Sets Syst..

[6]  Yi-Chung Hu,et al.  Discovering fuzzy association rules using fuzzy partition methods , 2003, Knowl. Based Syst..

[7]  Yen-Liang Chen,et al.  Mining association rules from imprecise ordinal data , 2008, Fuzzy Sets Syst..

[8]  Kyuseok Shim,et al.  Mining optimized support rules for numeric attributes , 2001, Inf. Syst..

[9]  Jacky Montmain,et al.  Propagation of uncertainty by the possibility theory in Choquet integral-based decision making: application to an E-commerce website choice support , 2006, IEEE Transactions on Instrumentation and Measurement.

[10]  Ping-Yu Hsu,et al.  Algorithms for mining association rules in bag databases , 2004, Inf. Sci..

[11]  Yi-Chung Hu,et al.  Elicitation of classification rules by fuzzy data mining , 2003 .

[12]  Yassine Djouadi,et al.  Mining Association Rules under Imprecision and Vagueness: towards a Possibilistic Approach , 2007, 2007 IEEE International Fuzzy Systems Conference.

[13]  Tzung-Pei Hong,et al.  Finding relevant attributes and membership functions , 1999, Fuzzy Sets Syst..

[14]  Arbee L. P. Chen,et al.  Efficient Graph-Based Algorithms for Discovering and Maintaining Association Rules in Large Databases , 2001, Knowledge and Information Systems.

[15]  Yen-Liang Chen,et al.  Mining generalized knowledge from ordered data through attribute-oriented induction techniques , 2005, Eur. J. Oper. Res..

[16]  Daniel Sánchez,et al.  Fuzzy association rules: general model and applications , 2003, IEEE Trans. Fuzzy Syst..

[17]  Mohand Boughanem,et al.  Qualitative pattern matching with linguistic terms , 2004, AI Commun..

[18]  Tzung-Pei Hong,et al.  A fuzzy AprioriTid mining algorithm with reduced computational time , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[19]  D. S. Yeung,et al.  Query fuzzy association rules in relational database , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[20]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[21]  Witold Pedrycz,et al.  Fuzzy set technology in knowledge discovery , 1998, Fuzzy Sets Syst..

[22]  Soon Myoung Chung,et al.  Efficient mining of maximal frequent itemsets from databases on a cluster of workstations , 2004, Knowledge and Information Systems.

[23]  Supriya Kumar De,et al.  Clustering web transactions using rough approximation , 2004, Fuzzy Sets Syst..

[24]  Tzung-Pei Hong,et al.  Fuzzy data mining for interesting generalized association rules , 2003, Fuzzy Sets Syst..

[25]  L. Zadeh Fuzzy sets as a basis for a theory of possibility , 1999 .

[26]  Soon Myoung Chung,et al.  Mining association rules using inverted hashing and pruning , 2002, Inf. Process. Lett..

[27]  Xindong Wu,et al.  Computing the minimum-support for mining frequent patterns , 2008, Knowledge and Information Systems.

[28]  Arthur P. Dempster,et al.  Upper and Lower Probabilities Induced by a Multivalued Mapping , 1967, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[29]  Wilfred Ng,et al.  An information-theoretic approach to quantitative association rule mining , 2008, Knowledge and Information Systems.

[30]  Kok-Leong Ong,et al.  Online mining of frequent sets in data streams with error guarantee , 2008, Knowledge and Information Systems.

[31]  Eyke Hüllermeier,et al.  Possibilistic instance-based learning , 2003, Artif. Intell..

[32]  Xindong Wu,et al.  Database classification for multi-database mining , 2005, Inf. Syst..

[33]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[34]  Yen-Liang Chen,et al.  A new approach for discovering fuzzy quantitative sequential patterns in sequence databases , 2006, Fuzzy Sets Syst..

[35]  Christoph F. Eick,et al.  Using clustering to learn distance functions for supervised similarity assessment , 2005, Eng. Appl. Artif. Intell..

[36]  Laure Berti-Équille,et al.  Data quality awareness: a case study for cost optimal association rule mining , 2007, Knowledge and Information Systems.

[37]  Hichem Maaref,et al.  New fusion methodology approach and application to mobile robotics: investigation in the framework of possibility theory , 2001, Inf. Fusion.

[38]  Yen-Liang Chen,et al.  A Sampling-Based Method for Mining Frequent Patterns from Databases , 2005, FSKD.

[39]  Jiawei Han,et al.  Discovery of Multiple-Level Association Rules from Large Databases , 1995, VLDB.

[40]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[41]  Didier Dubois,et al.  Possibility theory and statistical reasoning , 2006, Comput. Stat. Data Anal..

[42]  Korris Fu-Lai Chung,et al.  Knowledge and Information Systems , 2017 .

[43]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[44]  David Wai-Lok Cheung,et al.  Efficient Mining of Association Rules in Distributed Databases , 1996, IEEE Trans. Knowl. Data Eng..

[45]  Edward Hung,et al.  Mining Frequent Itemsets from Uncertain Data , 2007, PAKDD.

[46]  Tzung-Pei Hong,et al.  Induction of fuzzy rules and membership functions from training examples , 1996, Fuzzy Sets Syst..

[47]  Wilfred Ng,et al.  A survey on algorithms for mining frequent itemsets over data streams , 2008, Knowledge and Information Systems.

[48]  Hsi-Mei Hsu,et al.  Possibilistic programming in production planning of assemble-to-order environments , 2001, Fuzzy Sets Syst..

[49]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[50]  Henri Prade,et al.  Generalizing Database Relational Algebra for the Treatment of Incomplete/Uncertain Information and Vague Queries , 1984, Inf. Sci..

[51]  Aura Conci,et al.  Image mining by content , 2002, Expert Syst. Appl..

[52]  Henri Prade,et al.  Representation and combination of uncertainty with belief functions and possibility measures , 1988, Comput. Intell..

[53]  Mehmet Kaya,et al.  Determination of fuzzy logic membership functions using genetic algorithms , 2001, Fuzzy Sets Syst..

[54]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .