Efficient mining product-based fuzzy association rules through central limit theorem

Abstract In this study, we propose a fast algorithm to form product-based fuzzy association rules from large quantitative dataset, which reduces data size and ensures the quality of the obtained results. A method is designed to transform mining of fuzzy association rules to the binary counterpart. It is shown that the final results are not affected by this transformation. Then, an efficient sampling method is developed, where a sample is taken to replace the original large dataset, so the size of the dataset is reduced and the cost of scanning is also decreased. Through the central limit theorem, the size of sample can be set reasonably, so the deviation of support of any fuzzy itemset caused by sampling is limited in a small range with a high probability. Through a series of experiments, we show the advantages of the approach both the speed of the proposed algorithm and its reliability.

[1]  S. G. Deshmukh,et al.  A new approach for evaluating agility in supply chains using Fuzzy Association Rules Mining , 2008, Eng. Appl. Artif. Intell..

[2]  Francisco Herrera,et al.  NICGAR: A Niching Genetic Algorithm to mine a diverse set of interesting quantitative association rules , 2016, Inf. Sci..

[3]  Mario A. Góngora,et al.  Web usage mining with evolutionary extraction of temporal fuzzy association rules , 2013, Knowl. Based Syst..

[4]  Abdolreza Mirzaei,et al.  Intrusion detection using fuzzy association rules , 2009, Appl. Soft Comput..

[5]  Thomas Sudkamp,et al.  Examples, counterexamples, and measuring fuzzy associations , 2005, Fuzzy Sets Syst..

[6]  Vikram Pudi,et al.  Fuzzy association rule mining algorithm for fast and efficient performance on very large datasets , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[7]  Eli Upfal,et al.  Efficient Discovery of Association Rules and Frequent Itemsets through Sampling with Tight Performance Guarantees , 2011, TKDD.

[8]  Tzung-Pei Hong,et al.  A fuzzy AprioriTid mining algorithm with reduced computational time , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[9]  Jianer Chen,et al.  Application-oriented purely semantic precision and recall for ontology mapping evaluation , 2008, Knowl. Based Syst..

[10]  Ming-Syan Chen,et al.  Progressive Sampling for Association Rules Based on Sampling Error Estimation , 2005, PAKDD.

[11]  Ruqian Lu,et al.  Sampling Ensembles for Frequent Patterns , 2005, FSKD.

[12]  János Abonyi,et al.  Compact fuzzy association rule-based classifier , 2008, Expert Syst. Appl..

[13]  Tzung-Pei Hong,et al.  Mining association rules from quantitative data , 1999, Intell. Data Anal..

[14]  C. Alsina On a family of connectives for fuzzy sets , 1985 .

[15]  Nikolai F. Rulkov,et al.  Online Decorrelation of Humidity and Temperature in Chemical Sensors for Continuous Monitoring , 2016, ArXiv.

[16]  Witold Pedrycz,et al.  Mining axiomatic fuzzy set association rules for classification problems , 2012, Eur. J. Oper. Res..

[17]  Tzung-Pei Hong,et al.  A fast Algorithm for mining fuzzy frequent itemsets , 2015, J. Intell. Fuzzy Syst..

[18]  Man Hon Wong,et al.  Mining fuzzy association rules in databases , 1998, SGMD.

[19]  Francisco Herrera,et al.  QAR-CIP-NSGA-II: A new multi-objective evolutionary algorithm to mine quantitative association rules , 2014, Inf. Sci..

[20]  H. Ishibuchi,et al.  Fuzzy association rules for handling continuous attributes , 2001, ISIE 2001. 2001 IEEE International Symposium on Industrial Electronics Proceedings (Cat. No.01TH8570).

[21]  Yi-Chung Hu,et al.  Discovering fuzzy association rules using fuzzy partition methods , 2003, Knowl. Based Syst..

[22]  Tzung-Pei Hong,et al.  A fuzzy AprioriTid mining algorithm with reduced computational time , 2004, Appl. Soft Comput..

[23]  Yanchun Zhang,et al.  Optimized fuzzy association rule mining for quantitative data , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[24]  Tzung-Pei Hong,et al.  An UBMFFP Tree for Mining Multiple Fuzzy Frequent Itemsets , 2015, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[25]  Jie Dong,et al.  BitTableFI: An efficient mining frequent itemsets algorithm , 2007, Knowl. Based Syst..

[26]  Daniel Sánchez,et al.  Fuzzy frameworks for mining data associations: fuzzy association rules and beyond , 2016, WIREs Data Mining Knowl. Discov..

[27]  Hannu Toivonen,et al.  Sampling Large Databases for Association Rules , 1996, VLDB.

[28]  Witold Pedrycz,et al.  Efficient frequent itemsets mining through sampling and information granulation , 2017, Eng. Appl. Artif. Intell..

[29]  Tzung-Pei Hong,et al.  An Effective Gradual Data-Reduction Strategy for Fuzzy Itemset Mining , 2013 .

[30]  Eyke Hüllermeier,et al.  In Defense of Fuzzy Association Analysis , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[31]  Raj P. Gopalan,et al.  Effective Sampling for Mining Association Rules , 2004, Australian Conference on Artificial Intelligence.

[32]  Yen-Liang Chen,et al.  Mining fuzzy association rules from questionnaire data , 2009, Knowl. Based Syst..

[33]  Vikram Pudi,et al.  FAR-miner: a fast and efficient algorithm for fuzzy association rule mining , 2012, Int. J. Bus. Intell. Data Min..

[34]  O. Kosheleva,et al.  Why Trapezoidal and Triangular Membership Functions Work So Well: Towards a Theoretical Explanation , 2014 .

[35]  Eyke Hüllermeier,et al.  A systematic approach to the assessment of fuzzy association rules , 2006, Data Mining and Knowledge Discovery.

[36]  W. Pedrycz Why triangular membership functions , 1994 .

[37]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[38]  Tzung-Pei Hong,et al.  An Efficient Tree-based Fuzzy Data Mining Approach , 2010 .

[39]  Han Liu,et al.  Rule-based systems: a granular computing perspective , 2016, Granular Computing.

[40]  Tzung-Pei Hong,et al.  Linguistic data mining with fuzzy FP-trees , 2010, Expert Syst. Appl..

[41]  Tzung-Pei Hong,et al.  A CMFFP-tree algorithm to mine complete multiple fuzzy frequent itemsets , 2015, Appl. Soft Comput..

[42]  Tzung-Pei Hong,et al.  A Survey of Fuzzy Data Mining Techniques , 2016 .

[43]  Tzung-Pei Hong,et al.  THE MFFP‐TREE FUZZY MINING ALGORITHM TO DISCOVER COMPLETE LINGUISTIC FREQUENT ITEMSETS , 2014, Comput. Intell..

[44]  Daniel Sánchez,et al.  Fuzzy association rules: general model and applications , 2003, IEEE Trans. Fuzzy Syst..

[45]  Cai-Yan Jia,et al.  Multi-Scaling Sampling: An Adaptive Sampling Method for Discovering Approximate Association Rules , 2005, Journal of Computer Science and Technology.

[46]  Hong Chen,et al.  FARP: Mining fuzzy association rules from a probabilistic quantitative database , 2013, Inf. Sci..

[47]  Eli Upfal,et al.  Mining Frequent Itemsets through Progressive Sampling with Rademacher Averages , 2015, KDD.

[48]  Tzung-Pei Hong,et al.  Mining fuzzy frequent itemsets based on UBFFP trees , 2014, J. Intell. Fuzzy Syst..

[49]  Susan M. Bridges,et al.  Mining fuzzy association rules and fuzzy frequency episodes for intrusion detection , 2000 .

[50]  Martine De Cock,et al.  Fuzzy versus quantitative association rules: a fair data-driven comparison , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[51]  Yi-Chung Hu,et al.  Determining membership functions and minimum fuzzy support in finding fuzzy association rules for classification problems , 2006, Knowledge-Based Systems.

[52]  Sadique Sheik,et al.  Reservoir computing compensates slow response of chemosensor arrays exposed to fast varying gas concentrations in continuous monitoring , 2015 .

[53]  Ming-Syan Chen,et al.  Power-law relationship and self-similarity in the itemset support distribution: analysis and applications , 2008, The VLDB Journal.

[54]  Pierre Baldi,et al.  Parameterized neural networks for high-energy physics , 2016, The European Physical Journal C.

[55]  Srinivasan Parthasarathy,et al.  Efficient progressive sampling for association rules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[56]  P. Baldi,et al.  Searching for exotic particles in high-energy physics with deep learning , 2014, Nature Communications.

[57]  Yogish Sabharwal,et al.  Analysis of sampling techniques for association rule mining , 2009, ICDT '09.