New open source modules in KEEL to analyze and export fuzzy association rules

The KEEL software provides several methods to mine fuzzy association rules, however, few modules are available to study them. In this paper we introduce four new KEEL modules to analyze and export fuzzy association rules. Firstly, we present a module to calculate various interest measures for existing sets of fuzzy association rules in order to analyze rules on the basis of their potential interest to the user. Then, two visualization modules are proposed to generate different kinds of graphs from interest measures, making easier the analysis of the rules. Finally, we present a module to export obtained rules to the standard PMML in order to be able to take advantages of other software. The utility of these new modules is illustrated in a case study.

[1]  Tzung-Pei Hong,et al.  MOGA-based fuzzy data mining with taxonomy , 2013, Knowl. Based Syst..

[2]  Francisco Herrera,et al.  Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms , 2009, Fuzzy Sets Syst..

[3]  Tzung-Pei Hong,et al.  A GA-based Fuzzy Mining Approach to Achieve a Trade-off Between Number of Rules and Suitability of Membership Functions , 2006, Soft Comput..

[4]  Harold W. Thimbleby,et al.  Explaining code for publication , 2003, Softw. Pract. Exp..

[5]  Jesús Alcalá-Fdez,et al.  A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning , 2011, IEEE Transactions on Fuzzy Systems.

[6]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[7]  José M. Alonso,et al.  A Survey of Fuzzy Systems Software: Taxonomy, Current Research Trends, and Prospects , 2016, IEEE Transactions on Fuzzy Systems.

[8]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[9]  Edward H. Shortliffe,et al.  A model of inexact reasoning in medicine , 1990 .

[10]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[11]  Sridhar Ramaswamy,et al.  On the Discovery of Interesting Patterns in Association Rules , 1998, VLDB.

[12]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[13]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[14]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[15]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[16]  Daniel Sánchez,et al.  Fuzzy association rules: general model and applications , 2003, IEEE Trans. Fuzzy Syst..

[17]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[18]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[19]  Tzung-Pei Hong,et al.  Trade-off Between Computation Time and Number of Rules for Fuzzy Mining from Quantitative Data , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[20]  Jesús Alcalá-Fdez,et al.  Analysis of the Effectiveness of the Genetic Algorithms based on Extraction of Association Rules , 2010, Fundam. Informaticae.

[21]  Kwang-Il Ahn,et al.  Efficient Mining of Frequent Itemsets and a Measure of Interest for Association Rule Mining , 2004, J. Inf. Knowl. Manag..

[22]  María José del Jesús,et al.  KEEL: a software tool to assess evolutionary algorithms for data mining problems , 2008, Soft Comput..

[23]  Armando Blanco,et al.  Biomedical application of fuzzy association rules for identifying breast cancer biomarkers , 2012, Medical & Biological Engineering & Computing.

[24]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[25]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[26]  Eyke Hüllermeier,et al.  A systematic approach to the assessment of fuzzy association rules , 2006, Data Mining and Knowledge Discovery.