Genetic algorithm with a structure-based representation for genetic-fuzzy data mining

Mining association rules is an important data mining technology aiming to find the relationship among items in the databases. Genetic-fuzzy data mining uses evolutionary algorithm, such as genetic algorithm (GA), to optimize the membership functions for mining fuzzy association rules, and has received considerable success. The increase in data, especially in big data analytics, poses serious challenges to GA in the effectiveness and efficiency of finding appropriate membership functions. This study proposes a GA for enhancing genetic-fuzzy mining of association rules. First, we design a novel chromosome representation considering the structures of membership functions. The representation facilitates arrangement of membership functions. Second, this study presents two heuristics in the light of overlap and coverage for removing inappropriate arrangement. A series of experiments is conducted to examine the proposed GA on different amounts of transactions. The experimental results show that GA benefits from the proposed representation. The two heuristics help to explore the structures of membership functions and achieve significant improvement on GA in terms of solution quality and convergence speed. The satisfactory outcomes validate the high capability of the proposed GA in genetic-fuzzy mining of association rules.

[1]  Man Hon Wong,et al.  Mining fuzzy association rules in databases , 1998, SGMD.

[2]  King Lun Choy,et al.  A RFID-based recursive process mining system for quality assurance in the garment industry , 2014 .

[3]  Debahuti Mishra,et al.  Particle swarm optimization based fuzzy frequent pattern mining from gene expression data , 2011, 2011 2nd International Conference on Computer and Communication Technology (ICCCT-2011).

[4]  Behrouz Minaei-Bidgoli,et al.  Multi objective association rule mining with genetic algorithm without specifying minimum support and minimum confidence , 2011, Expert Syst. Appl..

[5]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[6]  Tzung-Pei Hong,et al.  Mining association rules from quantitative data , 1999, Intell. Data Anal..

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  George T. S. Ho,et al.  A slippery genetic algorithm-based process mining system for achieving better quality assurance in the garment industry , 2016, Expert Syst. Appl..

[9]  Venansius Baryamureeba,et al.  Extraction of interesting association rules using genetic algorithms , 2007 .

[10]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[11]  Tzung-Pei Hong,et al.  Induction of fuzzy rules and membership functions from training examples , 1996, Fuzzy Sets Syst..

[12]  Francisco Herrera,et al.  A Review of the Application of Multiobjective Evolutionary Fuzzy Systems: Current Status and Further Directions , 2013, IEEE Transactions on Fuzzy Systems.

[13]  Claire Cardie,et al.  Constrained K-means Clustering with Background Knowledge , 2001, ICML.

[14]  Yeong-Chyi Lee,et al.  fuzzy coherent rule mining algorithm , 2013 .

[15]  Francisco Herrera,et al.  Fuzzy connectives based crossover operators to model genetic algorithms population diversity , 1997, Fuzzy Sets Syst..

[16]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[17]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[18]  Wynne Hsu,et al.  Analyzing the Subjective Interestingness of Association Rules , 2000, IEEE Intell. Syst..

[19]  Shaozi Li,et al.  Mining Fuzzy Association Rules by Using Nonlinear Particle Swarm Optimization , 2010 .

[20]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[21]  Keith C. C. Chan,et al.  Mining fuzzy association rules , 1997, CIKM '97.

[22]  Dan Meng,et al.  Extracting linguistic rules from data sets using fuzzy logic and genetic algorithms , 2012, Neurocomputing.

[23]  Geoffrey I. Webb,et al.  Advances in Knowledge Discovery and Data Mining , 2018, Lecture Notes in Computer Science.

[24]  Behrouz Minaei-Bidgoli,et al.  Deriving support threshold values and membership functions using the multiple-level cluster-based master–slave IFG approach , 2013, Soft Comput..

[25]  Jesús Alcalá-Fdez,et al.  A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning , 2011, IEEE Transactions on Fuzzy Systems.

[26]  Tzung-Pei Hong,et al.  A GA-based Fuzzy Mining Approach to Achieve a Trade-off Between Number of Rules and Suitability of Membership Functions , 2006, Soft Comput..

[27]  V. Srividhya,et al.  Genetic Fuzzy Data Mining With Divide-And- Conquer Strategy , 2011 .

[28]  Filip Rudzinski,et al.  A multi-objective genetic optimization of interpretability-oriented fuzzy rule-based classifiers , 2016, Appl. Soft Comput..

[29]  Michela Antonelli,et al.  A fast and efficient multi-objective evolutionary learning scheme for fuzzy rule-based classifiers , 2014, Inf. Sci..

[30]  Evangelos Simoudis,et al.  An Overview of Issues in Developing Industrial Data Mining and Knowledge Discovery Applications , 1996, KDD.

[31]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[32]  Wang Linshuang Cluster-based evaluation in fuzzy-genetic data mining , 2010 .

[33]  Behrouz Minaei-Bidgoli,et al.  Mining numerical association rules via multi-objective genetic algorithms , 2013, Inf. Sci..