Association Rule and Decision Tree based Methods for Fuzzy Rule Base Generation

This paper focuses on the data-driven generation of fuzzy IF...THEN rules. The resulting fuzzy rule base can be applied to build a classifier or a predictive model, or to form a decision support system. Among the wide range of possible approaches, decision tree and association rule based algorithms are reviewed, and two new approaches are presented that rely on an a priori partitioning of the continuous input variables by fuzzy clustering. An application study is also presented, in which the developed methods are tested on the well-known Wisconsin Breast Cancer classification problem.
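
To make the notion of a data-driven fuzzy IF...THEN rule concrete, the following minimal sketch scores one candidate rule on a toy data set. It is an illustration under stated assumptions, not the method of the paper: the triangular fuzzy set is picked by hand (the paper derives its partitions by fuzzy clustering), the support and confidence definitions are the commonly used mean-of-memberships forms, and every name and data value is hypothetical.

```python
from typing import Callable, List, Tuple


def triangular(a: float, b: float, c: float) -> Callable[[float], float]:
    """Triangular membership function with feet at a and c and peak at b (a < b < c)."""
    def mu(x: float) -> float:
        if x <= a or x >= c:
            return 0.0
        if x <= b:
            return (x - a) / (b - a)
        return (c - x) / (c - b)
    return mu


def rule_support_confidence(
    records: List[Tuple[float, str]],
    antecedent: Callable[[float], float],
    target_class: str,
) -> Tuple[float, float]:
    """Score the rule 'IF x is A THEN class is target_class'.

    Assumed definitions: the support of a fuzzy item is the mean membership
    degree over all records; the class label is crisp, so it contributes a
    factor of 1 or 0; confidence is rule support over antecedent support.
    """
    n = len(records)
    antecedent_sum = sum(antecedent(x) for x, _ in records)
    rule_sum = sum(antecedent(x) for x, label in records if label == target_class)
    support = rule_sum / n
    confidence = rule_sum / antecedent_sum if antecedent_sum > 0 else 0.0
    return support, confidence


# Hypothetical toy data: one continuous feature and a crisp class label.
toy_records = [(1.0, "benign"), (2.0, "benign"), (4.0, "malignant"), (5.0, "malignant")]

# Hand-picked fuzzy set "SMALL"; a clustering step would normally supply this.
small = triangular(0.0, 1.0, 5.0)

support, confidence = rule_support_confidence(toy_records, small, "benign")
print(f"IF x is SMALL THEN benign: support={support:.4f}, confidence={confidence:.4f}")
```

A rule miner would keep only candidate rules whose support and confidence exceed user-chosen thresholds. A decision tree based method would instead select the input variable and fuzzy set to split on at each node.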
