Fuzzification and reduction of information-theoretic rule sets

If-then rules are one of the most common forms of knowledge discovered by data mining methods. The number and the length of extracted rules tend to increase with the size of a database, making the rulesets less interpretable and useful. Existing methods of extracting fuzzy rules from numerical data improve the interpretability aspect, but the dimensionality of fuzzy rulesets remains high. In this paper, we present a new methodology for reducing the dimensionality of rulesets discovered in data. Our method builds upon the information-theoretic fuzzy approach to knowledge discovery. We start with constructing an information-theoretic network from a data table and extracting a set of association rules based on the network connections. The set of information-theoretic rules is fuzzified and significantly reduced by using the principles of the Computational Theory of Perception (CTP). We demonstrate the method on a real-world database from semiconductor industry.

[1]  Abraham Kandel,et al.  Applying Fuzzy Hypothesis Testing to Medical Data , 1999, RSFDGrC.

[2]  A. Krone,et al.  A hybrid evolutionary search concept for data-based generation of relevant fuzzy rules in high dimensional spaces , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[3]  Oded Maimon,et al.  Information-Theoretic Fuzzy Approach to Knowledge Discovery in Databases , 1999 .

[4]  Li-Xin Wang,et al.  A Course In Fuzzy Systems and Control , 1996 .

[5]  Dale Borowiak,et al.  Linear Models, Least Squares and Alternatives , 2001, Technometrics.

[6]  Sujeet Shenoi,et al.  Multilevel database security using information clouding , 1993, [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.

[7]  Wai-Ho Au,et al.  FARM: a data mining system for discovering fuzzy association rules , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  A. Kandel,et al.  Automated perceptions in data mining , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[10]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[11]  Lotfi A. Zadeh A New Direction in System Analysis: From Computation with Measurements to Computation with Perceptions (Abstract) , 1999, RSFDGrC.

[12]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[13]  Jerry M. Mendel,et al.  Generating fuzzy rules by learning from examples , 1992, IEEE Trans. Syst. Man Cybern..

[14]  M. Shaw,et al.  Induction of fuzzy decision trees , 1995 .

[15]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[16]  Rajkumar Roy,et al.  Advances in Soft Computing: Engineering Design and Manufacturing , 1998 .

[17]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[18]  Ramakrishnan Srikant,et al.  The Quest Data Mining System , 1996, KDD.

[19]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[20]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .