FARM: a data mining system for discovering fuzzy association rules

In this paper, we introduce a novel technique, called FARM, for mining fuzzy association rules. FARM employs linguistic terms to represent the revealed regularities and exceptions. The linguistic representation is especially useful when those rules discovered are presented to human experts for examination because of the affinity with the human knowledge representations. The definition of linguistic terms is based on fuzzy set theory and hence we call the rules having these terms fuzzy association rules. The use of fuzzy technique makes FARM resilient to noises such as inaccuracies in physical measurements of real-life entities and missing values in the databases. Furthermore, FARM utilizes adjusted difference analysis which has the advantage that it does not require any user-supplied thresholds which are often hard to determine. In addition to this interestingness measure, FARM has another unique feature that the conclusions of a fuzzy association rule can contain linguistic terms. Our technique also provides a mechanism to allow quantitative values be inferred from fuzzy association rules. Unlike other data mining techniques that can only discover association rules between different discretized values, FARM is able to reveal interesting relationships between different quantitative values. Our experimental results showed that FARM is capable of discovering meaningful and useful fuzzy association rules in an effective manner from a real-life database.

[1]  Doheon Lee,et al.  Database summarization using fuzzy ISA hierarchies , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[2]  Keith C. C. Chan,et al.  An effective algorithm for discovering fuzzy rules in relational databases , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[3]  Keith C. C. Chan,et al.  Mining fuzzy association rules , 1997, CIKM '97.

[4]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[5]  Andrew K. C. Wong,et al.  Statistical Technique for Extracting Classificatory Knowledge from Databases , 1991, Knowledge Discovery in Databases.

[6]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[7]  Ronald R. Yager,et al.  On Linguistic Summaries of Data , 1991, Knowledge Discovery in Databases.

[8]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[9]  Keith C. C. Chan,et al.  An effective algorithm for mining interesting quantitative association rules , 1997, SAC '97.

[10]  Jörg Rech,et al.  Knowledge Discovery in Databases , 2001, Künstliche Intell..

[11]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[12]  W. Pedrycz,et al.  Linguistic data mining and fuzzy modelling , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[13]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.