论文信息 - Using General Impressions to Analyze Discovered Classification Rules

Using General Impressions to Analyze Discovered Classification Rules

One of the important problems in data mining is the evaluation of subjective interestingness of the discovered rules. Past research has found that in many real-life applications it is easy to generate a large number of rules from the database, but most of the rules are not useful or interesting to the user. Due to the large number of rules, it is difficult for the user to analyze them manually in order to identify those interesting ones. Whether a rule is of interest to a user depends on his/her existing knowledge of the domain, and his/her interests. In this paper, we propose a technique that analyzes the discovered rules against a specific type of existing knowledge, which we call general impressions, to help the user identify interesting rules. We first propose a representation language to allow general impressions to be specified. We then present some algorithms to analyze the discovered classification rules against a set of general impressions. The results of the analysis tell us which rules conform to the general impressions and which rules are unexpected. Unexpected rules are by definition interesting.

[1] Wynne Hsu,et al. Post-Analysis of Learned Rules , 1996, AAAI/IAAI, Vol. 1.

[2] Heikki Mannila,et al. Finding interesting rules from large sets of discovered association rules , 1994, CIKM '94.

[3] Fred S. Roberts,et al. Applied Combinatorics , 1984 .

[4] Rajjan Shinghal,et al. Evaluating the Interestingness of Characteristic Rules , 1996, KDD.

[5] Douglas H. Fisher,et al. Overcoming process delays with decision tree induction , 1994, IEEE Expert.

[6] Abraham Silberschatz,et al. What Makes Patterns Interesting in Knowledge Discovery Systems , 1996, IEEE Trans. Knowl. Data Eng..

[7] W. Hsu,et al. Discovering Conforming and Unexpected Classification Rules , 1997 .

[8] Padhraic Smyth,et al. From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[9] Stan Matwin,et al. Using Qualitative Models to Guide Inductive Learning , 1993, ICML.

[10] Julio Ortega,et al. Flexibly Exploiting Prior Knowledge in Empirical Learning , 1995, IJCAI.

[11] Gregory Piatetsky-Shapiro,et al. KDD-93: Progress and Challenges in Knowledge Discovery in Databases , 1994, AI Mag..

[12] John H. Boose,et al. A survey of knowledge acquisition techniques and tools , 1993 .

[13] Gregory Piatetsky-Shapiro,et al. The interestingness of deviations , 1994 .

[14] Wynne Hsu,et al. Discovering Interesting Holes in Data , 1997, IJCAI.