This paper presents an autonomous algorithm for discovering exception rules from data sets. An exception rule, which is defined as a deviational pattern to a well-known fact, exhibits unexpectedness and is sometimes extremely useful in spite of its obscurity. Previous discovery approaches for this type of knowledge have neglected the problem of evaluating the reliability of the rules extracted from a data set. It is clear, however, that this question is mandatory in distinguishing knowledge from unreliable patterns without annoying the users. In order to circumvent these difficulties we propose a probabilistic estimation approach in which we obtain an exception rule associated with a common sense rule in the form of a ride pair. Our approach discovers, based on the normal approximations of the multinomial distributions, rule pairs which satisfy, with high confidence, all the specified conditions. The time efficiency of the discovery process is improved by the newly-derived stopping criteria. PEDRE, which is a data mining system based on our approach, has been validated using the benchmark data sets in the machine learning community.
[1]
Heikki Mannila,et al.
Efficient Algorithms for Discovering Association Rules
,
1994,
KDD Workshop.
[2]
Padhraic Smyth,et al.
An Information Theoretic Approach to Rule Induction from Databases
,
1992,
IEEE Trans. Knowl. Data Eng..
[3]
Arno Siebes.
Homogeneous Discoveries Contain No Surprises: Inferring Risk Profiles from Large Databases
,
1994,
KDD Workshop.
[4]
Heikki Mannila,et al.
Fast Discovery of Association Rules
,
1996,
Advances in Knowledge Discovery and Data Mining.
[5]
Gregory Piatetsky-Shapiro,et al.
The interestingness of deviations
,
1994
.
[6]
Masamichi Shimura,et al.
Exceptional Knowledge Discovery in Databases Based on Information Theory
,
1996,
KDD.