Mining for association rules is one of the fundamental tasks of data mining. Association rule mining searches for interesting relationships amongst items for a given dataset based mainly on the support and confidence measures. Support is used for filtering out infrequent rules, while confidence measures the implication relationships from a set of items to one another. However, one of the main drawbacks of the confidence measure is that it presents the absolute value of implication that does not reflect truthfully the relationships amongst items. For example, if two items have a very high frequency, then they will probably form a rule with a high confidence even if there is no relationship between them at all. In this paper, we propose a new measure known as relative confidence for mining association rules, which is able to reflect truthfully the relationships of items. The effectiveness of the relative confidence measure is evaluated in comparison with the confidence measure in mining interesting relationships between terms from textual documents and in associative classification.
[1]
Wynne Hsu,et al.
Integrating Classification and Association Rule Mining
,
1998,
KDD.
[2]
Rajeev Motwani,et al.
Beyond market baskets: generalizing association rules to correlations
,
1997,
SIGMOD '97.
[3]
Osmar R. Zaïane,et al.
Text document categorization by term association
,
2002,
2002 IEEE International Conference on Data Mining, 2002. Proceedings..
[4]
Nagwa M. El-Makky,et al.
A note on "beyond market baskets: generalizing association rules to correlations"
,
2000,
SKDD.
[5]
Bing Liu,et al.
Classification Using Association Rules: Weaknesses and Enhancements
,
2001
.
[6]
Ramakrishnan Srikant,et al.
Fast algorithms for mining association rules
,
1998,
VLDB 1998.
[7]
Ramakrishnan Srikant,et al.
Fast Algorithms for Mining Association Rules in Large Databases
,
1994,
VLDB.
[8]
Rajeev Motwani,et al.
Dynamic itemset counting and implication rules for market basket data
,
1997,
SIGMOD '97.