With the rapid advance of network and data mining techniques, protecting the confidentiality of sensitive information in a database has become a critical issue when releasing data to outside parties. Association analysis is a powerful and popular tool for discovering relationships hidden in large data sets. These relationships can be represented as frequent itemsets or association rules. A rule is categorized as sensitive if its disclosure risk is above a given threshold. Privacy-preserving data mining is an important issue in various domains, such as Web commerce, crime reconnoitering, health care, and customer consumption analysis. The main approach to hiding sensitive association rules is to reduce the support or the confidence of the rules by modifying transactions or items in the database. However, these modifications generate side effects: nonsensitive rules that are falsely hidden (lost rules) and spurious rules that are falsely generated (new rules). There is a trade-off between the sensitive rules hidden and the side effects generated. In this study, we propose an efficient algorithm, FHSAR, for fast hiding of sensitive association rules (SAR). The algorithm can completely hide any given SAR by scanning the database only once, which significantly reduces the execution time. Experimental results show that FHSAR outperforms previous works in terms of execution time required and side effects generated in most cases.
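The general idea of hiding a rule by lowering its support or confidence can be illustrated with a minimal sketch. This is not the FHSAR algorithm itself, whose details are not given here; the function names, the toy database, and the threshold values below are all assumptions chosen for illustration.

```python
from typing import FrozenSet, List

Transaction = FrozenSet[str]


def support(db: List[Transaction], itemset: FrozenSet[str]) -> float:
    """Fraction of transactions that contain every item in `itemset`."""
    if not db:
        return 0.0
    return sum(1 for t in db if itemset <= t) / len(db)


def confidence(db: List[Transaction], lhs: FrozenSet[str], rhs: FrozenSet[str]) -> float:
    """support(lhs ∪ rhs) / support(lhs), or 0.0 if lhs never occurs."""
    lhs_support = support(db, lhs)
    if lhs_support == 0:
        return 0.0
    return support(db, lhs | rhs) / lhs_support


def is_hidden(db, lhs, rhs, min_support: float, min_confidence: float) -> bool:
    """A rule counts as hidden once it falls below either mining threshold."""
    return (support(db, lhs | rhs) < min_support
            or confidence(db, lhs, rhs) < min_confidence)


def remove_item(db: List[Transaction], index: int, item: str) -> List[Transaction]:
    """Return a sanitized copy of `db` with `item` removed from one transaction."""
    return [t - {item} if i == index else t for i, t in enumerate(db)]


# Hypothetical toy database; each string is a transaction of single-letter items.
db = [frozenset("abc"), frozenset("ab"), frozenset("ac"),
      frozenset("bc"), frozenset("a")]

# Hypothetical sensitive rule a -> b and hypothetical mining thresholds.
lhs, rhs = frozenset("a"), frozenset("b")
MIN_SUPPORT, MIN_CONFIDENCE = 0.3, 0.4

# Sanitize by deleting item b from the second transaction {a, b}.
sanitized = remove_item(db, 1, "b")

hidden_before = is_hidden(db, lhs, rhs, MIN_SUPPORT, MIN_CONFIDENCE)
hidden_after = is_hidden(sanitized, lhs, rhs, MIN_SUPPORT, MIN_CONFIDENCE)
```

In this toy database, deleting one item drops the support of a -> b below the assumed threshold, so the rule becomes hidden. The same deletion also raises the confidence of the nonsensitive rule b -> c, which is the kind of side effect (a potential new rule) that a hiding algorithm has to weigh against the hiding itself.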