Rough set based Privacy Preserving Attribute Reduction on Horizontally partitioned data and generation of Rules

Data mining is the process of extracting knowledge from various databases. For security reasons organizations may partition their data horizontally or vertically. Another situation is hospitals maintaining sensitive information about patients. This can also be treated as horizontal partitioning of data if all the hospitals maintain data in the same format. Secure multiparty technique is a cryptographic method which can be used to derive useful results from partitioned data bases without violating the privacy of individuals or organizations. This paper describes a method for finding a rough set concept called reduct set from the partitioned databases using secure multiparty technique. From this reduct set, rules are generated using Naive Bayesian algorithm. This paper also describes attribute reduction technique for horizontally partitioning databases .Generally it is necessary to implement data preprocessing operations on the data bases. This paper also describes an enhanced Transitive Closure algorithm for data cleaning.

[1]  Mingquan Ye,et al.  Privacy Preserving Attribute Reduction for Vertically Partitioned Data , 2010, 2010 International Conference on Artificial Intelligence and Computational Intelligence.

[2]  Chris Clifton,et al.  Privacy-preserving clustering with distributed EM mixture modeling , 2004, Knowledge and Information Systems.

[3]  Yashvardhan Sharma,et al.  HADCLEAN: A hybrid approach to data cleaning in data warehouses , 2012, 2012 International Conference on Information Retrieval & Knowledge Management.

[4]  Mingquan Ye,et al.  Privacy preserving attribute reduction for horizontally partitioned data , 2010, 2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering.