Huge volume of detailed personal data is regularly collected and sharing of these data is proved to be beneficial for data mining application. Such data include shopping habits, criminal records, medical history, credit records etc .On one hand such data is an important asset to business organization and governments for decision making by analyzing it .On the other hand privacy regulations and other privacy concerns may prevent data owners from sharing information for data analysis. In order to share data while preserving privacy data owner must come up with a solution which achieves the dual goal of privacy preservation as well as accurate clustering result. Trying to give solution for this we implemented vector quantization approach piecewise on the datasets which segmentize each row of datasets and quantization approach is performed on each segment using K means which later are again united to form a transformed data set. Some experimental results are presented which tries to finds the optimum value of segment size and quantization parameter which gives optimum in the tradeoff between clustering utility and data privacy in the input dataset.
[1]
Ian Witten,et al.
Data Mining
,
2000
.
[2]
Max Bramer,et al.
Principles of Data Mining
,
2013,
Undergraduate Topics in Computer Science.
[3]
Pavel Berkhin,et al.
A Survey of Clustering Data Mining Techniques
,
2006,
Grouping Multidimensional Data.
[4]
Yunfeng Wang,et al.
Privacy Preserving Data Mining Research: Current Status and Key Issues
,
2007,
International Conference on Computational Science.
[5]
Qiang Wang,et al.
A dimensionality reduction technique for efficient time series similarity analysis
,
2008,
Inf. Syst..
[6]
Ramakrishnan Srikant,et al.
Privacy-preserving data mining
,
2000,
SIGMOD '00.
[7]
Philip S. Yu,et al.
Privacy-Preserving Data Mining - Models and Algorithms
,
2008,
Advances in Database Systems.
[8]
Osmar R. Zaïane,et al.
A privacy-preserving clustering approach toward secure and effective data analysis for business collaboration
,
2007,
Comput. Secur..