A model for privacy preserving in data mining using Soft Computing techniques

Data mining is branch of computer science that delivers valuable information hidden in large volumes of data. The success of data mining depends on the quality of data and the algorithms used to extract information. A large number of tools and techniques have been developed for the purpose. Soft Computing methods have also emerged as a powerful tool for data mining as soft computing is tolerant to uncertainty, partial truth and imprecision. It helps in achieving solutions that are low cost, robust and tractable. Neural Networks are being extensively used for analysis purposes in every field of life from business to health sectors. In the current scenario where privacy of an individual is an important issue, people are reluctant to share their confidential information. Thereby privacy preserving in data mining (PPDM) has emerged as an indistinguishable component of data mining. The aim of this paper is to propose a model that preserves the privacy of individuals without affecting the final results of the Neural Networks.

[1]  Lotfi A. Zadeh,et al.  Fuzzy logic, neural networks, and soft computing , 1993, CACM.

[2]  Latanya Sweeney,et al.  Achieving k-Anonymity Privacy Protection Using Generalization and Suppression , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[3]  Sankar K. Pal,et al.  Data mining in soft computing framework: a survey , 2002, IEEE Trans. Neural Networks.

[4]  Cali M. Davis,et al.  Data Mining Applications in Higher Education , 2007 .

[5]  M. B. Malik,et al.  Privacy Preserving Data Mining Techniques: Current Scenario and Future Prospects , 2012, 2012 Third International Conference on Computer and Communication Technology.

[6]  Philip S. Yu,et al.  A Condensation Approach to Privacy Preserving Data Mining , 2004, EDBT.

[7]  David J. Hand,et al.  Statistics and data mining: intersecting disciplines , 1999, SKDD.

[8]  John Castellani,et al.  Enhancing Learning Environments through Solution-Based Knowledge Discovery Tools: Forecasting for Self-perpetuating Systemic Reform , 2001 .

[9]  Lipika Dey,et al.  A k-mean clustering algorithm for mixed numeric and categorical data , 2007, Data Knowl. Eng..

[10]  Zhihua Xiao Statistics and Data Mining , 2000 .

[11]  Gautam Biswas,et al.  Unsupervised Learning with Mixed Numeric and Nominal Data , 2002, IEEE Trans. Knowl. Data Eng..

[12]  M. Sordo Introduction to Neural Networks in Healthcare , 2002 .

[13]  Philip S. Yu,et al.  Privacy-preserving data publishing: A survey of recent developments , 2010, CSUR.

[14]  Yehuda Lindell,et al.  Privacy Preserving Data Mining , 2000, Journal of Cryptology.

[15]  Vinod Sharma,et al.  Comparative analysis of machine learning techniques in prognosis of type II diabetes , 2014, AI & SOCIETY.

[16]  Philip S. Yu,et al.  A General Survey of Privacy-Preserving Data Mining Models and Algorithms , 2008, Privacy-Preserving Data Mining.

[17]  Syed,et al.  A FUZZY BASED APPROACH FOR PRIVACY PRESERVING CLUSTERING , 2014 .

[18]  Chris Clifton,et al.  Tools for privacy preserving distributed data mining , 2002, SKDD.