Big Data Privacy and Anonymization

Data privacy has been studied in the area of statistics (statistical disclosure control) and computer science (privacy preserving data mining and privacy enhancing technologies) for at least 40 years. In this period models, measures, methods, and technologies have been developed to effectively protect the disclosure of sensitive information.

[1]  David Chaum,et al.  Untraceable electronic mail, return addresses, and digital pseudonyms , 1981, CACM.

[2]  Dorothy E. Denning,et al.  A fast procedure for finding a tracker in a statistical database , 1980, TODS.

[3]  Chris Clifton,et al.  Privacy-Preserving Data Mining , 2006, Encyclopedia of Database Systems.

[4]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[5]  Marianne Winslett,et al.  Introducing secure provenance: problems and challenges , 2007, StorageSS '07.

[6]  Shouhuai Xu,et al.  A roadmap for privacy-enhanced secure data provenance , 2014, Journal of Intelligent Information Systems.

[7]  Marianne Winslett,et al.  The Case of the Fake Picasso: Preventing History Forgery with Secure Provenance , 2009, FAST.

[8]  Josep Domingo-Ferrer,et al.  Privacy by design in big data: An overview of privacy enhancing technologies in the era of big data analytics , 2015, ArXiv.

[9]  Vicenç Torra,et al.  Integral Privacy , 2016, CANS.

[10]  Josep Domingo-Ferrer,et al.  Statistical Disclosure Control , 2012 .

[11]  Jing Zhang,et al.  Do You Know Where Your Data's Been? - Tamper-Evident Database Provenance , 2009, Secure Data Management.

[12]  Vicenç Torra,et al.  Data privacy , 2014, Advanced Research in Data Privacy.

[13]  Marianne Winslett,et al.  Towards a Secure and Efficient System for End-to-End Provenance , 2010, TaPP.

[14]  Margo I. Seltzer,et al.  Securing Provenance , 2008, HotSec.

[15]  Marianne Winslett,et al.  Preventing history forgery with secure provenance , 2009, TOS.

[16]  George T. Duncan,et al.  Why Statistical Confidentiality , 2011 .

[17]  Josep Domingo-Ferrer,et al.  Big Data Privacy: Challenges to Privacy Principles and Models , 2015, Data Science and Engineering.

[18]  Hans Hedbom,et al.  Privacy Impact Assessment Template for Provenance , 2016, 2016 11th International Conference on Availability, Reliability and Security (ARES).

[19]  Josep Domingo-Ferrer,et al.  Statistical Disclosure Control: Hundepool/Statistical Disclosure Control , 2012 .