With the rapid development of “Internet +”, the express delivery industry has exposed more privacy leakage problems. One way is the circulation of the express orders, and the other way is the express data release. For the second problem, this paper proposes a clustering-differential privacy preserving method combining with the theory of anonymization. Firstly, we use DBSCAN density clustering algorithm to initialize the original data set to achieve the first clustering. Secondly, in order to reduce the data generalization we combine the micro-aggregation technology to achieve the second clustering of the data set. Finally, adding Laplace noise to the clustering data record and correct the data that does not satisfy the differential privacy model to ensure the data availability. Simulation experiments show that the clustering-differential privacy preserving method can apply on the express data release, and it can keep higher data availability relative to the traditional differential privacy preserving.
[1]
Frank McSherry,et al.
Privacy integrated queries: an extensible platform for privacy-preserving data analysis
,
2009,
SIGMOD Conference.
[2]
Wei Qia.
Express information protection application based on K-anonymity
,
2014
.
[3]
Adam D. Smith,et al.
Discovering frequent patterns in sensitive data
,
2010,
KDD.
[4]
Zhang Yong,et al.
A Privacy-Preserving Data Publishing Algorithm for Clustering Application
,
2010
.
[5]
Zhou Chun-qia.
Research on Privacy Protection in Express Information Management System
,
2015
.
[6]
John Riedl,et al.
Item-based collaborative filtering recommendation algorithms
,
2001,
WWW '01.
[7]
Latanya Sweeney,et al.
k-Anonymity: A Model for Protecting Privacy
,
2002,
Int. J. Uncertain. Fuzziness Knowl. Based Syst..