Research on Clustering-Differential Privacy for Express Data Release

With the rapid development of “Internet +”, the express delivery industry has exposed more privacy leakage problems. One way is the circulation of the express orders, and the other way is the express data release. For the second problem, this paper proposes a clustering-differential privacy preserving method combining with the theory of anonymization. Firstly, we use DBSCAN density clustering algorithm to initialize the original data set to achieve the first clustering. Secondly, in order to reduce the data generalization we combine the micro-aggregation technology to achieve the second clustering of the data set. Finally, adding Laplace noise to the clustering data record and correct the data that does not satisfy the differential privacy model to ensure the data availability. Simulation experiments show that the clustering-differential privacy preserving method can apply on the express data release, and it can keep higher data availability relative to the traditional differential privacy preserving.