Differentially Private Real-time Streaming Data Publication Based on Sliding Window under Exponential Decay

Continuous response of range query on steaming data provides useful information for many practical applications as well as the risk of privacy disclosure. The existing research on differential privacy streaming data publication mostly pay close attention to boosting query accuracy, but pay less attention to query efficiency, and ignore the effect of timeliness on data weight. In this paper, we propose an effective algorithm of differential privacy streaming data publication under exponential decay mode. Firstly, by introducing the Fenwick tree to divide and reorganize data items in the stream, we achieve a constant time complexity for inserting a new item and getting the prefix sum. Meanwhile, we achieve time complicity linear to the number of data item for building a tree. After that, we use the advantage of matrix mechanism to deal with relevant queries and reduce the global sensitivity. In addition, we choose proper diagonal matrix further improve the range query accuracy. Finally, considering about exponential decay, every data item is weighted by the decay factor. By putting the Fenwick tree and matrix optimization together, we present complete algorithm for differentiate private real-time streaming data publication. The experiment is designed to compare the algorithm in this paper with similar algorithms for streaming data release in exponential decay. Experimental results show that the algorithm in this paper effectively improve the query efficiency while ensuring the quality of the query.

[1]  Elisa Bertino,et al.  Efficient and accurate strategies for differentially-private sliding window queries , 2013, EDBT '13.

[2]  Cynthia Dwork,et al.  Differential Privacy , 2006, ICALP.

[3]  P Xiong,et al.  A Survey on Differential Privacy and Applications , 2014 .

[4]  Elaine Shi,et al.  Private and Continual Release of Statistics , 2010, TSEC.

[5]  Moni Naor,et al.  Differential privacy under continual observation , 2010, STOC '10.

[6]  Philip S. Yu,et al.  Privacy-preserving data publishing: A survey of recent developments , 2010, CSUR.

[7]  Stavros Papadopoulos,et al.  Differentially Private Event Sequences over Infinite Streams , 2014, Proc. VLDB Endow..

[8]  Dan Suciu,et al.  Boosting the accuracy of differentially private histograms through consistency , 2009, Proc. VLDB Endow..

[9]  Zhou Shui Privacy Preservation in Database Applications:A Survey , 2009 .

[10]  Grace Hui Yang,et al.  Privacy-Preserving IR 2015: When Information Retrieval Meets Privacy and Security , 2015, SIGIR.

[11]  Yingjie Wu,et al.  An algorithm for differential privacy streaming data publication based on matrix mechanism under exponential decay mode , 2017 .

[12]  Aleksandar Nikolov,et al.  Private decayed predicate sums on streams , 2013, ICDT '13.

[13]  Yin Yang,et al.  Differential privacy in data publication and analysis , 2012, SIGMOD Conference.

[14]  Andrew McGregor,et al.  Optimizing linear counting queries under differential privacy , 2009, PODS.

[15]  Peter M. Fenwick,et al.  A new data structure for cumulative frequency tables , 1994, Softw. Pract. Exp..

[16]  Yin Yang,et al.  Low-Rank Mechanism: Optimizing Batch Queries under Differential Privacy , 2012, Proc. VLDB Endow..