SPoFC: A framework for stream data aggregation with local differential privacy

Collecting and analysing customers' data plays an essential role in the more intense market competition. It is critical to perform data analysis effectively while ensuring the user's privacy, especially after various privacy regulations are enacted. In this paper, we consider the problem of aggregating the stream data generated from wearable devices in a specific time period in a privacy‐preserving manner. Specifically, we adopt the local differential privacy mechanism to provide a strong privacy guarantee for users. One major challenge is that all values of the stream need to be perturbed. The additive noise makes it hard to release an accurate data stream. One way to reduce the noise scale is to select some data points to perturb instead of all. The intuition is that more privacy budgets are applied to a single data point, which ensures the statistical accuracy. The perturbed data points are used to predict the un‐selected data points without consuming an extra privacy budget. Based on this idea, we propose a novel stream data statistical framework, which includes four components, data fitting, skeleton point selection, noisy stream generation, and data aggregation. Extensive experiment results show that our proposed method achieves a much smaller mean square error given a fixed privacy budget compared with the state‐of‐the‐art.

[1]  J. Du,et al.  Geo-Ellipse-Indistinguishability: Community-Aware Location Privacy Protection for Directional Distribution , 2023, IEEE Transactions on Knowledge and Data Engineering.

[2]  Jinjun Chen,et al.  A Numerical Splitting and Adaptive Privacy Budget-Allocation-Based LDP Mechanism for Privacy Preservation in Blockchain-Powered IoT , 2023, IEEE Internet of Things Journal.

[3]  D. Kotzinos,et al.  Landmark Privacy: Configurable Differential Privacy Protection for Time Series , 2022, CODASPY.

[4]  Jianzhang Chen,et al.  B-DP: Dynamic Collection and Publishing of Continuous Check-In Data with Best-Effort Differential Privacy , 2022, Entropy.

[5]  M. Henzinger,et al.  Constant matters: Fine-grained Complexity of Differentially Private Continual Observation Using Completely Bounded Norms , 2022, IACR Cryptol. ePrint Arch..

[6]  Ying Zhao,et al.  A Survey on Differential Privacy for Unstructured Data Content , 2022, ACM Comput. Surv..

[7]  M. H. Rehmani,et al.  Anomaly Detection in Blockchain Networks: A Comprehensive Survey , 2021, IEEE Communications Surveys & Tutorials.

[8]  Annabelle McIver,et al.  The Laplace Mechanism has optimal utility for differential privacy over continuous queries , 2021, 2021 36th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS).

[9]  Ninghui Li,et al.  Continuous Release of Data Streams under both Centralized and Local Differential Privacy , 2020, CCS.

[10]  Úlfar Erlingsson,et al.  Amplification by Shuffling: From Local to Central Differential Privacy via Anonymity , 2018, SODA.

[11]  Jong Wook Kim,et al.  Privacy-preserving aggregation of personal health data streams , 2018, PloS one.

[12]  Aaron Roth,et al.  Local Differential Privacy for Evolving Data , 2018, NeurIPS.

[13]  Janardhan Kulkarni,et al.  Collecting Telemetry Data Privately , 2017, NIPS.

[14]  Masatoshi Yoshikawa,et al.  Quantifying Differential Privacy in Continuous Data Release Under Temporal Correlations , 2017, IEEE Transactions on Knowledge and Data Engineering.

[15]  Martin J. Wainwright,et al.  Minimax Optimal Procedures for Locally Private Estimation , 2016, ArXiv.

[16]  Aaron Roth,et al.  The Algorithmic Foundations of Differential Privacy , 2014, Found. Trends Theor. Comput. Sci..

[17]  Úlfar Erlingsson,et al.  RAPPOR: Randomized Aggregatable Privacy-Preserving Ordinal Response , 2014, CCS.

[18]  Anand D. Sarwate,et al.  Signal Processing and Machine Learning with Differential Privacy: Algorithms and Challenges for Continuous Data , 2013, IEEE Signal Processing Magazine.

[19]  Martin J. Wainwright,et al.  Local privacy and statistical minimax rates , 2013, 2013 51st Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[20]  Bolin Ding,et al.  CGM: An Enhanced Mechanism for Streaming Data Collectionwith Local Differential Privacy , 2021, Proc. VLDB Endow..

[21]  Teng Wang,et al.  Real-Time Stream Statistics via Local Differential Privacy in Mobile Crowdsensing , 2021, Mobile Multimedia Communications.

[22]  Gaoming Yang,et al.  Local Differential Privacy for Data Streams , 2020, SPDE.