PDCS: A Privacy-Preserving Distinct Counting Scheme for Mobile Sensing

Mobile sensing mines group information through sensing and aggregating users’ data. Among major mobile sensing applications, the distinct counting problem aiming to find the number of distinct elements in a data stream with repeated elements, is extremely important for avoiding waste of resources. Besides, the privacy protection of users is also a critical issue for aggregation security. However, it is a challenge to meet these two requirements simultaneously since normal privacy-preserving methods would have negative influence on the accuracy and efficiency of distinct counting. In this paper, we propose a Privacy-preserving Distinct Counting Scheme (PDCS) for mobile sensing. By integrating the basic idea of homomorphic encryption into Flajolet-Martin (FM) sketch, PDCS allows an aggregator to conduct distinct counting over large-scale data sets without knowing privacy of users. Moreover, PDCS supports various forms of sensing data, including camera images, location data, etc. PDCS expands each bit of the hashing values of users’ original data, FM sketch is thus enhanced for encryption to protect users’ privacy. We prove the security of PDCS under known-plaintext model. The theoretic and experimental results show that PDCS achieves high counting accuracy and practical efficiency with scalability over large-scale data sets.

[1]  Jeffrey Considine,et al.  Spatio-temporal aggregation using sketches , 2004, Proceedings. 20th International Conference on Data Engineering.

[2]  Martin Mauve,et al.  A probabilistic method for cooperative hierarchical aggregation of data in VANETs , 2010, Ad Hoc Networks.

[3]  Sheng Zhong,et al.  Efficient and Privacy-Preserving Min and $k$ th Min Computations in Mobile Sensing Systems , 2017, IEEE Transactions on Dependable and Secure Computing.

[4]  Martin Mauve,et al.  Scalable Data Dissemination for Inter-Vehicle-Communication: Aggregation versus Peer-to-Peer (Skalierbare Informationsverbreitung für die Fahrzeug-Fahrzeug-Kommunikation: Aggregation versus Peer-to-Peer) , 2008, it Inf. Technol..

[5]  Hwangnam Kim,et al.  Preserving privacy and efficiency in data communication and aggregation for AMI network , 2016, J. Netw. Comput. Appl..

[6]  Martin Mauve,et al.  Probabilistic aggregation for data dissemination in VANETs , 2007, VANET '07.

[7]  Luca Trevisan,et al.  Counting Distinct Elements in a Data Stream , 2002, RANDOM.

[8]  Xiaobo Tan,et al.  An Autonomous Robotic Fish for Mobile Sensing , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Joseph K. Liu,et al.  Privacy-preserving personal data operation on mobile cloud - Chances and challenges over advanced persistent threat , 2018, Future Gener. Comput. Syst..

[10]  Wen Hu,et al.  Ear-phone: an end-to-end participatory urban noise mapping system , 2010, IPSN '10.

[11]  Xiaohui Liang,et al.  EPPA: An Efficient and Privacy-Preserving Aggregation Scheme for Secure Smart Grid Communications , 2012, IEEE Transactions on Parallel and Distributed Systems.

[12]  Jeffrey Considine,et al.  Approximate aggregation techniques for sensor databases , 2004, Proceedings. 20th International Conference on Data Engineering.

[13]  Sivan Toledo,et al.  VTrack: accurate, energy-aware road traffic delay estimation using mobile phones , 2009, SenSys '09.

[14]  Qinghua Li,et al.  Efficient and privacy-preserving data aggregation in mobile sensing , 2012, 2012 20th IEEE International Conference on Network Protocols (ICNP).

[15]  Martin Mauve,et al.  Near-Optimal Compression of Probabilistic Counting Sketches for Networking Applications , 2007, DIALM-POMC.

[16]  Helmut Prodinger,et al.  How to Count Quickly and Accurately: A Unified Analysis of Probabilistic Counting and Other Related Problems , 1992, ICALP.

[17]  Qinghua Li,et al.  Efficient and Privacy-Aware Data Aggregation in Mobile Sensing , 2014, IEEE Transactions on Dependable and Secure Computing.

[18]  Zhenfu Cao,et al.  Serial number based encryption and its application for mobile social networks , 2016, Peer-to-Peer Networking and Applications.

[19]  Qi Han,et al.  SAS: A Secure Data Aggregation Scheme in Vehicular Sensing Networks , 2010, 2010 IEEE International Conference on Communications.

[20]  Elmar Schoch,et al.  A fuzzy logic based approach for structure-free aggregation in vehicular ad-hoc networks , 2009, VANET '09.

[21]  Wei Jiang,et al.  A Probabilistic Encryption Based MIN/MAX Computation in Wireless Sensor Networks , 2013, 2013 IEEE 14th International Conference on Mobile Data Management.

[22]  Joseph M. Hellerstein,et al.  Proof Sketches: Verifiable In-Network Aggregation , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[23]  Hao Wang,et al.  Fine-Grained Probability Counting: Refined LogLog Algorithm , 2018, 2018 IEEE International Conference on Big Data and Smart Computing (BigComp).

[24]  Philippe Flajolet,et al.  Probabilistic Counting Algorithms for Data Base Applications , 1985, J. Comput. Syst. Sci..

[25]  Liviu Iftode,et al.  TrafficView: traffic data dissemination using car-to-car communication , 2004, MOCO.