Multi-Group ObScure Logging (MG-OSLo) A Privacy-Preserving Protocol for Private Web Search

The Web Search Engine (WSE) is a software system used to retrieve data from the web successfully. WSE uses the user’s search queries to build the user’s profile and provide personalized results. Users’ search queries hold identifiable information that could compromise the privacy of the respective user. This work proposes a multi-group distributed privacy-preserving protocol (MG-OSLo) and tries to investigate the state-of-the-art distributed privacy-preserving protocols for computing web search privacy. The MG-OSLo comprises multiple groups in which each group has a fixed number of users. The MG-OSLo measures the impact of the multi-group on the user’s privacy. The primary objective of this work is to assess local privacy and profile privacy. It aims at evaluating the impact of group size and group count on a user’s privacy. Two grouping approaches are used to group the users in MG-OSLo, i.e. a non-overlapping group design and overlapping group design. The local privacy results reveal that the probability of linking a query to the user depends on the group size and group count. The higher the group size or group count, the lower the likelihood of relating the query to the user. The profile privacy computes the profile obfuscation level using a privacy metric Profile Exposure Level (PEL). Different experiments have been performed to compute the profile privacy of the subset of an AOL query log for two situations: i) self-query submissions allowed and ii) self-query submissions not allowed. The privacy achieved by MG-OSLo is compared with the modern privacy-preserving protocol UUP(e), OSLo, and Co-utile protocols. The results show that the MG-OSLo provided better results as compared to OSLo, UUP, and Co-utile. Similarly, the multi-group has a positive impact on local privacy and user profile privacy.

[1]  Douglas R. Stinson,et al.  Extended results on privacy against coalitions of users in user-private information retrieval protocols , 2015, Cryptography and Communications.

[2]  Rishiraj Saha Roy,et al.  Towards Query Logs for Privacy Studies: On Deriving Search Queries from Questions , 2020, ECIR.

[3]  Muhammad Arshad Islam,et al.  QuPiD Attack: Machine Learning-Based Privacy Quantification Mechanism for PIR Protocols in Health-Related Web Search , 2020, Sci. Program..

[4]  Douglas R. Stinson,et al.  Extended combinatorial constructions for peer-to-peer user-private information retrieval , 2012, Adv. Math. Commun..

[5]  Krishnaram Kenthapadi,et al.  Privacy-preserving Data Mining in Industry , 2019, WSDM.

[6]  Josep Domingo-Ferrer,et al.  Co-Utility: Self-Enforcing protocols for the mutual benefit of participants , 2017, Eng. Appl. Artif. Intell..

[7]  Muhammad Arshad Islam,et al.  ObSecure Logging (OSLo): A Framework to Protect and Evaluate the Web Search Privacy in Health Care Domain , 2019 .

[8]  Hao Wang,et al.  Achieve Web Search Privacy by Obfuscation , 2019 .

[9]  Pradeep Reddy Ch,et al.  Prediction of user interest fluctuation using fuzzy neural networks in web search , 2020 .

[10]  Josep Domingo-Ferrer,et al.  Co-utile P2P anonymous keyword search , 2018, CDC 2018.

[11]  Yehuda Lindell,et al.  Private Web Search with Malicious Adversaries , 2010, Privacy Enhancing Technologies.

[12]  Jordi Castellà-Roca,et al.  Distributed system for private web search with untrusted partners , 2014, Comput. Networks.

[13]  B. N. Mandal,et al.  Balanced treatment incomplete block designs through integer programming , 2017 .

[14]  Josep Maria Mateo-Sanz,et al.  Single-party private web search , 2012, 2012 Tenth Annual International Conference on Privacy, Security and Trust.

[15]  Avi Arampatzis,et al.  Versatile Query Scrambling for Private Web Search , 2015, Information Retrieval Journal.

[16]  Jordi Herrera-Joancomartí,et al.  Preserving user's privacy in web search engines , 2009, Comput. Commun..

[17]  Muhammad Arshad Islam,et al.  Poshida II, a Multi Group Distributed Peer to Peer Protocol for Private Web Search , 2016, 2016 International Conference on Frontiers of Information Technology (FIT).

[18]  Maria Bras-Amorós,et al.  Optimal configurations for peer-to-peer user-private information retrieval , 2010, Comput. Math. Appl..

[19]  Joaquin Garcia-Alfaro,et al.  A Real-Time Query Log Protection Method for Web Search Engines , 2020, IEEE Access.

[20]  CooperAlissa A survey of query log privacy-enhancing techniques from a policy perspective , 2008 .

[21]  Wenzhi Chen,et al.  OB-WSPES: A Uniform Evaluation System for Obfuscation-Based Web Search Privacy , 2021, IEEE Transactions on Dependable and Secure Computing.

[22]  Alissa Cooper,et al.  A survey of query log privacy-enhancing techniques from a policy perspective , 2008, TWEB.

[23]  Claudio Carpineto,et al.  A Review of Ten Year Research on Query Log Privacy , 2016, IIR.

[24]  Nitesh Saxena,et al.  On the Privacy of Web Search Based on Query Obfuscation: A Case Study of TrackMeNot , 2010, Privacy Enhancing Technologies.

[25]  Muhammad Arshad Islam,et al.  Poshida, a protocol for private information retrieval , 2016, 2016 Sixth International Conference on Innovative Computing Technology (INTECH).

[26]  ALI RAZA,et al.  A Framework for Privacy Preserving, Distributed Search Engine Using Topology of DLT and Onion Routing , 2020, IEEE Access.

[27]  Josep Maria Mateo-Sanz,et al.  Exploiting social networks to provide privacy in personalized web search , 2011, J. Syst. Softw..

[28]  Alexandre Viejo,et al.  Using social networks to distort users' profiles generated by web search engines , 2010, Comput. Networks.

[29]  Joan Feigenbaum,et al.  Private web search , 2007, WPES '07.

[30]  Robert E. Bechhofer,et al.  Incomplete Block Designs for Comparing Treatments Wth a Control: General Theory , 1981 .

[31]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[32]  Josep Domingo-Ferrer,et al.  User-private information retrieval based on a peer-to-peer community , 2009, Data Knowl. Eng..

[33]  Muhammad Arshad Islam,et al.  Privacy Exposure Measure: A Privacy-Preserving Technique for Health-Related Web Search , 2019 .