QueryGuard: Privacy-Preserving Latency-Aware Query Optimization for Edge Computing

The emerging edge computing paradigm has enabled applications having low response time requirements to meet the quality of service needs of applications by moving the computations to the edge of the network that is geographically closer to the end-users and end-devices. Despite the low latency advantages provided by the edge computing model, there are significant privacy risks associated with the adoption of edge computing services for applications dealing with sensitive data. In contrast to cloud data centers where system infrastructures are managed through strict and regularized policies, edge computing nodes are scattered geographically and may not have the same degree of regulatory and monitoring oversight. This can lead to higher privacy risks for the data processed and stored at the edge nodes, thus making them less trusted. In this paper, we show that a direct application of traditional performance-based query optimization techniques in edge computing can lead to unexpected data disclosure risks at the edge nodes. We propose a new privacy-preserving latency-aware query optimization framework, QueryGuard, that simultaneously tackles the privacy-aware distributed query processing problem while optimizing the queries for latency. Our experimental evaluation demonstrates that QueryGuard achieves better performance in terms of execution time and memory usage than conventional distributed query optimization techniques while also enforcing the required constraints related to data privacy.

[1]  Yannis E. Ioannidis,et al.  Query optimization , 1996, CSUR.

[2]  Abel N. Kho,et al.  SMCQL: Secure Query Processing for Private Data Networks , 2016, Proc. VLDB Endow..

[3]  Margo I. Seltzer,et al.  Network-Aware Operator Placement for Stream-Processing Systems , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[4]  Jennifer Widom,et al.  STREAM: The Stanford Data Stream Management System , 2016, Data Stream Management.

[5]  Surajit Chaudhuri,et al.  An overview of query optimization in relational systems , 1998, PODS.

[6]  Cong Wang,et al.  Privacy-Preserving Query over Encrypted Graph-Structured Data in Cloud Computing , 2011, 2011 31st International Conference on Distributed Computing Systems.

[7]  Daniel J. Abadi,et al.  Query optimization of distributed pattern matching , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[8]  Yannis E. Ioannidis,et al.  Query optimization in distributed networks of autonomous database systems , 2006, TODS.

[9]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[10]  Rafic Bachnak,et al.  Modeling Latency in a Network Distribution , 2012, Netw. Commun. Technol..

[11]  Teruo Higashino,et al.  Edge-centric Computing: Vision and Challenges , 2015, CCRV.

[12]  Yanif Ahmad,et al.  Networked Query Processing for Distributed Stream-Based Applications , 2004, VLDB.

[13]  Ting Yu,et al.  PAQO: Preference-aware query optimization for decentralized database systems , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[14]  Donald Kossmann,et al.  Iterative dynamic programming: a new class of query optimization algorithms , 2000, TODS.

[15]  Kenneth A. Ross,et al.  Track join: distributed joins with minimal network traffic , 2014, SIGMOD Conference.

[16]  Hari Balakrishnan,et al.  CryptDB: protecting confidentiality with encrypted query processing , 2011, SOSP.

[17]  Jennifer Widom,et al.  Operator placement for in-network stream query processing , 2005, PODS.

[18]  Laura M. Haas,et al.  Optimizing Queries Across Diverse Data Sources , 1997, VLDB.

[19]  Mahadev Satyanarayanan,et al.  The Emergence of Edge Computing , 2017, Computer.

[20]  K. Selçuk Candan,et al.  Load and network aware query routing for information integration , 2005, 21st International Conference on Data Engineering (ICDE'05).

[21]  Donald Kossmann,et al.  The state of the art in distributed query processing , 2000, CSUR.