Privacy Preserving Elastic Stream Processing with Clouds Using Homomorphic Encryption

Prevalence of the Infrastructure as a Service (IaaS) clouds has enabled organizations to elastically scale their stream processing applications to public clouds. However, current approaches for elastic stream processing do not consider the potential security vulnerabilities in cloud environments. In this paper we describe the design and implementation of an Elastic Switching Mechanism for data stream processing which is based on Homomorphic Encryption (HomoESM). The HomoESM not only does elastically scale data stream processing applications into public clouds but also preserves the privacy of such applications. Using a real world test setup, which includes an email filter benchmark and a web server access log processor benchmark (EDGAR) we demonstrate the effectiveness of our approach. Multiple experiments on Amazon EC2 indicate that the proposed approach for Homomorphic encryption provides significant results which is 10% to 17% improvement of average latency in the case of email filter benchmark and EDGAR benchmarks respectively. Furthermore, EDGAR add/subtract operations and comparison operations showed 6.13% and 26.17% average latency improvements respectively. These promising results pave the way for real world deployments of privacy preserving elastic stream processing in the cloud.

[1]  Peter R. Pietzuch,et al.  Adaptive Provisioning of Stream Processing Systems in the Cloud , 2012, 2012 IEEE 28th International Conference on Data Engineering Workshops.

[2]  Yiming Yang,et al.  Introducing the Enron Corpus , 2004, CEAS.

[3]  Toyotaro Suzumura,et al.  A Mechanism for Stream Program Performance Recovery in Resource Limited Compute Clusters , 2013, DASFAA.

[4]  Shai Halevi,et al.  Homomorphic Encryption , 2017, Tutorials on the Foundations of Cryptography.

[5]  Dan S. Chiaburu Analytics , 2015, Journal of Management Inquiry.

[6]  Tommaso Cucinotta,et al.  Towards the optimization of a parallel streaming engine for telco applications , 2014, Bell Labs Technical Journal.

[7]  Toyotaro Suzumura,et al.  A Performance Analysis of System S, S4, and Esper via Two Level Benchmarking , 2013, QEST.

[8]  Christof Fetzer,et al.  StreamApprox: approximate computing for stream analytics , 2017, Middleware.

[9]  J. M. Eklund,et al.  Real-Time Analysis for Intensive Care: Development and Deployment of the Artemis Analytic System , 2010, IEEE Engineering in Medicine and Biology Magazine.

[10]  Murat Kantarcioglu,et al.  SGX-BigMatrix: A Practical Encrypted Data Analytic Framework With Trusted Processors , 2017, CCS.

[11]  Sanath Jayasena,et al.  Latency Aware Elastic Switching-based Stream Processing Over Compressed Data Streams , 2017, ICPE.

[12]  Craig Gentry,et al.  Fully homomorphic encryption using ideal lattices , 2009, STOC '09.

[13]  Srinath Perera,et al.  Continuous analytics on geospatial data streams with WSO2 complex event processor , 2015, DEBS.

[14]  Shai Halevi,et al.  Algorithms in HElib , 2014, CRYPTO.

[15]  Berk Sunar,et al.  cuHE: A Homomorphic Encryption Accelerator Library , 2015, IACR Cryptol. ePrint Arch..

[16]  Tim Kraska,et al.  Stormy: an elastic and highly available streaming service in the cloud , 2012, EDBT-ICDT '12.

[17]  Cezar Plesca,et al.  Comparison-based computations over fully homomorphic encrypted data , 2014, 2014 10th International Conference on Communications (COMM).

[18]  Sharma Chakravarthy,et al.  Event-based lossy compression for effective and efficient OLAP over data streams , 2010, Data Knowl. Eng..

[19]  Srinath Perera,et al.  Recent Advancements in Event Processing , 2018, ACM Comput. Surv..

[20]  Muthuramakrishnan Venkitasubramaniam,et al.  Cloud-based secure health monitoring: Optimizing fully-homomorphic encryption for streaming algorithms , 2014, 2014 IEEE Globecom Workshops (GC Wkshps).

[21]  Schahram Dustdar,et al.  Elastic stream processing in the Cloud , 2013, WIREs Data Mining Knowl. Discov..

[22]  Toyotaro Suzumura,et al.  Elastic Stream Computing with Clouds , 2011, 2011 IEEE 4th International Conference on Cloud Computing.