Content Based Image Retrieval with Enhanced Privacy in Cloud Using Apache Spark

Content Based Image Retrieval (CBIR) is the application through the integration of various traditional computer vision techniques to the image retrieval problem. CBIR usually contains high volume of personal and authenticated information which makes image privacy as a major concern. The proposed scheme for privacy allows the data owner to outsource the image database and the CBIR service to the cloud, without revealing the actual content of the database to the cloud server. The system uses only the index of the query image to prevent revealing the content of the image. Second, the image is encrypted using the session key generated between the client and the cloud server and then sent to the server for computation. Being the reduced information, it is computing stiff for the server to know the client’s interest. The proposed architecture also uses a novel framework called Apache Spark which is considered to 100x faster than MapReduce in certain applications. Result shown Apache Spark is in-memory computations which helps a lot in the retrieval of similar images. Apache Spark framework has been used when the images are newly inserted into the Cloud storage at the point of which features are extracted from the image using transformation and action functionalities and stored in the feature database.

[1]  Bin Wu,et al.  Map/Reduce in CBIR application , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[2]  Sergio Ramírez-Gallego,et al.  Distributed Entropy Minimization Discretizer for Big Data Analysis under Apache Spark , 2015, 2015 IEEE Trustcom/BigDataSE/ISPA.

[3]  Yangyong Zhu,et al.  A Spark-Based Big Data Platform for Massive Remote Sensing Data Processing , 2015, ICDS.

[4]  Subrahmanyam Murala,et al.  Local Tetra Patterns: A New Feature Descriptor for Content-Based Image Retrieval , 2012, IEEE Transactions on Image Processing.

[5]  Mahesh U. Shankarwar,et al.  Security and Privacy in Cloud Computing: A Survey , 2014, FICTA.

[6]  Arun K. Pujari,et al.  Texture element feature characterizations for CBIR , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[7]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[9]  Yi Zhu,et al.  Towards Privacy-Preserving Content-Based Image Retrieval in Cloud Computing , 2018, IEEE Transactions on Cloud Computing.

[10]  Anwar Haque,et al.  Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms , 2015, J. Comput. Sci..

[11]  Laurent Amsaleg,et al.  A Privacy-Preserving Framework for Large-Scale Content-Based Information Retrieval , 2015, IEEE Transactions on Information Forensics and Security.

[12]  Wan-Chi Siu,et al.  Multimedia Information Retrieval and Management: Technological Fundamentals and Applications , 2010 .