Data protection on Hadoop Distributed File System by using encryption algorithms: A systematic literature review

Big data technology is capable of processing huge amounts of structured and unstructured data. Today it supports business needs by extracting massive amounts of data and recognizing patterns in them to predict future trends, which gives businesses the insight needed to shape strategy and gain substantial benefit. Hadoop is a reliable technology developed to distribute the processing and storage of big data efficiently. However, Hadoop does not encrypt data by default. Its additional encryption zone feature has a security issue in that key management is handled outside of HDFS. Sensitive and confidential data in HDFS can therefore be exposed to security attacks. Information security is a fundamental concern and poses a new set of challenges for the world of big data. The main purpose of this paper is to protect Hadoop Distributed File System (HDFS) data by using an encryption algorithm, so that data is secured at the storage level of HDFS. When dealing with big data, it is important to choose an encryption algorithm that is fast enough to deliver good performance. The research methodology is a systematic literature review (SLR) conducted according to PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses).
