An approach for big data security based on Hadoop distributed file system

Cloud computing appeared for huge data because of its ability to provide users with on-demand, reliable, flexible, and low-cost services. With the increasing use of cloud applications, data security protection has become an important issue for the cloud. In this work, the proposed approach was used to improve the performance of encryption /Decryption file by using AES and OTP algorithms integrated on Hadoop. Where files are encrypted within the HDFS and decrypted within the Map Task. Encryption /Decryption in previous works used AES algorithm, the size of the encrypted file increased by 50% from the original file size. The proposed approach improved this ratio as the size of the encrypted file increased by 20% from the original file size. Also, we have compared this approach with the previously implemented method, we implement this new approach to secure HDFS, and some experimental studies were conducted to verify its effectiveness.

[1]  Pankaj Singh,et al.  Big Data: Technologies, Trends and Applications , 2015 .

[2]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[3]  William Stallings,et al.  THE ADVANCED ENCRYPTION STANDARD , 2002, Cryptologia.

[4]  Mehedi Hasan,et al.  A New HDFS Structure Model to Evaluate The Performance of Word Count Application on Different File Size , 2015 .

[5]  Fatma A. Omara,et al.  A Comparative Study of HDFS Replication Approaches , 2015 .

[6]  Diksha Sharma Challenges Involved in Big Data Processing & Methods to Solve Big Data Processing Problems , 2017 .

[7]  Youngseok Lee,et al.  Secure Hadoop with Encrypted HDFS , 2013, GPC.

[8]  Wen-Guey Tzeng,et al.  Toward Data Confidentiality via Integrating Hybrid Encryption Schemes and Hadoop Distributed File System , 2012, 2012 IEEE 26th International Conference on Advanced Information Networking and Applications.

[9]  Chao Yang,et al.  A Novel Triple Encryption Scheme for Hadoop-Based Cloud Data Security , 2013, 2013 Fourth International Conference on Emerging Intelligent Data and Web Technologies.

[10]  Mohie M. Hadhoud,et al.  Performance Evaluation of Symmetric Encryption Algorithms , 2008 .

[11]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[12]  Ayushi A Symmetric Key Cryptographic Algorithm , 2010 .