Horizontal Scaling Enhancement for Optimized Big Data Processing

Big Data, as we all know, is becoming a new technological trend in the industries, in science and even businesses. Indefinite data scalability allows organizations to process huge amounts of data in parallel, assisting dramatically decrease the amount of time it takes to manage several amount of work, optimize hardware resource usage and permit the extreme quantity of data per node to be handled. Optimization is to done to attain the finest strategy relative to a set of selected constraints which include maximizing factors such as efficiency, productivity, reliability, strength, and utilization. When the current system becomes insufficient, instead of upgrading it by adding more components to the existing structure you just add more computers to a cluster. This research discusses a hierarchical architecture of Hadoop Nodes namely Name nodes and Data nodes and mainly focuses on the optimization of Data Node by distributing some of its work load to Name Node.

[1]  Siddharth Swarup Rautaray,et al.  Name node performance enlarging by aggregator based HADOOP framework , 2017, 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC).

[2]  Siddharth Swarup Rautaray,et al.  Real time financial analysis using big data technologies , 2017, 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC).

[3]  Yon Dohn Chung,et al.  Parallel data processing with MapReduce: a survey , 2012, SGMD.

[4]  Lei Zhang,et al.  Review of hadoop performance optimization , 2016, 2016 2nd IEEE International Conference on Computer and Communications (ICCC).

[5]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[6]  Siddharth Swarup Rautaray,et al.  Feedback analysis using big data tools , 2016, 2016 International Conference on ICT in Business Industry & Government (ICTBIG).

[7]  Siddharth Swarup Rautaray,et al.  A Survey Work on Optimization Techniques Utilizing Map Reduce Framework in Hadoop Cluster , 2017 .

[8]  Christopher Ré,et al.  Automatic Optimization for MapReduce Programs , 2011, Proc. VLDB Endow..

[9]  Siddharth Swarup Rautaray,et al.  A Proposal for High Availability of HDFS Architecture based on Threshold Limit and Saturation Limit of the Namenode , 2017 .

[10]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).