An Automatic Alignment and Grouping of Hadoop Cluster

The MapReduce Framework and Hadoop is the platform for scalable analysis on large Datasets in recent years. The primary concern in the Hadoop is to minimize the completion length (i.e., makespan) and fixed number of MapReduce jobs. This makes performance low and causes low resource utilization. To overcome this, we propose a system which dynamically allocates the map and reduce jobs, thus leading to high resource utilization and reduced completion length. The dynamic allocation of MapReduce jobs is achieved by implementing Combiner Interface in MapReduce Framework. The Proposed solution is implemented in the Amazon EC2 Cluster in both Homogeneous and Heterogeneous Clusters. The experimental results show the effectiveness and robustness of our proposed system under simple workloads.nbsp

[1]  Prajesh P. Anchalia Improved MapReduce k-Means Clustering Algorithm with Combiner , 2014, 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation.

[2]  Yi Yao,et al.  Self-Adjusting Slot Configurations for Homogeneous and Heterogeneous Hadoop Clusters , 2017, IEEE Transactions on Cloud Computing.

[3]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.