Development of Retargetable Hadoop Simulation EnvironmentBased on DEVS Formalism

Hadoop platform is a representative storing and managing platform for big data. Hadoop consists of distributed computing system called MapReduce and distributed file system called HDFS. It is important to analyse the effectiveness according to the change of cluster constructions and several parameters. However, since it is hard to construct thousands of clusters and analyse the constructed system, simulation method is required to analyse the system. This paper proposes Hadoop simulator based on DEVS formalism which provides hierarchical and modular modeling. Hadoop simulator provides a retargetable experimental environment that is possible to change of various parameters, algorithms and models. It is also possible to design input models reflecting the characteristics of Hadoop applications. To maximize the user's convenience, the user interface, real-time model viewer, and input scenario editor are also provided. In this paper, we validate Hadoop Simulator through the comparison with the Hadoop execution results and perform various experiments.

[1]  Guanying Wang,et al.  Using realistic simulation for performance analysis of mapreduce setups , 2009, LSAP '09.

[2]  Ian Gorton,et al.  Exploring performance models of Hadoop applications on cloud architecture , 2015, 2015 11th International ACM SIGSOFT Conference on Quality of Software Architectures (QoSA).

[3]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[4]  Chang Ho Sung,et al.  Objective-driven DEVS modeling using OPI matrix for performance evaluation of discrete event systems , 2007, SCSC.

[5]  Maozhen Li,et al.  MRSim: A discrete event based MapReduce simulator , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[6]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[7]  Maozhen Li,et al.  HSim: A MapReduce simulator in enabling Cloud Computing , 2013, Future Gener. Comput. Syst..

[8]  Su-Youn Hong,et al.  DEVSim++ Toolset for Defense Modeling and Simulation and Interoperation , 2011 .

[9]  Tag Gon Kim,et al.  Cooperation between data modeling and simulation modeling for performance analysis of Hadoop , 2017, 2017 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS).