Distributed multipliers in MWM for analyzing job arrival processes in massive HPC workload datasets

There are three distributed multipliers multifractal wavelet model (MWM) on network traffic. The multifractal wavelet model ( MWM) recently has been introduced as a good choice to yield long range dependence (LRD) and fractal behavior for a job arrival process for parallel workload analysis. In this paper, based on the Multifractal Wavelet Model (MWM), we choose the three kinds of distributions for the multipliers, namely the symmetric -distribution, a symmetric point-mass distribution, and a hybrid distribution, the influence which distributed multipliers had on the MWM was analyzed with the analysis and comparison between the real job arrival process and the synthesized one. We find that the choice of -distribution wavelet multipliers Aj,k is not necessary. For the job arrival process, we can use different distributed multipliers to control the wavelet energy decay in MWM. We compared three multifractal wavelet models for the job arrival process.We find that the point-mass MWM can well match that of the real process.We find that the hybrid MWM can also well match that of the real process.We find that hybrid MWM has no greater advantage than beta MWM.

[1]  Xingming Sun,et al.  Model Based on Different Distributed Multipliers , 2008 .

[2]  Lizhe Wang,et al.  A grid infrastructure for hybrid simulations , 2011, Comput. Syst. Sci. Eng..

[3]  Dror G. Feitelson,et al.  The workload on parallel supercomputers: modeling the characteristics of rigid jobs , 2003, J. Parallel Distributed Comput..

[4]  Tran Ngoc Minh,et al.  Modeling Job Arrival Process with Long Range Dependence and Burstiness Characteristics , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[5]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[6]  Denis Trystram,et al.  A synthetic workload generator for cluster computing , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[7]  Emmanouel A. Varvarigos,et al.  Statistical Analysis and Modeling of Jobs in a Grid Environment , 2007, Journal of Grid Computing.

[8]  Albert Y. Zomaya,et al.  CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. (2012) Published online in Wiley Online Library (wileyonlinelibrary.com). DOI: 10.1002/cpe.2839 SPECIAL ISSUE PAPER Energy efficient genetic-based schedulers in comp , 2022 .

[9]  Patricia Morreale,et al.  Applications of Computational Science: Data-Intensive Computing for Student Projects , 2012, Computing in Science & Engineering.

[10]  Lizhe Wang,et al.  Implementation and performance evaluation of the parallel CORBA application on computational grids , 2008, Adv. Eng. Softw..

[11]  Hui Li,et al.  Workload dynamics on clusters and grids , 2008, The Journal of Supercomputing.

[12]  Fatos Xhafa,et al.  Enhancing the genetic-based scheduling in computational grids by a structured hierarchical population , 2011, Future Gener. Comput. Syst..

[13]  Michalis Faloutsos,et al.  Long-range dependence: now you see it, now you don't! , 2002, Global Telecommunications Conference, 2002. GLOBECOM '02. IEEE.

[14]  Michael Muskulus,et al.  Analysis and modeling of job arrivals in a production grid , 2007, PERV.

[15]  Lizhe Wang,et al.  A general metric and parallel framework for adaptive image fusion in clusters , 2014, Concurr. Comput. Pract. Exp..

[16]  Lizhe Wang,et al.  Thermal aware workload placement with task-temperature profiles in a data center , 2011, The Journal of Supercomputing.

[17]  Rajiv Ranjan,et al.  MediaWise cloud content orchestrator , 2013, Journal of Internet Services and Applications.

[18]  Dror G. Feitelson,et al.  Workload Modeling for Performance Evaluation , 2002, Performance.

[19]  Jens Mache,et al.  A Comparative Study of Real Workload Traces and Synthetic Workload Models for Parallel Job Scheduling , 1998, JSSPP.

[20]  Rajkumar Buyya,et al.  A taxonomy and survey on autonomic management of applications in grid computing environments , 2011 .

[21]  Lizhe Wang,et al.  Hybrid modelling and simulation of huge crowd over a hierarchical Grid architecture , 2013, Future Gener. Comput. Syst..

[22]  Albert Y. Zomaya,et al.  Energy-aware parallel task scheduling in a cluster , 2013, Future Gener. Comput. Syst..

[23]  Rajkumar Buyya,et al.  Coordinated load management in Peer-to-Peer coupled federated grid systems , 2012, The Journal of Supercomputing.

[24]  Francine Berman,et al.  A model for moldable supercomputer jobs , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[25]  Samee Ullah Khan,et al.  Multi-level hierarchic genetic-based scheduling of independent jobs in dynamic heterogeneous grid environment , 2012, Inf. Sci..

[26]  Juan Li,et al.  A comparative study of rate monotonic schedulability tests , 2012, The Journal of Supercomputing.

[27]  Jack J. Dongarra An Overview of High Performance Computing and Challenges for the Future , 2008, VECPAR.

[28]  Geoffrey C. Fox,et al.  Task scheduling with ANN-based temperature prediction in a data center: a simulation-based study , 2011, Engineering with Computers.

[29]  Dan Tsafrir,et al.  Modeling User Runtime Estimates , 2005, JSSPP.

[30]  Emmanuel Medernach,et al.  Workload Analysis of a Cluster in a Grid Environment , 2005, JSSPP.

[31]  M. Teich,et al.  Fractal-Based Point Processes , 2005 .

[32]  Lizhe Wang,et al.  Performance evaluation of virtual machine-based Grid workflow system , 2008 .

[33]  Gregor von Laszewski,et al.  Towards building a cloud for scientific applications , 2011, Adv. Eng. Softw..

[34]  Patrice Abry,et al.  Wavelet Analysis of Long-Range-Dependent Traffic , 1998, IEEE Trans. Inf. Theory.

[35]  Rajiv Ranjan,et al.  CloudGenius: decision support for web server cloud migration , 2012, WWW.

[36]  Lizhe Wang,et al.  Resource management of distributed virtual machines , 2012, Int. J. Ad Hoc Ubiquitous Comput..

[37]  Fatos Xhafa,et al.  Modern approaches to modeling user requirements on resource and task allocation in hierarchical computational grids , 2011, Int. J. Appl. Math. Comput. Sci..

[38]  Xi He,et al.  Cloud Computing: a Perspective Study , 2010, New Generation Computing.

[39]  Jan Beran,et al.  Statistics for long-memory processes , 1994 .

[40]  Jelena Kovacevic,et al.  Wavelets and Subband Coding , 2013, Prentice Hall Signal Processing Series.

[41]  Richard G. Baraniuk,et al.  A Multifractal Wavelet Model with Application to Network Traffic , 1999, IEEE Trans. Inf. Theory.

[42]  Lizhe Wang,et al.  Review of performance metrics for green data centers: a taxonomy study , 2011, The Journal of Supercomputing.

[43]  H. E. Hurst,et al.  Long-Term Storage Capacity of Reservoirs , 1951 .

[44]  Juan Li,et al.  An overview of energy efficiency techniques in cluster computing systems , 2013, Cluster Computing.

[45]  Rajiv Ranjan,et al.  G-Hadoop: MapReduce across distributed data centers for data-intensive computing , 2013, Future Gener. Comput. Syst..

[46]  Wei Zhang,et al.  Research on Multi-Fractal Wavelet Model Based on Different Distributed Multipliers , 2008, 2008 International Symposium on Computational Intelligence and Design.