GA-Based Task Scheduler for the Cloud Computing Systems

Task scheduling is of paramount importance because it determines the efficiency of the entire cloud computing facility. In Hadoop, the open-source implementation of MapReduce, the master node uses a scheduling policy, such as FIFO or the delay scheduling of the FAIR scheduler, to distribute waiting tasks to computing nodes (slaves) in response to the status messages it receives from those nodes. Although delay scheduling is reported to improve throughput and response times by a factor of 2 compared to FIFO, further improvement is possible by taking a holistic view of all the tasks waiting to be processed. Therefore, this paper proposes a new scheduler that makes each scheduling decision by evaluating the entire group of tasks in the job queue. A genetic algorithm is designed as the optimization method for the new scheduler. Preliminary simulation results show that our scheduler achieves a shorter makespan for jobs than the FIFO and delay scheduling policies and a better-balanced load across all the nodes in the cloud.
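To make the idea of scheduling over the whole queue concrete, the following is a minimal sketch of a genetic algorithm that assigns a batch of tasks to nodes so as to minimize the makespan. It is not the scheduler evaluated in the paper: the abstract does not specify the chromosome encoding, fitness function, or genetic operators, so everything here is an assumption made for illustration. The names `makespan`, `round_robin`, and `ga_schedule`, the per-task cost estimates, the tournament selection, the one-point crossover, and the parameter defaults are all hypothetical. A round-robin assignment stands in as a simple baseline and is not an implementation of Hadoop's FIFO or delay scheduling.

```python
import random


def makespan(assignment, task_costs, num_nodes):
    """Finish time of the busiest node for a task -> node assignment."""
    loads = [0.0] * num_nodes
    for task, node in enumerate(assignment):
        loads[node] += task_costs[task]
    return max(loads)


def round_robin(num_tasks, num_nodes):
    """Baseline (assumption): hand out tasks to nodes in queue order."""
    return [task % num_nodes for task in range(num_tasks)]


def ga_schedule(task_costs, num_nodes, pop_size=40, generations=200,
                mutation_rate=0.05, seed=0):
    """Search for a low-makespan assignment of all queued tasks.

    A chromosome is a list whose i-th gene is the node chosen for task i
    (an assumed encoding, not taken from the paper).
    """
    rng = random.Random(seed)
    n = len(task_costs)

    def fitness(individual):
        return makespan(individual, task_costs, num_nodes)

    # Seed the population with the baseline so the result is never worse.
    population = [round_robin(n, num_nodes)]
    while len(population) < pop_size:
        population.append([rng.randrange(num_nodes) for _ in range(n)])

    best = min(population, key=fitness)
    for _ in range(generations):
        next_gen = [best[:]]  # elitism: carry the best individual over
        while len(next_gen) < pop_size:
            # Tournament selection of two parents (tournament size 3).
            p1 = min(rng.sample(population, 3), key=fitness)
            p2 = min(rng.sample(population, 3), key=fitness)
            # One-point crossover.
            cut = rng.randrange(1, n) if n > 1 else 0
            child = p1[:cut] + p2[cut:]
            # Mutation: move a task to a random node.
            for i in range(n):
                if rng.random() < mutation_rate:
                    child[i] = rng.randrange(num_nodes)
            next_gen.append(child)
        population = next_gen
        best = min(population, key=fitness)
    return best, fitness(best)


if __name__ == "__main__":
    costs = [4, 3, 3, 2, 2, 2, 1, 1]  # hypothetical task run-time estimates
    nodes = 3
    baseline = makespan(round_robin(len(costs), nodes), costs, nodes)
    plan, span = ga_schedule(costs, nodes)
    print("round-robin makespan:", baseline)
    print("GA makespan:", span, "assignment:", plan)
```

Because the baseline assignment is placed in the initial population and the best individual is carried into every generation, this sketch cannot return a makespan worse than round-robin on the same inputs. How close it gets to the optimum depends on the assumed parameters and on how accurate the cost estimates are.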
