A Heuristic Scheduling Algorithm for Distributed Systems with Workflow Constraints

Distributed systems become one of the most important computing platform because they can efficiently handle large amount of data with a high computing workload. However, the job scheduling in the distributed systems is not a trivial issue. It can be even more complex when dealing with workflow-based composite jobs, i.e., each job has multiple tasks with dependencies between them. As the job scheduling problem has been proven to be an NP-hard, we propose a trail-based algorithm; Large Trail First (LTF), which is an effective heuristic approach for scheduling problem in the distributed systems when workflows exist. In this paper, "trail" of each task is the number of remaining tasks in each workflow. Thus, the idea of the algorithm is that, for each workflow job, the task with the largest size of the trail will be executed earliest. The experimental results show that the proposed approach is more effective and efficient than the other three approaches including a well- known 2-approximation algorithm.

[1]  Pruet Boonma,et al.  A Heuristic Algorithm for Workflow-Based Job Scheduling in Decentralized Distributed Systems with Heterogeneous Resources , 2015, Computer and Information Science.

[2]  Yannis Manolopoulos,et al.  Decentralized execution of linear workflows over web services , 2011, Future Gener. Comput. Syst..

[3]  Ronald L. Graham,et al.  Optimal scheduling for two-processor systems , 1972, Acta Informatica.

[4]  Fatos Xhafa,et al.  From Meta-computing to Interoperable Infrastructures: A Review of Meta-schedulers for HPC, Grid and Cloud , 2012, 2012 IEEE 26th International Conference on Advanced Information Networking and Applications.

[5]  Pruet Boonma,et al.  Workflow-Based Composite Job Scheduling for Decentralized Distributed Systems , 2013, 2013 16th International Conference on Network-Based Information Systems.

[6]  Kinji Mori,et al.  Autonomous decentralized resource allocation for tracking dynamic load change , 2005, Proceedings Autonomous Decentralized Systems, 2005. ISADS 2005..

[7]  David P. Anderson,et al.  On correlated availability in Internet-distributed systems , 2008, 2008 9th IEEE/ACM International Conference on Grid Computing.

[8]  Matt Welsh,et al.  Decentralized, adaptive resource allocation for sensor networks , 2005, NSDI.

[9]  Al-Sakib Khan Pathan,et al.  Advancements in Distributed Computing and Internet Technologies: Trends and Issues , 2011 .

[10]  Bernardo A. Huberman,et al.  Tycoon: A Market-Based Resource Allocation System , 2004, ArXiv.