Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning

Cloud Computing provides an effective platform for executing large-scale and complex workflow applications with a pay-as-you-go model. Nevertheless, various challenges, especially its optimal scheduling for multiple conflicting objectives, are yet to be addressed properly. The existing multi-objective workflow scheduling approaches are still limited in many ways, e.g., encoding is restricted by prior experts’ knowledge when handling a dynamic real-time problem, which strongly influences the performance of scheduling. In this paper, we apply a deep-Q-network model in a multi-agent reinforcement learning setting to guide the scheduling of multi-workflows over infrastructure-as-a-service clouds. To optimize multi-workflow completion time and user’s cost, we consider a Markov game model, which takes the number of workflow applications and heterogeneous virtual machines as state input and the maximum completion time and cost as rewards. The game model is capable of seeking for correlated equilibrium between make-span and cost criteria without prior experts’ knowledge and converges to the correlated equilibrium policy in a dynamic real-time environment. To validate our proposed approach, we conduct extensive case studies based on multiple well-known scientific workflow templates and Amazon EC2 cloud. The experimental results clearly suggest that our proposed approach outperforms traditional ones, e.g., non-dominated sorting genetic algorithm-II, multi-objective particle swarm optimization, and game-theoretic-based greedy algorithms, in terms of optimality of scheduling plans generated.

[1]  Yueshen Xu,et al.  Collaborative Service Selection via Ensemble Learning in Mixed Mobile Network Environments , 2017, Entropy.

[2]  Sakshi Kaushal,et al.  A hybrid multi-objective Particle Swarm Optimization for scientific workflow scheduling , 2017, Parallel Comput..

[3]  Yusen Zhan,et al.  Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer , 2016, IJCAI.

[4]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[5]  Keith B. Hall,et al.  Correlated Q-Learning , 2003, ICML.

[6]  Jianping Fan,et al.  iPrivacy: Image Privacy Protection by Identifying Sensitive Objects via Deep Multi-Task Learning , 2017, IEEE Transactions on Information Forensics and Security.

[7]  Sanjay Kadam,et al.  A novel multi-objective bacteria foraging optimization algorithm (MOBFOA) for multi-objective scheduling , 2018, Appl. Soft Comput..

[8]  Shijun Liu,et al.  A Reinforcement Learning Based Workflow Application Scheduling Approach in Dynamic Cloud Environment , 2017, CollaborateCom.

[9]  Alexander Boukhanovsky,et al.  Hybrid Evolutionary Workflow Scheduling Algorithm for Dynamic Heterogeneous Distributed Computational Environment , 2014, SOCO-CISIS-ICEUTE.

[10]  Xiao Liu,et al.  A sufficient and necessary temporal violation handling point selection strategy in cloud workflow , 2018, Future Gener. Comput. Syst..

[11]  Dimitri P. Bertsekas,et al.  Feature-based aggregation and deep reinforcement learning: a survey and some new implementations , 2018, IEEE/CAA Journal of Automatica Sinica.

[12]  Xiaorong Li,et al.  Multi-Objective Game Theoretic Schedulingof Bag-of-Tasks Workflows on Hybrid Clouds , 2014, IEEE Transactions on Cloud Computing.

[13]  Yueshen Xu,et al.  Network Location-Aware Service Recommendation with Random Walk in Cyber-Physical Systems , 2017, Sensors.

[14]  Zhiping Peng,et al.  Multiple DAGs Workflow Scheduling Algorithm Based on Reinforcement Learning in Cloud Computing , 2015, ISICA.

[15]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[16]  Srikanth Kandula,et al.  Resource Management with Deep Reinforcement Learning , 2016, HotNets.

[17]  Jin Sun,et al.  Minimizing cost and makespan for workflow scheduling in cloud using fuzzy dominance sort based HEFT , 2019, Future Gener. Comput. Syst..

[18]  MengChu Zhou,et al.  Stochastic Modeling and Performance Analysis of Migration-Enabled and Error-Prone Clouds , 2015, IEEE Transactions on Industrial Informatics.

[19]  Jianping Fan,et al.  Leveraging Content Sensitiveness and User Trustworthiness to Recommend Fine-Grained Privacy Settings for Social Image Sharing , 2018, IEEE Transactions on Information Forensics and Security.

[20]  Qingsheng Zhu,et al.  Fluctuation-Aware and Predictive Workflow Scheduling in Cost-Effective Infrastructure-as-a-Service Clouds , 2018, IEEE Access.

[21]  Qingsheng Zhu,et al.  A Multi-stage Dynamic Game-Theoretic Approach for Multi-Workflow Scheduling on Heterogeneous Virtual Machines from Multiple Infrastructure-as-a-Service Clouds , 2018, SCC.

[22]  Kenli Li,et al.  Bi-objective workflow scheduling of the energy consumption and reliability in heterogeneous computing systems , 2017, Inf. Sci..

[23]  Y. Rui,et al.  Learning to Rank Using User Clicks and Visual Features for Image Retrieval , 2015, IEEE Transactions on Cybernetics.

[24]  Albert Y. Zomaya,et al.  Computation Offloading for Service Workflow in Mobile Cloud Computing , 2015, IEEE Transactions on Parallel and Distributed Systems.

[25]  Saeed Sharifian,et al.  A distributed load balancing and admission control algorithm based on Fuzzy type-2 and Game theory for large-scale SaaS cloud architectures , 2018, Future Gener. Comput. Syst..

[26]  Changyin Sun,et al.  An adaptive strategy via reinforcement learning for the prisoner U+02BC s dilemma game , 2018, IEEE/CAA Journal of Automatica Sinica.

[27]  Weinan Zhang,et al.  MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence , 2017, AAAI.

[28]  Prasanta K. Jana,et al.  A GSA based hybrid algorithm for bi-objective workflow scheduling in cloud computing , 2018, Future Gener. Comput. Syst..

[29]  MengChu Zhou,et al.  A Stochastic Approach to Analysis of Energy-Aware DVS-Enabled Cloud Datacenters , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[30]  Xiaofeng Liao,et al.  Reinforcement Learning for Constrained Energy Trading Games With Incomplete Information , 2017, IEEE Transactions on Cybernetics.

[31]  Carlos A. Coello Coello,et al.  Handling multiple objectives with particle swarm optimization , 2004, IEEE Transactions on Evolutionary Computation.

[32]  Jian Wan,et al.  Location-Aware Service Recommendation With Enhanced Probabilistic Matrix Factorization , 2018, IEEE Access.

[33]  Lenz Belzner,et al.  Optimization of global production scheduling with deep reinforcement learning , 2018 .

[34]  Lei Wu,et al.  Scheduling Multi-Workflows Over Heterogeneous Virtual Machines With a Multi-Stage Dynamic Game-Theoretic Approach , 2018, Int. J. Web Serv. Res..

[35]  Sanyam Kapoor,et al.  Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches , 2018, ArXiv.

[36]  Albert Y. Zomaya,et al.  GA-ETI: An enhanced genetic algorithm for the scheduling of scientific workflows in cloud environments , 2016, J. Comput. Sci..

[37]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[38]  Yunni Xia,et al.  Multi-Objective Optimization for Location Prediction of Mobile Devices in Sensor-Based Applications , 2018, IEEE Access.