Paper Information - Multi-objective Reinforcement Learning for Responsive Grids

Multi-objective Reinforcement Learning for Responsive Grids

Grids organize resource sharing, a fundamental requirement of large scientific collaborations. Seamless integration of Grids into everyday use requires responsiveness, which can be provided by elastic Clouds, in the Infrastructure as a Service (IaaS) paradigm. This paper proposes a model-free resource provisioning strategy supporting both requirements. Provisioning is modeled as a continuous action-state space, multi-objective reinforcement learning (RL) problem, under realistic hypotheses; simple utility functions capture the high level goals of users, administrators, and shareholders. The model-free approach falls under the general program of autonomic computing, where the incremental learning of the value function associated with the RL model provides the so-called feedback loop. The RL model includes an approximation of the value function through an Echo State Network. Experimental validation on a real data-set from the EGEE Grid shows that introducing a moderate level of elasticity is critical to ensure a high level of user satisfaction.
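The abstract names two ingredients without giving their details: a multi-objective reward and a value function approximated by an Echo State Network. The following is a minimal sketch of how those pieces could fit together, not the paper's implementation. Everything specific in it is an assumption made for illustration: the weighted-sum `scalar_reward`, the class `EchoStateValue`, the reservoir size, the row-sum rescaling of the recurrent weights, the TD(0) update, and the two input features.

```python
import math
import random


def scalar_reward(user_utility, admin_utility, weight=0.5):
    """Weighted sum of two objectives (hypothetical scalarization)."""
    return weight * user_utility + (1.0 - weight) * admin_utility


class EchoStateValue:
    """Tiny echo state network used as a value-function approximator."""

    def __init__(self, n_inputs, n_reservoir=20, scale=0.9, seed=0):
        rng = random.Random(seed)
        # Fixed random input weights (never trained).
        self.w_in = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                     for _ in range(n_reservoir)]
        # Fixed random recurrent weights, rescaled so that every row's
        # absolute sum is at most `scale` < 1. With tanh units this makes
        # the state update a contraction, so old inputs fade out.
        raw = [[rng.uniform(-1.0, 1.0) for _ in range(n_reservoir)]
               for _ in range(n_reservoir)]
        max_row = max(sum(abs(x) for x in row) for row in raw)
        self.w_res = [[scale * x / max_row for x in row] for row in raw]
        # Linear readout: the only trained parameters.
        self.w_out = [0.0] * n_reservoir
        self.state = [0.0] * n_reservoir

    def step(self, inputs):
        """Advance the reservoir by one time step and return its state."""
        self.state = [
            math.tanh(sum(a * u for a, u in zip(w_in_row, inputs))
                      + sum(b * s for b, s in zip(w_res_row, self.state)))
            for w_in_row, w_res_row in zip(self.w_in, self.w_res)
        ]
        return self.state

    def value(self, features):
        """Estimated value: linear readout of the reservoir state."""
        return sum(w * f for w, f in zip(self.w_out, features))

    def td_update(self, features, reward, next_value, alpha=0.05, gamma=0.9):
        """One TD(0) step on the readout weights; returns the TD error."""
        delta = reward + gamma * next_value - self.value(features)
        self.w_out = [w + alpha * delta * f
                      for w, f in zip(self.w_out, features)]
        return delta


# Hypothetical inputs: normalized queue load and fraction of resources allocated.
net = EchoStateValue(n_inputs=2)
features = net.step([0.5, 0.2])
reward = scalar_reward(user_utility=1.0, admin_utility=0.0, weight=0.25)
td_error = net.td_update(features, reward, next_value=0.0)
```

Only the readout weights are updated here, which is what keeps incremental learning cheap in an echo state network; the reservoir itself stays fixed after initialization.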
