A dimensionality reduction approach for many-objective Markov Decision Processes: Application to a water reservoir operation problem

Abstract The operation of complex environmental systems usually accounts for multiple, conflicting objectives, whose presence imposes to explicitly consider the preference structure of the parties involved. Multi-objective Markov Decision Processes are a useful mathematical framework for the resolution of such sequential, decision-making problems. However, the computational requirements of the available optimization techniques limit their application to problems involving few objectives. In real-world applications it is therefore common practice to select few, representative objectives with respect to which the problem is solved. This paper proposes a dimensionality reduction approach, based on the Non-negative Principal Component Analysis (NPCA), to aggregate the original objectives into a reduced number of principal components, with respect to which the optimization problem is solved. The approach is evaluated on the daily operation of a multi-purpose water reservoir (Tono Dam, Japan) with 10 operating objectives, and compared against a 5-objectives formulation of the same problem. Results show that the NPCA-based approach provides a better representation of the Pareto front, especially in terms of consistency and solution diversity.

[1]  Thomas L. Saaty,et al.  Parametric Objective Function (Part 1) , 1954, Oper. Res..

[2]  D. White Multi-objective infinite-horizon discounted Markov decision processes , 1982 .

[3]  Peter J. Fleming,et al.  Many-Objective Optimization: An Engineering Design Perspective , 2005, EMO.

[4]  Kalyanmoy Deb,et al.  Reference point based multi-objective optimization using evolutionary algorithms , 2006, GECCO.

[5]  Enrico Weber,et al.  Integrated and Participatory Water Resources Management - Practice , 2007 .

[6]  Patrick M. Reed,et al.  Many‐objective groundwater monitoring network design using bias‐aware ensemble Kalman filtering, evolutionary optimization, and visual analytics , 2011 .

[7]  Thomas A. Henzinger,et al.  Markov Decision Processes with Multiple Objectives , 2006, STACS.

[8]  R. Hogarth Beyond discrete biases: Functional and dysfunctional aspects of judgmental heuristics. , 1981 .

[9]  Suzana Dragicevic,et al.  GIS and Intelligent Agents for Multiobjective Natural Resource Allocation: A Reinforcement Learning Approach , 2009, Trans. GIS.

[10]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[11]  Amnon Shashua,et al.  Nonnegative Sparse PCA , 2006, NIPS.

[12]  Claudio Carnevale,et al.  Surrogate models to compute optimal air quality planning policies at a regional scale , 2012, Environ. Model. Softw..

[13]  Andrea Castelletti,et al.  Planning the Optimal Operation of a Multioutlet Water Reservoir with Water Quality and Quantity Targets , 2014 .

[14]  Nazanin Shabani Incorporating flood control rule curves of the Columbia River hydroelectric system in a multireservoir reinforcement learning optimization model , 2009 .

[15]  D. J. White,et al.  Further Real Applications of Markov Decision Processes , 1988 .

[16]  John M. Flach,et al.  MGA: a decision support system for complex, incompletely defined problems , 1990, IEEE Trans. Syst. Man Cybern..

[17]  Daniel P. Loucks,et al.  Reliability, resiliency, and vulnerability criteria for water resource system performance evaluation , 1982 .

[18]  Marcello Restelli,et al.  Tree-based fitted Q-iteration for multi-objective Markov decision processes in water resource management , 2013 .

[19]  Orazio Giustolisi,et al.  An evolutionary multiobjective strategy for the effective management of groundwater resources , 2008 .

[20]  M. Farina,et al.  On the optimal solution definition for many-criteria optimization problems , 2002, 2002 Annual Meeting of the North American Fuzzy Information Processing Society Proceedings. NAFIPS-FLINT 2002 (Cat. No. 02TH8622).

[21]  Pekka Korhonen,et al.  A pareto race , 1988 .

[22]  Joseph R. Kasprzyk,et al.  Many objective robust decision making for complex environmental systems undergoing change , 2012, Environ. Model. Softw..

[23]  Eckart Zitzler,et al.  Improving hypervolume-based multiobjective evolutionary algorithms by using objective reduction methods , 2007, 2007 IEEE Congress on Evolutionary Computation.

[24]  Marco Laumanns,et al.  Performance assessment of multiobjective optimizers: an analysis and review , 2003, IEEE Trans. Evol. Comput..

[25]  Raghavan Srinivasan,et al.  Efficient multi-objective calibration of a computationally intensive hydrologic model with parallel computing software in Python , 2013, Environ. Model. Softw..

[26]  M. Franssen Arrow’s theorem, multi-criteria decision problems and multi-attribute preferences in engineering design , 2005 .

[27]  S. Galelli,et al.  Dealing with many-objective problems in water resources planning and management , 2011 .

[28]  Wenyan Wu,et al.  Multiobjective optimization of water distribution systems accounting for economic cost, hydraulic reliability, and greenhouse gas emissions , 2013 .

[29]  Andrea Castelletti,et al.  Many-Objective Operation of Selective Withdrawal Reservoirs Including Water Quality Targets , 2013 .

[30]  Demetris Koutsoyiannis,et al.  Evaluation of the parameterization‐simulation‐optimization approach for the control of reservoir systems , 2003 .

[31]  Warren B. Powell,et al.  “Approximate dynamic programming: Solving the curses of dimensionality” by Warren B. Powell , 2007, Wiley Series in Probability and Statistics.

[32]  Andrzej P. Wierzbicki,et al.  The Use of Reference Objectives in Multiobjective Optimization , 1979 .

[33]  Joseph R. Kasprzyk,et al.  Evolutionary multiobjective optimization in water resources: The past, present, and future , 2012 .

[34]  C. Gettys,et al.  Plausibility assessments in hypothesis generation , 1986 .

[35]  R. Bellman Dynamic programming. , 1957, Science.

[36]  Kaisa Miettinen,et al.  Wastewater treatment plant design and operation under multiple conflicting objective functions , 2013, Environ. Model. Softw..

[37]  Joshua D. Knowles,et al.  On metrics for comparing nondominated sets , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[38]  S. Ross A First Course in Probability , 1977 .

[39]  Eckart Zitzler,et al.  Are All Objectives Necessary? On Dimensionality Reduction in Evolutionary Multiobjective Optimization , 2006, PPSN.

[40]  Eckart Zitzler,et al.  Objective Reduction in Evolutionary Multiobjective Optimization: Theory and Applications , 2009, Evolutionary Computation.

[41]  Marcello Restelli,et al.  Tree‐based reinforcement learning for optimal water reservoir operation , 2010 .

[42]  Steven W. Effler,et al.  Simulation of Operations and Water Quality Performance of Reservoir Multilevel Intake Configurations , 2007 .

[43]  Alexander V. Lotov,et al.  Interactive Decision Maps: Approximation and Visualization of Pareto Frontier , 2004 .

[44]  John L. Grace,et al.  Selective Withdrawal from Man‐Made Lakes , 2013 .

[45]  Andrea Castelletti,et al.  Assessing the predictive capability of randomized tree-based ensembles in streamflow modelling , 2013 .

[46]  Darrell G. Fontane,et al.  Use of Multiobjective Particle Swarm Optimization in Water Resources Management , 2008 .

[47]  Fabio Zagonari,et al.  A heterogeneous multi-criteria multi-expert decision-support system for scoring combinations of flood mitigation and recovery options , 2013, Environ. Model. Softw..

[48]  Taesoon Kim,et al.  Single-reservoir operating rules for a year using multiobjective genetic algorithm , 2008 .

[49]  Joseph R. Kasprzyk,et al.  Many-objective de Novo water supply portfolio planning under deep uncertainty , 2012, Environ. Model. Softw..

[50]  Paulo Chaves,et al.  Deriving reservoir operational strategies considering water quantity and quality objectives by stochastic fuzzy neural networks , 2007 .

[51]  Carlos A. Coello Coello,et al.  Objective reduction using a feature selection technique , 2008, GECCO '08.

[52]  Patrick M. Reed,et al.  A framework for Visually Interactive Decision-making and Design using Evolutionary Multi-objective Optimization (VIDEO) , 2007, Environ. Model. Softw..

[53]  Pierre Geurts,et al.  Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[54]  Evan Dekker,et al.  Empirical evaluation methods for multiobjective reinforcement learning algorithms , 2011, Machine Learning.

[55]  Bart De Schutter,et al.  Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .

[56]  Andrea Castelletti,et al.  Integrated and Participatory Water Resources Management. Theory , 2007 .

[57]  Carlos A. Coello Coello,et al.  Online Objective Reduction to Deal with Many-Objective Problems , 2009, EMO.

[58]  Patrice Perny,et al.  On Finding Compromise Solutions in Multiobjective Markov Decision Processes , 2010, ECAI.

[59]  L. Lasdon,et al.  On a bicriterion formation of the problems of integrated system identification and system optimization , 1971 .

[60]  Kalyanmoy Deb,et al.  Reference point based multi-objective optimization using evolutionary algorithms , 2006, GECCO '06.

[61]  Patrick M. Reed,et al.  Many objective visual analytics: rethinking the design of complex engineered systems , 2013 .

[62]  C. Gettys,et al.  Hypothesis plausibility and hypothesis generation , 1979 .

[63]  John W. Labadie,et al.  Optimal control of reservoir discharge quality through selective withdrawal , 1981 .

[64]  O. Larichev Cognitive validity in design of decision‐aiding techniques , 1992 .

[65]  I. Jolliffe Principal Component Analysis , 2002 .

[66]  John Yearwood,et al.  On the Limitations of Scalarisation for Multi-objective Reinforcement Learning of Pareto Fronts , 2008, Australasian Conference on Artificial Intelligence.

[67]  Marcello Restelli,et al.  A multiobjective reinforcement learning approach to water resources systems operation: Pareto frontier approximation in a single run , 2013 .

[68]  Srini Narayanan,et al.  Learning all optimal policies with multiple criteria , 2008, ICML '08.