Partially Observable Markov Decision Processes incorporating epistemic uncertainties

Abstract The use of Markov Decision Processes for Inspection Maintenance and Rehabilitation of civil engineering structures relies on the use of several transition matrices related to the stochastic degradation process, maintenance actions and imperfect inspections. Point estimators for these matrices are usually used and they are evaluated using statistical inference methods and/or expert evaluation methods. Thus, considerable epistemic uncertainty often veils the true values of these matrices. Our contribution through this paper is threefold. First, we present a methodology for incorporating epistemic uncertainties in dynamic programming algorithms used to solve finite horizon Markov Decision Processes (which may be partially observable). Second, we propose a methodology based on the use of Dirichlet distributions which answers, in our sense, much of the controversy found in the literature about estimating Markov transition matrices. Third, we show how the complexity resulting from the use of Monte-Carlo simulations for the transition matrices can be greatly overcome in the framework of dynamic programming. The proposed model is applied to concrete bridge under degradation, in order to provide the optimal strategy for inspection and maintenance. The influence of epistemic uncertainties on the optimal solution is underlined through sensitivity analysis regarding the input data.

[1]  J. K. Satia,et al.  Markovian Decision Processes with Uncertain Transition Probabilities , 1973, Oper. Res..

[2]  Olivier Poch,et al.  A maximum likelihood approximation method for Dirichlet's parameter estimation , 2008, Comput. Stat. Data Anal..

[3]  James V. Carnahan Analytical Framework for Optimizing Pavement Maintenance , 1988 .

[4]  Ross B. Corotis Risk communication with generalized uncertainty and linguistics , 2009 .

[5]  Jeff R. Wright,et al.  Civil And Environmental Systems Engineering , 1996 .

[6]  Carl Muehlmann,et al.  Trends in public infrastructure spending , 1999 .

[7]  A. Kiureghian,et al.  Aleatory or epistemic? Does it matter? , 2009 .

[8]  James E. Eckles,et al.  Optimum Maintenance with Incomplete Information , 1968, Oper. Res..

[9]  Samer M. Madanat,et al.  Robust Maintenance Policies for Markovian Systems under Model Uncertainty , 2005, Comput. Aided Civ. Infrastructure Eng..

[10]  Hongzhou Wang,et al.  A survey of maintenance policies of deteriorating systems , 2002, Eur. J. Oper. Res..

[11]  Alaa Chateauneuf,et al.  A generalised partially observable Markov decision process updated by decision trees for maintenance optimisation , 2011 .

[12]  Mingxiang Jiang,et al.  Modeling of risk-based inspection, maintenance and life-cycle cost with partially observable Markov decision processes , 2005 .

[13]  H. Pham,et al.  Invited reviewImperfect maintenance , 1996 .

[14]  André D. Orcesi,et al.  A bridge network maintenance framework for Pareto optimization of stakeholders/users costs , 2010, Reliab. Eng. Syst. Saf..

[15]  Samer Madanat,et al.  Computation of Infrastructure Transition Probabilities using Stochastic Duration Models , 2002 .

[16]  G. Monahan State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 1982 .

[17]  Ward Whitt,et al.  Heavy-Traffic Limits for the G/H2*/n/mQueue , 2005, Math. Oper. Res..

[18]  Samer Madanat,et al.  Optimization of inspection and maintenance decisions for infrastructure facilities under performance model uncertainty: A quasi-Bayes approach , 2008 .

[19]  Robert Givan,et al.  Bounded-parameter Markov decision processes , 2000, Artif. Intell..

[20]  Eduard Hofer,et al.  When to separate uncertainties and when not to separate , 1996 .

[21]  Samer Madanat,et al.  Estimation of infrastructure transition probabilities from condition rating data , 1995 .

[22]  Laurent El Ghaoui,et al.  Robust Control of Markov Decision Processes with Uncertain Transition Matrices , 2005, Oper. Res..

[23]  Garud Iyengar,et al.  Robust Dynamic Programming , 2005, Math. Oper. Res..

[24]  Dimitri A. Grivas,et al.  Method for Estimating Transition Probability in Bridge Deterioration Models , 1998 .

[25]  Dan M. Frangopol,et al.  Probabilistic Lifetime-Oriented Multiobjective Optimization of Bridge Maintenance: Single Maintenance Type , 2006 .

[26]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[27]  George E. Monahan,et al.  A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 2007 .

[28]  Chelsea C. White,et al.  Markov Decision Processes with Imprecise Transition Probabilities , 1994, Oper. Res..

[29]  Shie Mannor,et al.  Percentile Optimization for Markov Decision Processes with Parameter Uncertainty , 2010, Oper. Res..

[30]  Samer Madanat,et al.  History-Dependent Bridge Deck Maintenance and Replacement Optimization with Markov Decision Processes , 2007 .

[31]  M. J. Kallen,et al.  Markov processes for maintenance optimization of civil infrastructure in the Netherlands , 2007 .