Stochastic shortest path problems with associative accumulative criteria

We consider a stochastic shortest path problem with associative criteria in which for each node of a graph we choose a probability distribution over the set of successor nodes so as to reach a given target node optimally. We formulate such a problem as an associative Markov decision processes. We show that an optimal value function is a unique solution to an optimality equation and find an optimal stationary policy. Also we give a value iteration method and a policy improvement method.

[1]  Toshiharu Fujita,et al.  CONDITIONAL DECISION-MAKING IN FUZZY ENVIRONMENT , 1999 .

[2]  N. G. F. Sancho,et al.  Routing problems and Markovian decision processes , 1985 .

[3]  D. White Minimizing a Threshold Probability in Discounted Markov Decision Processes , 1993 .

[4]  Cyrus Derman,et al.  Finite State Markovian Decision Processes , 1970 .

[5]  Yoshio Ohtsubo,et al.  Optimal policy for minimizing risk models in Markov decision processes , 2002 .

[6]  Yoshio Ohtsubo,et al.  Equivalence classes for optimizing risk models in Markov decision processes , 2004, Math. Methods Oper. Res..

[7]  Hans-Jürgen Zimmermann,et al.  Fuzzy Set Theory - and Its Applications , 1985 .

[8]  Yoshio Otsubo,et al.  MULTISTAGE MARKOV DECISION PROCESSES WITH MINIMUM CRITERIA OF RANDOM REWARDS , 2006 .

[9]  Yukihiro Maruyama AN INVARIANT IMBEDDING APPROACH TO ASSOCIATIVE SHORTEST PATH PROBLEMS , 1999 .

[10]  John N. Tsitsiklis,et al.  An Analysis of Stochastic Shortest Path Problems , 1991, Math. Oper. Res..

[11]  C. Derman On Sequential Decisions and Markov Chains , 1962 .

[12]  Yukihiro Maruyama ASSOCIATIVE SHORTEST AND LONGEST PATH PROBLEMS , 1999 .

[13]  Stephen P. Brooks,et al.  Markov Decision Processes. , 1995 .

[14]  Seiichi Iwamoto,et al.  On Markov policies for minimax decision processes , 2001 .

[15]  Yoshio Ohtsubo,et al.  Optimal threshold probability in undiscounted Markov decision processes with a target set , 2004, Appl. Math. Comput..

[16]  Congbin Wu,et al.  Minimizing risk models in Markov decision processes with policies depending on target values , 1999 .

[17]  Richard Bellman,et al.  Decision-making in fuzzy environment , 2012 .

[18]  L. A. Zadeh,et al.  Optimal Pursuit Strategies in Discrete-State Probabilistic Systems , 1962 .

[19]  Seiichi Iwamoto,et al.  STOCHASTIC DECISION-MAKING IN A FUZZY ENVIRONMENT , 1995 .

[20]  Yoshio Ohtsubo Minimizing risk models in stochastic shortest path problems , 2003, Math. Methods Oper. Res..