State Space Search for Risk-Averse Agents

We investigate search problems under risk in state-space graphs, with the aim of finding optimal paths for risk-averse agents. We consider problems where uncertainty stems from a set of scenarios with known probabilities, each with a different impact on the costs of solution paths. We consider several non-linear decision criteria, namely expected utility (EU), rank-dependent utility (RDU), and Yaari's model, to express risk-averse preferences; we then provide a general optimization procedure for such criteria, based on a path-ranking algorithm applied to a scalarized valuation of the graph. We also consider partial preference models such as second-order stochastic dominance (SSD) and propose a multiobjective search algorithm to determine SSD-optimal paths. Finally, the numerical performance of our algorithms is presented and discussed.
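
The idea of ranking paths on a scalarized valuation can be illustrated with a minimal Python sketch. It is not the authors' procedure, only a toy instance under stated assumptions: the graph, the scenario probabilities, the quadratic disutility, and all function names are hypothetical, and brute-force enumeration followed by sorting stands in for a real path-ranking (k-shortest-paths) algorithm. Paths are examined in increasing order of expected cost, and the search stops once Jensen's inequality, u(E[C]) <= E[u(C)] for convex u, shows that no later path can beat the best expected disutility found so far.

```python
# Hypothetical toy instance: two scenarios with known probabilities.
PROBS = (0.5, 0.5)

# Acyclic state-space graph: node -> list of (successor, cost in each scenario).
GRAPH = {
    "s": [("a", (1, 9)), ("b", (5, 5))],
    "a": [("t", (1, 9))],
    "b": [("t", (5, 6))],
    "t": [],
}


def all_paths(graph, node, goal):
    """Yield (path, cost_vector) for every path from node to goal (brute force)."""
    if node == goal:
        yield [node], tuple(0 for _ in PROBS)
        return
    for succ, cost in graph[node]:
        for path, tail in all_paths(graph, succ, goal):
            yield [node] + path, tuple(c + t for c, t in zip(cost, tail))


def expected(values):
    """Probability-weighted average over scenarios (the scalarized valuation)."""
    return sum(p * v for p, v in zip(PROBS, values))


def disutility(cost):
    """Assumed convex, nondecreasing disutility on nonnegative costs (risk aversion)."""
    return cost ** 2


def expected_disutility(costs):
    """EU criterion for costs: expected disutility, to be minimized."""
    return expected([disutility(c) for c in costs])


def risk_averse_best(graph, start, goal):
    """Scan paths by increasing expected cost; stop when the Jensen bound closes."""
    ranked = sorted(all_paths(graph, start, goal), key=lambda pc: expected(pc[1]))
    best_path, best_value = None, float("inf")
    for path, costs in ranked:
        # Lower bound on the criterion for this path and every later one.
        if disutility(expected(costs)) >= best_value:
            break
        value = expected_disutility(costs)
        if value < best_value:
            best_path, best_value = path, value
    return best_path, best_value


if __name__ == "__main__":
    print(risk_averse_best(GRAPH, "s", "t"))
```

In this toy graph the path through `a` has the lower expected cost (10 versus 10.5) but a much wider spread across scenarios, so the risk-averse criterion selects the path through `b`.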
