Sequential Decision Making under Uncertain Future Preferences

We present a model of sequential decision making under uncertain future preferences, assuming that the evolution of the trade-off weight vector is constrained by set inclusion. We consider different mechanisms for selecting the next trade-off weight vector: a mechanism working against the decision maker DM M, a mechanism trying to aid the DM B, and briefly a probabilistic mechanism P. Piecewise linear, convex upper and lower bounds on the optimal value function are determined for mechanisms M and B. Conditions are given that guarantee that the optimal value function is piecewise linear and convex for all three mechanisms. Procedures for computing these bounds and determining associated strategies are presented. A hypothetical situation involving an individual seeking promotion is used to illustrate the model and the numerical techniques.

[1]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Stochastic Control , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Arthur M. Geoffrion,et al.  An Interactive Approach for Multi-Criterion Optimization, with an Application to the Operation of an Academic Department , 1972 .

[3]  Juan Figueras The restriction set approach to stochastic decision systems , 1972 .

[4]  D. Bertsekas,et al.  Sufficiently informative functions and the minimax feedback control of uncertain dynamic systems , 1973 .

[5]  L. G. Mitten Preference Order Dynamic Programming , 1974 .

[6]  T. H. Matheiss,et al.  A Survey and Comparison of Methods for Finding All Vertices of Convex Polyhedral Sets , 1980, Math. Oper. Res..

[7]  David M. Kreps Decision Problems with Expected Utility Criteria, II: Stationarity , 1977, Math. Oper. Res..

[8]  Chelsea C. White,et al.  Optimal Consumption and Portfolio Strategies in a Discrete-Time Model with Summary-Dependent Preferences , 1982, Journal of Financial and Quantitative Analysis.

[9]  Gerbard Tintner,et al.  Stochastic programming and stochastic control , 1975 .

[10]  S. Zionts,et al.  An Interactive Programming Method for Solving the Multiple Criteria Problem , 1976 .

[11]  H. S. Witsenhausen On The Uncertainty of Future Preferences , 1974 .

[12]  J. March Bounded rationality, ambiguity, and the engineering of choice , 1978 .

[13]  Edward J. Sondik,et al.  The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..

[14]  M. J. Sobel Ordinal Dynamic Programming , 1975 .

[15]  Louis J. Cherene Set Valued Dynamical Systems and Economic Flow , 1978 .

[16]  Joseph J. Talavage,et al.  A Tradeoff Cut Approach to Multiple Objective Optimization , 1980, Oper. Res..