Utility Theory for Sequential Decision Making
[1] Doina Precup, et al. On the Expressivity of Markov Reward, 2021, NeurIPS.
[2] Silviu Pitis, et al. Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach, 2019, AAAI.
[3] J. Schreiber. Foundations of Statistics, 2016.
[4] F. Ramsey. Truth and Probability, 2016.
[5] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[6] E. Altman. Constrained Markov Decision Processes, 1999.
[7] Peter Norvig, et al. Artificial Intelligence: A Modern Approach, 1995.
[8] John B. Kidd, et al. Decisions with Multiple Objectives: Preferences and Value Tradeoffs, 1977.
[9] S. C. Jaquette. A Utility Criterion for Markov Decision Processes, 1976.
[10] D. Bernoulli. Specimen theoriae novae de mensura sortis: translated into German and English, 1967.
[11] D. Michie. Game-Playing and Game-Learning Automata, 1966.
[12] T. Koopmans. Stationary Ordinal Utility and Impatience, 1960.
[13] G. Debreu. Topological Methods in Cardinal Utility Theory, 1959.
[14] D. Bernoulli. Exposition of a New Theory on the Measurement of Risk, 1954.
[15] E. Rowland. Theory of Games and Economic Behavior, 1946, Nature.
[16] B. D. Finetti. La prévision: ses lois logiques, ses sources subjectives, 1937.