A Tutorial on the Cross-Entropy Method
Shie Mannor | Pieter-Tjerk de Boer | Dirk P. Kroese | Reuven Y. Rubinstein
[1] David S. Johnson,et al. Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .
[2] M. Garey, D. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness , Freeman, 1979 .
[3] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[4] David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .
[5] Mihalis Yannakakis,et al. Optimization, approximation, and complexity classes , 1991, STOC '88.
[6] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[7] Charles Leake,et al. Discrete Event Systems: Sensitivity Analysis and Stochastic Optimization by the Score Function Method , 1994 .
[8] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[9] Giovanni Righini,et al. Heuristics from Nature for Hard Combinatorial Optimization Problems , 1996 .
[10] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[11] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[12] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[13] Reuven Y. Rubinstein,et al. Optimization of computer simulation models with rare events , 1997 .
[14] Reuven Y. Rubinstein,et al. Modern simulation and modeling , 1998 .
[15] C. Voudouris,et al. Guided Local Search — an Illustrative Example in Function Optimisation , 1998 .
[16] Luca Maria Gambardella,et al. Ant Algorithms for Discrete Optimization , 1999, Artificial Life.
[17] R. Rubinstein. The Cross-Entropy Method for Combinatorial and Continuous Optimization , 1999 .
[18] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[19] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[20] Walter J. Gutjahr,et al. A Graph-based Ant System and its convergence , 2000, Future Gener. Comput. Syst..
[21] Pieter Tjerk de Boer,et al. Analysis and efficient simulation of queueing models of telecommunications systems , 2000 .
[22] Leyuan Shi,et al. Nested Partitions Method for Global Optimization , 2000, Oper. Res..
[23] Andrew G. Barto,et al. Robot Weightlifting By Direct Policy Search , 2001, IJCAI.
[24] Bjarne E. Helvik,et al. Using the Cross-Entropy Method to Guide/Govern Mobile Agent's Path Finding in Networks , 2001, MATA.
[25] R. Rubinstein. Combinatorial Optimization, Cross-Entropy, Ants and Rare Events , 2001 .
[26] Peter L. Bartlett,et al. Experiments with Infinite-Horizon, Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..
[27] Goldberg,et al. Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.
[28] Pieter-Tjerk de Boer,et al. Estimating buffer overflows in three stages using cross-entropy , 2002, Proceedings of the Winter Simulation Conference.
[29] Dirk P. Kroese,et al. Sequence alignment by rare event simulation , 2002, Proceedings of the Winter Simulation Conference.
[30] Dirk P. Kroese,et al. SABRES: Sequence Alignment By Rare Event Simulation , 2002 .
[31] Reuven Y. Rubinstein,et al. Cross-entropy and rare events for maximal cut and partition problems , 2002, TOMC.
[32] J. Wade Davis,et al. Statistical Pattern Recognition , 2003, Technometrics.
[33] Emile H. L. Aarts,et al. Simulated annealing and Boltzmann machines , 2003 .
[34] Reuven Y. Rubinstein,et al. Rare event estimation for static models via cross-entropy and importance sampling , 2003 .
[35] Shie Mannor,et al. The Cross Entropy Method for Fast Policy Search , 2003, ICML.
[36] Vijay R. Konda,et al. On Actor-Critic Algorithms , 2003, SIAM J. Control. Optim..
[37] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[38] Dirk P. Kroese,et al. The Transform Likelihood Ratio Method for Rare Event Simulation with Heavy Tails , 2004, Queueing Syst. Theory Appl..
[39] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[40] Dirk P. Kroese,et al. A Fast Cross-Entropy Method for Estimating Buffer Overflows in Queueing Networks , 2004, Manag. Sci..
[41] Dirk P. Kroese,et al. Heavy Tails, Importance Sampling and Cross-Entropy , 2005 .
[42] Dirk P. Kroese,et al. Application of the Cross-Entropy Method to the Buffer Allocation Problem in a Simulation-Based Environment , 2005, Ann. Oper. Res..
[43] Avraham Shtub,et al. Managing Stochastic, Finite Capacity, Multi-Project Systems through the Cross-Entropy Methodology , 2005, Ann. Oper. Res..
[44] Ad Ridder,et al. Importance Sampling Simulations of Markovian Reliability Systems Using Cross-Entropy , 2005, Ann. Oper. Res..
[45] Shie Mannor,et al. Basis Function Adaptation in Temporal Difference Reinforcement Learning , 2005, Ann. Oper. Res..
[46] L. Margolin,et al. On the Convergence of the Cross-Entropy Method , 2005, Ann. Oper. Res..
[47] Miro Kraetzl,et al. The Cross-Entropy Method for Network Reliability Estimation , 2005, Ann. Oper. Res..
[48] Tito Homem-de-Mello,et al. Solving the Vehicle Routing Problem with Stochastic Demands using the Cross-Entropy Method , 2005, Ann. Oper. Res..
[49] Lih-Yuan Deng,et al. The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation, and Machine Learning , 2006, Technometrics.