Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes
暂无分享,去创建一个
Marek Petrik | Shlomo Zilberstein | Gavin Taylor | Ronald Parr | Ronald E. Parr | Gavin Taylor | S. Zilberstein | Marek Petrik
[1] Terence Tao,et al. The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.
[2] Kazuo Tanaka,et al. An approach to fuzzy control of nonlinear systems: stability and design issues , 1996, IEEE Trans. Fuzzy Syst..
[3] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[4] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[5] P. Schweitzer,et al. Generalized polynomial approximations in Markovian decision processes , 1985 .
[6] Andrew Y. Ng,et al. Regularization and feature selection in least-squares temporal difference learning , 2009, ICML '09.
[7] Sridhar Mahadevan,et al. Learning Representation and Control in Markov Decision Processes: New Frontiers , 2009, Found. Trends Mach. Learn..
[8] Gavin Taylor,et al. Kernelized value function approximation for reinforcement learning , 2009, ICML '09.
[9] Gareth M. James,et al. DASSO: connections between the Dantzig selector and lasso , 2009 .
[10] Robert J. Vanderbei,et al. Linear Programming: Foundations and Extensions , 1998, Kluwer international series in operations research and management service.
[11] Justin K. Romberg,et al. Dantzig selector homotopy with dynamic measurements , 2009, Electronic Imaging.
[12] Benjamin Van Roy,et al. On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming , 2004, Math. Oper. Res..
[13] Shie Mannor,et al. Regularized Policy Iteration , 2008, NIPS.
[14] Benjamin Van Roy,et al. The Linear Programming Approach to Approximate Dynamic Programming , 2003, Oper. Res..
[15] Marek Petrik,et al. Constraint relaxation in approximate linear programs , 2009, ICML '09.
[16] Vivek F. Farias,et al. A Smoothed Approximate Linear Program , 2009, NIPS.
[17] Lihong Li,et al. Analyzing feature generation for value-function approximation , 2007, ICML '07.
[18] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[19] Preben Alstrøm,et al. Learning to Drive a Bicycle Using Reinforcement Learning and Shaping , 1998, ICML.