Robust Regression for Safe Exploration in Control
Soon-Jo Chung | Yisong Yue | Anima Anandkumar | Anqi Liu | Guanya Shi
[1] Sida I. Wang, et al. Dropout Training as Adaptive Regularization, 2013, NIPS.
[2] David Rolnick, et al. Measuring and regularizing networks in function space, 2018, ICLR.
[3] Yisong Yue, et al. Smooth Imitation Learning for Online Sequence Prediction, 2016, ICML.
[4] Javier García, et al. Safe Exploration of State and Action Spaces in Reinforcement Learning, 2012, J. Artif. Intell. Res.
[5] Pieter Abbeel, et al. Safe Exploration in Markov Decision Processes, 2012, ICML.
[6] Jonathon Shlens, et al. Explaining and Harnessing Adversarial Examples, 2014, ICLR.
[7] Gábor Orosz, et al. End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks, 2019, AAAI.
[8] Soon-Jo Chung, et al. Neural Lander: Stable Drone Landing Control Using Learned Dynamics, 2018, 2019 International Conference on Robotics and Automation (ICRA).
[9] P. Olver. Nonlinear Systems, 2013.
[10] Yang Song, et al. Improving the Robustness of Deep Neural Networks via Stability Training, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Alkis Gotovos, et al. Safe Exploration for Optimization with Gaussian Processes, 2015, ICML.
[12] Andreas Krause, et al. Safe Model-based Reinforcement Learning with Stability Guarantees, 2017, NIPS.
[13] Andreas Krause, et al. Safe controller optimization for quadrotors with Gaussian processes, 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).
[14] Soon-Jo Chung, et al. Trajectory Optimization for Chance-Constrained Nonlinear Stochastic Systems, 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).
[15] Brian D. Ziebart, et al. Shift-Pessimistic Active Learning Using Robust Bias-Aware Prediction, 2015, AAAI.
[16] Peter Auer, et al. Using Confidence Bounds for Exploitation-Exploration Trade-offs, 2003, J. Mach. Learn. Res.
[17] Swarat Chaudhuri, et al. Control Regularization for Reduced Variance Reinforcement Learning, 2019, ICML.
[18] Jennifer Listgarten, et al. Conditioning by adaptive sampling for robust design, 2019, ICML.
[19] Sébastien Bubeck, et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems, 2012, Found. Trends Mach. Learn.
[20] Matus Telgarsky, et al. Spectrally-normalized margin bounds for neural networks, 2017, NIPS.
[21] Houman Owhadi, et al. On the Brittleness of Bayesian Inference, 2013, SIAM Rev.
[22] Jaime F. Fisac, et al. Reachability-based safe learning with Gaussian processes, 2014, 53rd IEEE Conference on Decision and Control.
[23] Jaime F. Fisac, et al. A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems, 2017, IEEE Transactions on Automatic Control.
[24] A. Dawid, et al. Game theory, maximum entropy, minimum discrepancy and robust Bayesian decision theory, 2004, math/0410076.
[25] Carl E. Rasmussen, et al. Gaussian processes for machine learning, 2005, Adaptive computation and machine learning.
[26] Aaron D. Ames, et al. Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems, 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[27] Prabhat, et al. Scalable Bayesian Optimization Using Deep Neural Networks, 2015, ICML.
[28] Joel W. Burdick, et al. Stagewise Safe Bayesian Optimization with Gaussian Processes, 2018, ICML.
[29] Ruben Martinez-Cantin, et al. BayesOpt: a Bayesian optimization library for nonlinear optimization, experimental design and bandits, 2014, J. Mach. Learn. Res.
[30] Brian D. Ziebart, et al. Robust Classification Under Sample Selection Bias, 2014, NIPS.
[31] Andrew Gordon Wilson, et al. Deep Kernel Learning, 2015, AISTATS.
[32] Angela P. Schoellig, et al. Robust Constrained Learning-based NMPC enabling reliable mobile robot path tracking, 2016, Int. J. Robotics Res.
[33] J. Doyle, et al. Essentials of Robust Control, 1997.
[34] H. Shimodaira, et al. Improving predictive inference under covariate shift by weighting the log-likelihood function, 2000.
[35] Cordelia Schmid, et al. Learning object class detectors from weakly annotated video, 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[36] Edwin V. Bonilla, et al. Multi-task Gaussian Process Prediction, 2007, NIPS.
[37] Andreas Krause, et al. Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting, 2009, IEEE Transactions on Information Theory.
[38] Ufuk Topcu, et al. Safe Reinforcement Learning via Shielding, 2017, AAAI.
[39] Brian D. Ziebart, et al. Robust Covariate Shift Prediction with General Losses and Feature Views, 2017, ArXiv.
[40] Andreas Krause, et al. Safe Exploration in Finite Markov Decision Processes with Gaussian Processes, 2016, NIPS.
[41] Yuichi Yoshida, et al. Spectral Normalization for Generative Adversarial Networks, 2018, ICLR.
[42] Andreas Krause, et al. Adaptive and Safe Bayesian Optimization in High Dimensions via One-Dimensional Subspaces, 2019, ICML.
[43] Yisong Yue, et al. Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes, 2018, AAAI.
[44] Weiping Li, et al. Applied Nonlinear Control, 1991.
[45] Brian D. Ziebart, et al. Robust Covariate Shift Regression, 2016, AISTATS.
[46] Nitish Srivastava, et al. Dropout: a simple way to prevent neural networks from overfitting, 2014, J. Mach. Learn. Res.
[47] Stefano Ermon, et al. Label-Free Supervision of Neural Networks with Physics and Domain Knowledge, 2016, AAAI.