暂无分享,去创建一个
Hongyuan Zha | Jiachen Yang | Daniel Faissol | Brenden K. Petersen | Brenden Petersen | H. Zha | D. Faissol | Jiachen Yang
[1] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[2] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[3] Rich Caruana,et al. Multitask Learning , 1997, Machine-mediated learning.
[4] András Lörincz,et al. MDPs: Learning in Varying Environments , 2003, J. Mach. Learn. Res..
[5] B. Adams,et al. Dynamic multidrug therapies for hiv: optimal and sti control approaches. , 2004, Mathematical biosciences and engineering : MBE.
[6] András Lörincz,et al. Module-Based Reinforcement Learning: Experiments with a Real Robot , 1998, Machine Learning.
[7] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[8] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[9] R. Altman,et al. Pharmacogenomics Knowledge for Personalized Medicine , 2012, Clinical pharmacology and therapeutics.
[10] Bruno Castro da Silva,et al. Learning Parameterized Skills , 2012, ICML.
[11] Finale Doshi-Velez,et al. Hidden Parameter Markov Decision Processes: An Emerging Paradigm for Modeling Families of Related Tasks , 2014, AAAI Fall Symposia.
[12] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[13] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[14] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[15] Daniel C. Zielinski,et al. Personalized Whole-Cell Kinetic Models of Metabolism for Discovery in Genomics and Pharmacodynamics. , 2015, Cell systems.
[16] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[17] Finale Doshi-Velez,et al. Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations , 2013, IJCAI.
[18] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[19] Greg Turk,et al. Preparing for the Unknown: Learning a Universal Policy with Online System Identification , 2017, Robotics: Science and Systems.
[20] Finale Doshi-Velez,et al. Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes , 2017, AAAI.
[21] Balaraman Ravindran,et al. EPOpt: Learning Robust Neural Network Policies Using Model Ensembles , 2016, ICLR.
[22] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.
[23] Joel s. Brown,et al. Integrating evolutionary dynamics into treatment of metastatic castrate-resistant prostate cancer , 2017, Nature Communications.
[24] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[25] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[26] Sergey Levine,et al. Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings , 2018, ICML.
[27] Li You,et al. Towards Multidrug Adaptive Therapy , 2018, Cancer Research.
[28] Qiang Liu,et al. Learning to Explore with Meta-Policy Gradient , 2018, ICML 2018.
[29] Andrej Kastrin,et al. Predicting potential drug-drug interactions on topological and semantic similarity features using statistical learning , 2018, PloS one.
[30] Taylor W. Killian,et al. Direct Policy Transfer via Hidden Parameter Markov Decision Processes , 2018 .
[31] Sandra Zilles,et al. An Overview of Machine Teaching , 2018, ArXiv.
[32] Marcello Restelli,et al. Transfer of Value Functions via Variational Methods , 2018, NeurIPS.
[33] Felipe Petroski Such,et al. Efficient transfer learning and online adaptation with latent variable models for continuous control , 2018, ArXiv.
[34] Gary An,et al. Examining the controllability of sepsis using genetic algorithms on an agent-based model of systemic inflammation , 2018, PLoS Comput. Biol..
[35] Joelle Pineau,et al. Decoupling Dynamics and Reward for Transfer Learning , 2018, ICLR.
[36] Razvan Pascanu,et al. Meta-Learning with Latent Embedding Optimization , 2018, ICLR.
[37] Danica Kragic,et al. VPE: Variational Policy Embedding for Transfer Reinforcement Learning , 2018, 2019 International Conference on Robotics and Automation (ICRA).
[38] Gary An,et al. Deep Reinforcement Learning and Simulation as a Path Toward Precision Medicine , 2019, J. Comput. Biol..
[39] Katja Hofmann,et al. Fast Context Adaptation via Meta-Learning , 2018, ICML.
[40] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[41] Shimon Whiteson,et al. Fingerprint Policy Optimisation for Robust Reinforcement Learning , 2018, ICML.
[42] Li You,et al. Towards Multidrug Adaptive Therapy , 2020, Cancer Research.