Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning
暂无分享,去创建一个
[1] Clayton D. Scott,et al. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[2] Yoshua Bengio,et al. CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning , 2020, ICLR.
[3] Felipe Petroski Such,et al. Generalized Hidden Parameter MDPs Transferable Model-based RL in a Handful of Trials , 2020, AAAI.
[4] Ben Poole,et al. Weakly-Supervised Disentanglement Without Compromises , 2020, ICML.
[5] Luisa M. Zintgraf,et al. VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning , 2019, ICLR.
[6] Jakub M. Tomczak,et al. DIVA: Domain Invariant Variational Autoencoders , 2019, DGS@ICLR.
[7] B. Schölkopf,et al. Causality for Machine Learning , 2019, Probabilistic and Causal Inference.
[8] Bernhard Schölkopf,et al. Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations , 2018, ICML.
[9] Bernhard Schölkopf,et al. A Local Information Criterion for Dynamical Systems , 2018, ArXiv.
[10] Guillaume Desjardins,et al. Understanding disentangling in β-VAE , 2018, ArXiv.
[11] Guillaume Desjardins,et al. Understanding disentangling in $\beta$-VAE , 2018, 1804.03599.
[12] Andriy Mnih,et al. Disentangling by Factorising , 2018, ICML.
[13] Roger B. Grosse,et al. Isolating Sources of Disentanglement in Variational Autoencoders , 2018, NeurIPS.
[14] Bernhard Schölkopf,et al. Learning Independent Causal Mechanisms , 2017, ICML.
[15] Taylor W. Killian,et al. Direct Policy Transfer via Hidden Parameter Markov Decision Processes , 2018 .
[16] Alexei A. Efros,et al. Curiosity-Driven Exploration by Self-Supervised Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[17] Marco Cuturi,et al. Soft-DTW: a Differentiable Loss Function for Time-Series , 2017, ICML.
[18] Finale Doshi-Velez,et al. Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes , 2017, AAAI.
[19] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.
[20] B. Schoelkopf,et al. Algorithmic independence of initial condition and dynamical law in thermodynamics and causal inference , 2015, 1512.02057.
[21] Finale Doshi-Velez,et al. Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations , 2013, IJCAI.
[22] Bernhard Schölkopf,et al. Artificial intelligence: Learning to see and act , 2015, Nature.
[23] Bernhard Schölkopf,et al. Inferring latent structures via information inequalities , 2014, UAI.
[24] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.
[25] S. Morgan. Handbook of Causal Analysis for Social Research , 2013 .
[26] Pascal Vincent,et al. Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[27] Felix Elwert,et al. Graphical Causal Models , 2013 .
[28] Bernhard Schölkopf,et al. On causal and anticausal learning , 2012, ICML.
[29] Jürgen Schmidhuber,et al. Learning skills from play: Artificial curiosity on a Katana robot arm , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).
[30] Jürgen Schmidhuber,et al. Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990–2010) , 2010, IEEE Transactions on Autonomous Mental Development.
[31] Peter Spirtes,et al. Introduction to Causal Inference , 2010, J. Mach. Learn. Res..
[32] Bernhard Schölkopf,et al. Causal Inference Using the Algorithmic Markov Condition , 2008, IEEE Transactions on Information Theory.
[33] Jürgen Schmidhuber,et al. Optimal Artificial Curiosity, Creativity, Music, and the Fine Arts , 2005 .
[34] Shie Mannor,et al. A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..
[35] Peter Grünwald,et al. A tutorial introduction to the minimum description length principle , 2004, ArXiv.
[36] Marko Bacic,et al. Model predictive control , 2003 .
[37] Leonie Kohl,et al. Fundamental Concepts in the Design of Experiments , 2000 .
[38] Veronica Czitrom,et al. One-Factor-at-a-Time versus Designed Experiments , 1999 .
[39] Jürgen Schmidhuber,et al. Curious model-building control systems , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.
[40] Jürgen Schmidhuber,et al. A possibility for implementing curiosity and boredom in model-building neural controllers , 1991 .
[41] P. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .
[42] J. Rissanen,et al. Modeling By Shortest Data Description* , 1978, Autom..
[43] J. I. The Design of Experiments , 1936, Nature.