Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

Modeling complex phenomena typically involves the use of both discrete and continuous variables. Such a setting applies across a wide range of problems, from identifying trends in time-series data to performing effective compositional scene understanding in images. Here, we propose Hybrid Memoised Wake-Sleep (HMWS), an algorithm for effective inference in such hybrid discrete-continuous models. Prior approaches to learning suffer because they must repeatedly perform expensive inner-loop inference over the discrete variables. We build on a recent approach, Memoised Wake-Sleep (MWS), which alleviates part of the problem by memoising the discrete variables, and extend it with a principled and effective way to handle continuous variables: a separate recognition model used for importance-sampling-based approximate inference and marginalization. We evaluate HMWS on GP-kernel learning and 3D scene understanding, and show that it outperforms current state-of-the-art inference methods.
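
To make the continuous-variable step concrete, here is a minimal sketch, not the authors' implementation: it assumes a toy one-dimensional Gaussian model and a hand-coded recognition model (log_joint, recognition_model, and is_log_marginal are all hypothetical placeholders), and illustrates how importance sampling with a learned proposal q(z_c | x, z_d) can estimate the marginal likelihood of each memoised discrete latent z_d, using log-sum-exp for numerical stability.

```python
# Sketch only: importance-sampling marginalization of a continuous latent z_c,
# given a memoised discrete latent z_d. All model components are toy placeholders.
import numpy as np
from scipy.stats import norm
from scipy.special import logsumexp

def log_joint(x, z_d, z_c):
    """Placeholder log p(x, z_d, z_c): the discrete latent selects a Gaussian mode."""
    log_prior_c = norm.logpdf(z_c, loc=float(z_d), scale=1.0)  # p(z_c | z_d)
    log_lik = norm.logpdf(x, loc=z_c, scale=0.5)               # p(x | z_c)
    return log_prior_c + log_lik

def recognition_model(x, z_d):
    """Placeholder q(z_c | x, z_d): returns a proposal mean and std."""
    return 0.5 * (x + z_d), 1.0

def is_log_marginal(x, z_d, num_particles=64, rng=None):
    """Importance-sampling estimate of log p(x, z_d) = log \\int p(x, z_d, z_c) dz_c."""
    rng = rng or np.random.default_rng(0)
    mu, sigma = recognition_model(x, z_d)
    z_c = rng.normal(mu, sigma, size=num_particles)            # z_c ~ q(z_c | x, z_d)
    log_w = log_joint(x, z_d, z_c) - norm.logpdf(z_c, mu, sigma)
    return logsumexp(log_w) - np.log(num_particles)            # log of mean weight

# Score each memoised discrete candidate by its estimated marginal likelihood.
x = 1.3
memory = [0, 1, 2]                                             # memoised z_d values
scores = {z_d: is_log_marginal(x, z_d) for z_d in memory}
print(scores, "-> best z_d:", max(scores, key=scores.get))
```

In this sketch the same weights that score each memoised z_d also yield a self-normalized posterior over z_c, which is what allows the continuous variables to be marginalized out without the repeated inner-loop discrete inference that the abstract identifies as the bottleneck.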
