Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

Variational inference often minimizes the “reverse” Kullback-Leibler (KL) divergence KL(q || p) from the approximate distribution q to the posterior p. Recent work studies the “forward” KL divergence KL(p || q), which, unlike the reverse KL, does not lead to variational approximations that underestimate uncertainty. This paper introduces Transport Score Climbing (TSC), a method that optimizes KL(p || q) using Hamiltonian Monte Carlo (HMC) and a novel adaptive transport map. The transport map improves the HMC trajectory by acting as a change of variables between the latent-variable space and a warped space. TSC uses HMC samples to dynamically train the transport map while optimizing KL(p || q). TSC leverages a synergy: a better transport map leads to better HMC sampling, which in turn leads to a better transport map. We demonstrate TSC on synthetic and real data, and find that it achieves competitive performance when training variational autoencoders on large-scale data.
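
Because the abstract compresses the whole algorithm into a few sentences, the sketch below illustrates one possible instantiation of a TSC-style loop under strong simplifying assumptions: the transport map is a diagonal affine map T(u) = mu + sigma * u standing in for the paper's neural transport map, the variational approximation q is the pushforward of a standard normal through T, and the target posterior is a toy Gaussian known up to a constant. HMC runs in the warped u-space, and the same HMC samples drive a score-climbing update of the map, i.e., stochastic gradient ascent on E_p[log q]. All function names, step sizes, and learning rates are illustrative choices, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

def log_p(z):
    # Toy unnormalized target: standard Gaussian shifted to mean 1.
    return -0.5 * np.sum((z - 1.0) ** 2)

def grad_log_p(z):
    return -(z - 1.0)

def warped_logp_and_grad(u, mu, log_sigma):
    # Change of variables z = T(u) = mu + exp(log_sigma) * u; the warped
    # log-density picks up the log-Jacobian term sum(log_sigma).
    sigma = np.exp(log_sigma)
    z = mu + sigma * u
    lp = log_p(z) + np.sum(log_sigma)
    return lp, grad_log_p(z) * sigma  # chain rule through the diagonal map

def hmc_step(u, mu, log_sigma, eps=0.1, n_leapfrog=10):
    # One HMC transition targeting the warped posterior (leapfrog + MH test).
    r = rng.standard_normal(u.shape)
    lp, g = warped_logp_and_grad(u, mu, log_sigma)
    h0 = lp - 0.5 * np.sum(r ** 2)
    u_prop = u.copy()
    r_prop = r + 0.5 * eps * g              # initial half step for momentum
    for i in range(n_leapfrog):
        u_prop = u_prop + eps * r_prop      # full position step
        lp, g = warped_logp_and_grad(u_prop, mu, log_sigma)
        if i < n_leapfrog - 1:
            r_prop = r_prop + eps * g       # full momentum step
    r_prop = r_prop + 0.5 * eps * g         # final half step for momentum
    h1 = lp - 0.5 * np.sum(r_prop ** 2)
    return u_prop if np.log(rng.uniform()) < h1 - h0 else u

def score_climbing_update(z, mu, log_sigma, lr=0.02):
    # Forward-KL (score-climbing) update: ascend E_p[log q(z)], using the
    # HMC sample z as an approximate draw from the posterior p.
    eps_z = (z - mu) / np.exp(log_sigma)
    grad_mu = eps_z / np.exp(log_sigma)     # d log q / d mu
    grad_ls = eps_z ** 2 - 1.0              # d log q / d log_sigma
    return mu + lr * grad_mu, log_sigma + lr * grad_ls

# TSC-style loop: the chain is warm-started from its previous state, and the
# same map both defines q and preconditions HMC.
d = 2
mu, log_sigma = np.zeros(d), np.zeros(d)
u = rng.standard_normal(d)
for t in range(3000):
    u = hmc_step(u, mu, log_sigma)
    z = mu + np.exp(log_sigma) * u          # push the sample back to z-space
    mu, log_sigma = score_climbing_update(z, mu, log_sigma)
    u = (z - mu) / np.exp(log_sigma)        # keep the chain state fixed in z-space
print(mu, np.exp(log_sigma))                # should approach mean 1, scale 1
```

On this toy target the parameters drift toward the true posterior mean and scale. The design choice the sketch mirrors is the synergy described above: a single map both defines the variational approximation and warps the space that HMC explores, and it is adapted online from the chain's own samples.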
