论文信息 - Implicit Differentiation by Perturbation

Implicit Differentiation by Perturbation

This paper proposes a simple and efficient finite difference method for implicit differentiation of marginal inference results in discrete graphical models. Given an arbitrary loss function, defined on marginals, we show that the derivatives of this loss with respect to model parameters can be obtained by running the inference procedure twice, on slightly perturbed model parameters. This method can be used with approximate inference, with a loss function over approximate marginals. Convenient choices of loss functions make it practical to fit graphical models with hidden variables, high treewidth and/or model misspecification.

Justin Domke | Justin Domke

[1] Daphne Koller,et al. Constrained Approximate Maximum Entropy Learning of Markov Random Fields , 2008, UAI.

[2] Joachim M. Buhmann,et al. Spanning Tree Approximations for Conditional Random Fields , 2009, AISTATS.

[3] Justin Domke. Learning Convex Inference of Marginals , 2008, UAI.

[4] Martin J. Wainwright,et al. A new class of upper bounds on the log partition function , 2002, IEEE Transactions on Information Theory.

[5] Tamir Hazan,et al. Convergent Message-Passing Algorithms for Inference over General Graphs with Convex Free Energies , 2008, UAI.

[6] Yee Whye Teh,et al. Belief Optimization for Binary Networks: A Stable Alternative to Loopy Belief Propagation , 2001, UAI.

[7] Yee Whye Teh,et al. An Alternate Objective Function for Markovian Fields , 2002, ICML.

[8] Mark W. Schmidt,et al. Accelerated training of conditional random fields with stochastic gradient methods , 2006, ICML.

[9] Martin J. Wainwright,et al. Estimating the "Wrong" Graphical Model: Benefits in the Computation-Limited Setting , 2006, J. Mach. Learn. Res..

[10] Martial Hebert,et al. Discriminative Random Fields , 2006, International Journal of Computer Vision.

[11] B. Schölkopf,et al. Training Conditional Random Fields for Maximum Labelwise Accuracy , 2007 .