Estimation and Marginalization Using the Kikuchi Approximation Methods

In this letter, we examine a general method of approximation, known as the Kikuchi approximation method, for finding the marginals of a product distribution, as well as the corresponding partition function. The Kikuchi approximation method defines a certain constrained optimization problem, called the Kikuchi problem, and treats its stationary points as approximations to the desired marginals. We show how to associate a graph to any Kikuchi problem and describe a class of local message-passing algorithms along the edges of any such graph, which attempt to find the solutions to the problem. Implementation of these algorithms on graphs with fewer edges requires fewer operations in each iteration. We therefore characterize minimal graphs for a Kikuchi problem, which are those with the minimum number of edges. We show with empirical results that these simpler algorithms often offer significant savings in computational complexity, without suffering a loss in the convergence rate. We give conditions for the convexity of a given Kikuchi problem and the exactness of the approximations in terms of the loops of the minimal graph. More precisely, we show that if the minimal graph is cycle free, then the Kikuchi approximation method is exact, and the converse is also true generically. Together with the fact that in the cycle-free case, the iterative algorithms are equivalent to the well-known belief propagation algorithm, our results imply that, generically, the Kikuchi approximation method can be exact if and only if traditional junction tree methods could also solve the problem exactly.

[1]  R. Stanley What Is Enumerative Combinatorics , 1986 .

[2]  Payam Pakzad,et al.  A new look at the generalized distributive law , 2004, IEEE Transactions on Information Theory.

[3]  T. Morita Formal Structure of the Cluster Variation Method , 1994 .

[4]  Yair Weiss,et al.  Correctness of Local Probability Propagation in Graphical Models with Loops , 2000, Neural Computation.

[5]  Robert G. Gallager,et al.  Low-density parity-check codes , 1962, IRE Trans. Inf. Theory.

[6]  W. Freeman,et al.  Bethe free energy, Kikuchi approximations, and belief propagation algorithms , 2001 .

[7]  David J. C. MacKay,et al.  Good Codes Based on Very Sparse Matrices , 1995, IMACC.

[8]  Robert J. McEliece,et al.  The generalized distributive law , 2000, IEEE Trans. Inf. Theory.

[9]  Robert Michael Tanner,et al.  A recursive approach to low complexity codes , 1981, IEEE Trans. Inf. Theory.

[10]  Payam Pakzad,et al.  Low complexity, high performance algorithms for estimation and decoding , 2004 .

[11]  Alan L. Yuille,et al.  CCCP Algorithms to Minimize the Bethe and Kikuchi Free Energies: Convergent Alternatives to Belief Propagation , 2002, Neural Computation.

[12]  S. Wicker Error Control Systems for Digital Communication and Storage , 1994 .

[13]  David J. Spiegelhalter,et al.  Probabilistic Networks and Expert Systems , 1999, Information Science and Statistics.

[14]  Jung-Fu Cheng,et al.  Turbo Decoding as an Instance of Pearl's "Belief Propagation" Algorithm , 1998, IEEE J. Sel. Areas Commun..

[15]  S. Aji,et al.  The Generalized Distributive Law and Free Energy Minimization , 2001 .

[16]  C. Richard Johnson,et al.  Turbo Decoding as Iterative Constrained Maximum-Likelihood Sequence Detection , 2006, IEEE Transactions on Information Theory.

[17]  Yee Whye Teh,et al.  Belief Optimization for Binary Networks: A Stable Alternative to Loopy Belief Propagation , 2001, UAI.

[18]  Payam Pakzad,et al.  Belief Propagation and Statistical Physics , 2002 .

[19]  Jean C. Walrand,et al.  High-performance communication networks , 1999 .

[20]  Dariush Divsalar,et al.  Coding theorems for 'turbo-like' codes , 1998 .

[21]  Rüdiger L. Urbanke,et al.  The capacity of low-density parity-check codes under message-passing decoding , 2001, IEEE Trans. Inf. Theory.

[22]  R. Kikuchi A Theory of Cooperative Phenomena , 1951 .

[23]  Payam Pakzad,et al.  Sub-tree Based Upper and Lower Bounds on the Partition Function , 2006, 2006 IEEE International Symposium on Information Theory.

[24]  William T. Freeman,et al.  Constructing free-energy approximations and generalized belief propagation algorithms , 2005, IEEE Transactions on Information Theory.

[25]  Thomas J. Richardson,et al.  The geometry of turbo-decoding dynamics , 2000, IEEE Trans. Inf. Theory.

[26]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[27]  P. Hall On Representatives of Subsets , 1935 .

[28]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[29]  Rüdiger L. Urbanke,et al.  Design of capacity-approaching irregular low-density parity-check codes , 2001, IEEE Trans. Inf. Theory.

[30]  L. Beda Thermal physics , 1994 .

[31]  Michael Luby,et al.  LT codes , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[32]  H. Fédérer Geometric Measure Theory , 1969 .

[33]  Payam Pakzad,et al.  Kikuchi approximation method for joint decoding of LDPC codes and partial-response channels , 2006, IEEE Transactions on Communications.

[34]  A. Glavieux,et al.  Near Shannon limit error-correcting coding and decoding: Turbo-codes. 1 , 1993, Proceedings of ICC '93 - IEEE International Conference on Communications.

[35]  Robert J. McEliece,et al.  Belief Propagation on Partially Ordered Sets , 2003, Mathematical Systems Theory in Biology, Communications, Computation, and Finance.