Low complexity, high performance algorithms for estimation and decoding

The tremendous success of the graphical error-correcting codes has generated great interest in iterative message-passing algorithms for approximate decoding and estimation; indeed the applications of such approximation techniques extend to a surprisingly vast range of fields of science and engineering, from thermal physics to artificial intelligence, and from computer vision to communications and information theory. In this thesis we will discuss two novel approaches to the general estimation problem based on graphs; each approach results in methods which are superior, in terms of performance and/or complexity, to the existing algorithms that solve the same problem exactly or approximately. In the first approach we reformulate the general estimation problem in a measure-theoretic framework, free of the notion of ‘variables’, upon which the conventional ‘junction tree’ based methods heavily rely. This enables us to exploit all the underlying structure in the data of the problem, in order to reduce the complexity of the underlying marginalization task. We introduce appropriate notions of independence and junction trees, and the corresponding measure-theoretic junction tree algorithm, and we give an automatic procedure, called ‘lifting’, that finds such low complexity junction trees for a given marginalization problem. This automatic procedure often gives rise to algorithms that are substantially less complex than those obtained using the conventional methods. The second approach is based on a recently discovered connection between the fixed points of the well-known loopy belief propagation algorithm, and certain ‘variational free energy’ concepts from statistical physics. We discuss the general ‘Kikuchi approximation method’ for solving a marginalization problem. We define appropriate concepts of graphs and minimal graphs representing such problems, and derive several results on the strengths and limitations of the Kikuchi method based on the graph-theoretic properties of these graphs. In addition to theoretical discussions, for each approach we provide detailed examples and simulation results on several applications of interest, such as exact decoding of low-density parity-check (LDPC) codes, and joint decoding of a partial response magnetic recording channel and an LDPC code.

[1]  W. Freeman,et al.  Bethe free energy, Kikuchi approximations, and belief propagation algorithms , 2001 .

[2]  Thomas J. Richardson,et al.  The geometry of turbo-decoding dynamics , 2000, IEEE Trans. Inf. Theory.

[3]  Jan H. van Schuppen,et al.  Positive matrix factorization via extremal polyhedral cones , 1999 .

[4]  Yee Whye Teh,et al.  Belief Optimization for Binary Networks: A Stable Alternative to Loopy Belief Propagation , 2001, UAI.

[5]  Robert J. McEliece,et al.  The generalized distributive law , 2000, IEEE Trans. Inf. Theory.

[6]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[7]  Robert G. Gallager,et al.  Low-density parity-check codes , 1962, IRE Trans. Inf. Theory.

[8]  Yair Weiss,et al.  Correctness of Local Probability Propagation in Graphical Models with Loops , 2000, Neural Computation.

[9]  Alok Aggarwal,et al.  Finding minimal convex nested polygons , 1985, SCG '85.

[10]  David J. Spiegelhalter,et al.  Probabilistic Networks and Expert Systems , 1999, Information Science and Statistics.

[11]  Prakash P. Shenoy,et al.  Probability propagation , 1990, Annals of Mathematics and Artificial Intelligence.

[12]  Joel E. Cohen,et al.  Nonnegative ranks, decompositions, and factorizations of nonnegative matrices , 1993 .

[13]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[14]  Payam Pakzad,et al.  A new look at the generalized distributive law , 2004, IEEE Transactions on Information Theory.

[15]  Nevin Lianwen Zhang,et al.  Exploiting Causal Independence in Bayesian Network Inference , 1996, J. Artif. Intell. Res..

[16]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[17]  Dariush Divsalar,et al.  Coding theorems for 'turbo-like' codes , 1998 .

[18]  David J. C. MacKay,et al.  Good Codes Based on Very Sparse Matrices , 1995, IMACC.

[19]  A. Glavieux,et al.  Near Shannon limit error-correcting coding and decoding: Turbo-codes. 1 , 1993, Proceedings of ICC '93 - IEEE International Conference on Communications.

[20]  John Cocke,et al.  Optimal decoding of linear codes for minimizing symbol error rate (Corresp.) , 1974, IEEE Trans. Inf. Theory.

[21]  Payam Pakzad,et al.  VLSI architectures for iterative decoders in magnetic recording channels , 2001 .

[22]  L. Beda Thermal physics , 1994 .

[23]  S. Aji,et al.  The Generalized Distributive Law and Free Energy Minimization , 2001 .

[24]  R. Kikuchi A Theory of Cooperative Phenomena , 1951 .

[25]  S. Kirkpatrick,et al.  Solvable Model of a Spin-Glass , 1975 .

[26]  William T. Freeman,et al.  Constructing free-energy approximations and generalized belief propagation algorithms , 2005, IEEE Transactions on Information Theory.

[27]  T. Morita Formal Structure of the Cluster Variation Method , 1994 .

[28]  P. Hall On Representatives of Subsets , 1935 .

[29]  Rüdiger L. Urbanke,et al.  The capacity of low-density parity-check codes under message-passing decoding , 2001, IEEE Trans. Inf. Theory.

[30]  R. Palmer,et al.  Solution of 'Solvable model of a spin glass' , 1977 .

[31]  Jean C. Walrand,et al.  High-performance communication networks , 1999 .

[32]  Alan L. Yuille,et al.  CCCP Algorithms to Minimize the Bethe and Kikuchi Free Energies: Convergent Alternatives to Belief Propagation , 2002, Neural Computation.

[33]  Robert J. McEliece,et al.  On the BCJR trellis for linear block codes , 1996, IEEE Trans. Inf. Theory.

[34]  J. Yedidia,et al.  The fully frustrated Ising model in infinite dimensions , 1990 .

[35]  Paul H. Siegel,et al.  Joint message-passing decoding of LDPC Codes and partial-response channels , 2002, IEEE Trans. Inf. Theory.

[36]  Jung-Fu Cheng,et al.  Turbo Decoding as an Instance of Pearl's "Belief Propagation" Algorithm , 1998, IEEE J. Sel. Areas Commun..

[37]  T. O’Neil Geometric Measure Theory , 2002 .

[38]  Robert J. McEliece,et al.  Belief Propagation on Partially Ordered Sets , 2003, Mathematical Systems Theory in Biology, Communications, Computation, and Finance.

[39]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[40]  Michael Luby,et al.  LT codes , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[41]  Robert Michael Tanner,et al.  A recursive approach to low complexity codes , 1981, IEEE Trans. Inf. Theory.

[42]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[43]  Rüdiger L. Urbanke,et al.  Design of capacity-approaching irregular low-density parity-check codes , 2001, IEEE Trans. Inf. Theory.

[44]  Payam Pakzad,et al.  Belief Propagation and Statistical Physics , 2002 .

[45]  S. Wicker Error Control Systems for Digital Communication and Storage , 1994 .

[46]  M. Opper,et al.  From Naive Mean Field Theory to the TAP Equations , 2001 .