Thesis Proposal Probabilistic Reasoning with Permutations: A Fourier-Theoretic Approach

Permutations are ubiquitous in many real-world problems, such as voting, ranking, and data association. Representing uncertainty over permutations is challenging, since there are n! possibilities, and common factorized probability distribution representations, such as graphical models, are inefficient due to the mutual exclusivity constraints that are typically associated with permutations. This thesis explores a new approach for probabilistic reasoning with permutations based on the idea of approximating distributions using their low-frequency Fourier components. We use a generalized Fourier transform defined for functions on permutations, but unlike the widely used Fourier analysis on the circle or the real line, Fourier transforms of functions on permutations take the form of ordered collections of matrices. As we show, maintaining the appropriate set of low-frequency Fourier terms corresponds to maintaining matrices of simple marginal probabilities which summarize the underlying distribution. We show how to derive the Fourier coefficients of a variety of probabilistic models which arise in practice and that many useful models are either well-approximated or exactly represented by low-frequency (and in many cases, sparse) Fourier coefficient matrices. In addition to showing that Fourier representations are both compact and intuitive, we show how to cast common probabilistic inference operations in the Fourier domain, including marginalization, conditioning on evidence, and factoring based on probabilistic independence. The algorithms presented in this thesis are fully general and work gracefully in bandlimited settings where only a partial subset of Fourier coefficients is made available. From the theoretical side, we tackle several problems in understanding the consequences of the bandlimiting approximation. We present results in this thesis which illuminate the nature of error propagation in the Fourier domain and propose methods for mitigating their effects. Finally we demonstrate the effectiveness of our approach on several real datasets and show that our methods, in addition to being well-founded theoretically, are also scalable and provide superior results in practice.

[1]  R. H. Makar On the Analysis of the Kronecker Product of Irreducible Representations of the Symmetric Group , 1949 .

[2]  C. L. Mallows NON-NULL RANKING MODELS. I , 1957 .

[3]  J. Munkres ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS* , 1957 .

[4]  U. Grenander Probabilities on Algebraic Structures , 1964 .

[5]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[6]  K. G. Murty An Algorithm for Ranking All the Assignment in Order of Increasing Cost , 1968 .

[7]  J. Marsden,et al.  Elementary classical analysis , 1974 .

[8]  D. Reid An algorithm for tracking multiple targets , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[9]  Alan S. Willsky,et al.  On the Algebraic Structure of Certain Partially Observable Finite-State Markov Processes , 1978, Inf. Control..

[10]  D. Critchlow Metric Methods for Analyzing Partially Ranked Data , 1986 .

[11]  M. Fligner,et al.  Distance Based Ranking Models , 1986 .

[12]  P. Diaconis,et al.  SHUFFLING CARDS AND STOPPING-TIMES , 1986 .

[13]  Y. Bar-Shalom Tracking and data association , 1988 .

[14]  P. Diaconis Group representations in probability and statistics , 1988 .

[15]  Jin-quan Chen Group Representation Theory For Physicists , 1989 .

[16]  M. Trick,et al.  Voting schemes for which it can be difficult to tell who won the election , 1989 .

[17]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[18]  P. Diaconis A Generalization of Spectral Analysis with Application to Ranked Data , 1989 .

[19]  T. Inui,et al.  The Symmetric Group , 1990 .

[20]  P. Diaconis,et al.  Trailing the Dovetail Shuffle to its Lair , 1992 .

[21]  Richard M. Wilson,et al.  A course in combinatorics , 1992 .

[22]  J. B. Collins,et al.  Efficient gating in data association with multivariate Gaussian distributed states , 1992 .

[23]  M. Clausen,et al.  Fast Fourier transforms for symmetric groups : theory and implementation , 1993 .

[24]  Aubrey B. Poore,et al.  Multidimensional Assignments and Multitarget Tracking , 1993, Partitioning Data Sets.

[25]  Ingemar J. Cox,et al.  An Efficient Implementation of Reid's Multiple Hypothesis Tracking Algorithm and Its Evaluation for the Purpose of Visual Tracking , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  David Keith Maslen,et al.  The efficient computation of Fourier transforms on the symmetric group , 1998, Math. Comput..

[27]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[28]  Xavier Boyen,et al.  Tractable Inference for Complex Stochastic Processes , 1998, UAI.

[29]  D. Rockmore,et al.  Nonlinear approximation theory on finite groups , 1999 .

[30]  A. Terras Fourier Analysis on Finite Groups and Applications: Index , 1999 .

[31]  Daniel N. Rockmore,et al.  The FFT: an algorithm the whole family can use , 2000, Comput. Sci. Eng..

[32]  Chris H. Q. Ding,et al.  Bipartite graph partitioning and data clustering , 2001, CIKM '01.

[33]  Risi Kondor,et al.  Diffusion kernels on graphs and other discrete structures , 2002, ICML 2002.

[34]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[35]  John D. Lafferty,et al.  Cranking: Combining Rankings Using Conditional Probability Models on Permutations , 2002, ICML.

[36]  Leonidas J. Guibas,et al.  A Distributed Algorithm for Managing Multi-target Identities in Wireless Ad-hoc Sensor Networks , 2003, IPSN.

[37]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[38]  Jon M. Kleinberg,et al.  Fast Algorithms for Large-State-Space HMMs with Applications to Web Usage Analysis , 2003, NIPS.

[39]  H. Balakrishnan,et al.  Polynomial approximation algorithms for belief matrix maintenance in identity management , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[40]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[41]  Songhwai Oh,et al.  Markov chain Monte Carlo data association for general multiple-target tracking problems , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[42]  Martial Hebert,et al.  A spectral technique for correspondence problems using pairwise constraints , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[43]  E. Candès,et al.  Stable signal recovery from incomplete and inaccurate measurements , 2005, math/0503066.

[44]  Kunle Olukotun,et al.  The Information-Form Data Association Filter , 2005, NIPS.

[45]  Eric J. Malm Decimation-in-Frequency Fast Fourier Transforms for the Symmetric Group , 2005 .

[46]  S. Sastry,et al.  A polynomial-time approximation algorithm for joint probabilistic data association , 2005, Proceedings of the 2005, American Control Conference, 2005..

[47]  Leonidas J. Guibas,et al.  Lazy inference on object identities in wireless sensor networks , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[48]  Jianbo Shi,et al.  Balanced Graph Matching , 2006, NIPS.

[49]  Stefan Carlsson,et al.  Tracking and Labelling of Interacting Multiple Targets , 2006, ECCV.

[50]  Frank Dellaert,et al.  MCMC Data Association and Sparse Factorization Updating for Real Time Multitarget Tracking with Merged and Multiple Measurements , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Risi Kondor,et al.  Snob : a C++ toolkit for fast Fourier transforms on the symmetric , 2006 .

[52]  Emmanuel J. Candès,et al.  Near-Optimal Signal Recovery From Random Projections: Universal Encoding Strategies? , 2004, IEEE Transactions on Information Theory.

[53]  Hariharan Narayanan On the complexity of computing Kostka numbers and Littlewood-Richardson coefficients , 2006 .

[54]  Kunle Olukotun,et al.  The Identity Management Kalman Filter (IMKF) , 2006, Robotics: Science and Systems.

[55]  Leonidas J. Guibas,et al.  Efficient Inference for Distributions on Permutations , 2007, NIPS.

[56]  Manfred K. Warmuth,et al.  Learning Permutations with Exponential Weights , 2007, COLT.

[57]  Yi Mao,et al.  Non-parametric Modeling of Partially Ranked Data , 2007, NIPS.

[58]  Jeff A. Bilmes,et al.  Consensus ranking under the exponential model , 2007, UAI.

[59]  Tony Jebara,et al.  Multi-object tracking with representations of the symmetric group , 2007, AISTATS.

[60]  R. Kondor The skew spectrum of functions on finite groups and their homogeneous spaces , 2007, 0712.4259.

[61]  Devavrat Shah,et al.  Inferring rankings under constrained sensing , 2008, NIPS.

[62]  Carlos Guestrin,et al.  Inference for Distributions over the Permutation Group , 2008 .

[63]  Karsten M. Borgwardt,et al.  The skew spectrum of graphs , 2008, ICML '08.

[64]  I. Kondor,et al.  Group theoretical methods in machine learning , 2008 .

[65]  Stephen E. Robertson,et al.  SoftRank: optimizing non-smooth rank metrics , 2008, WSDM '08.

[66]  Amnon Shashua,et al.  Probabilistic graph and hypergraph matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[67]  Leonidas J. Guibas,et al.  The identity management problem — A short survey , 2008, 2008 11th International Conference on Information Fusion.

[68]  Gregory Minton,et al.  Voting, the Symmetric Group, and Representation Theory , 2007, Am. Math. Mon..