Sorting from Noisier Samples

We study the problem of constructing an order over a set of elements given noisy samples. We consider two models for generating the noisy samples; in both, the distribution of samples is induced by an unknown state of nature: a permutation ρ. In Mallow's model, r permutations πi are generated independently from ρ, each with probability proportional to e−βdk (ρ, πi), where dK(ρ, πi) is the Kemeny distance between ρ and πi - the number of pairs they order differently. In the noisy comparisons model, we are given a tournament, generated from ρ as follows: if i is before j in ρ, then with probability 1/2 + γ, the edge between them is oriented from i to j. Both of these problems were studied by Braverman and Mossel [7]; they showed how to construct a maximum-likelihood permutation when the noise parameter (β or γ, respectively) is constant. In this work, we obtain algorithms that work in the presence of stronger noise [EQUATION] or [EQUATION], respectively). In Mallow's model, our algorithm works for a relaxed solution concept: likelier than nature. That is, rather than requiring that our output maximizes the likelihood over the entire domain, we guarantee that the likelihood of our output is, w.h.p., greater than or equal to that of the true state of nature (ρ). An interesting feature of our algorithm is that it handles noise by adding more noise.

[1]  Alan M. Frieze,et al.  A new rounding procedure for the assignment problem with applications to dense graph arrangement problems , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[2]  Jeff A. Bilmes,et al.  Consensus ranking under the exponential model , 2007, UAI.

[3]  Mark Braverman,et al.  Sorting from Noisy Information , 2009, ArXiv.

[4]  Joseph Naor,et al.  Approximating Minimum Feedback Sets and Multi-Cuts in Directed Graphs , 1995, IPCO.

[5]  M. Trick,et al.  Voting schemes for which it can be difficult to tell who won the election , 1989 .

[6]  Marek Karpinski,et al.  Approximation Schemes for the Betweenness Problem in Tournaments and Related Ranking Problems , 2009, APPROX-RANDOM.

[7]  Subhash Khot,et al.  Vertex cover might be hard to approximate to within 2-/spl epsiv/ , 2003, 18th IEEE Annual Conference on Computational Complexity, 2003. Proceedings..

[8]  S. Safra,et al.  On the hardness of approximating minimum vertex cover , 2005 .

[9]  Craig Boutilier,et al.  Effective sampling and learning for mallows models with pairwise-preference data , 2014, J. Mach. Learn. Res..

[10]  Ronald L. Graham,et al.  Concrete Mathematics, a Foundation for Computer Science , 1991, The Mathematical Gazette.

[11]  Ariel D. Procaccia,et al.  A Maximum Likelihood Approach For Selecting Sets of Alternatives , 2012, UAI.

[12]  Nir Ailon,et al.  An Active Learning Algorithm for Ranking from Pairwise Preferences with an Almost Optimal Query Complexity , 2010, J. Mach. Learn. Res..

[13]  Noga Alon,et al.  Ranking Tournaments , 2006, SIAM J. Discret. Math..

[14]  C. L. Mallows NON-NULL RANKING MODELS. I , 1957 .

[15]  Narges Simjour Improved Parameterized Algorithms for the Kemeny Aggregation Problem , 2009, IWPEC.

[16]  Silvio Lattanzi,et al.  On Reconstructing a Hidden Permutation , 2014, APPROX-RANDOM.

[17]  Mark Braverman,et al.  Noisy sorting without resampling , 2007, SODA '08.

[18]  Subhash Khot,et al.  Vertex cover might be hard to approximate to within 2-/spl epsiv/ , 2003, 18th IEEE Annual Conference on Computational Complexity, 2003. Proceedings..

[19]  Alan M. Frieze,et al.  Quick Approximation to Matrices and Applications , 1999, Comb..

[20]  Eli Upfal,et al.  Computing with Noisy Information , 1994, SIAM J. Comput..

[21]  Rolf Niedermeier,et al.  Fixed-parameter algorithms for Kemeny rankings , 2009, Theor. Comput. Sci..

[22]  Claire Mathieu,et al.  Electronic Colloquium on Computational Complexity, Report No. 144 (2006) How to rank with few errors A PTAS for Weighted Feedback Arc Set on Tournaments , 2006 .