Sampling binary contingency tables with a greedy start

We study the problem of counting and randomly sampling binary contingency tables. For given row and column sums, we are interested in approximately counting (or sampling) 0/1 n x m matrices with the specified row/column sums. We present a simulated annealing algorithm with running time O((nm)2D3d max log5(n+m)) for any row/column sums where D is the number of non-zero entries and d max is the maximum row/column sum. This is the first algorithm to directly solve binary contingency tables for all row/column sums. Previous work reduced the problem to the permanent, or restricted attention to row/column sums that are close to regular. The interesting aspect of our simulated annealing algorithm is that it starts at a non-trivial instance, whose solution relies on the existence of short alternating paths in the graph constructed by a particular Greedy algorithm.

[1]  D. Gale A theorem on flows in networks , 1957 .

[2]  P. Diaconis,et al.  Testing for independence in a two-way table , 1985 .

[3]  Leslie G. Valiant,et al.  Random Generation of Combinatorial Structures from a Uniform Distribution , 1986, Theor. Comput. Sci..

[4]  Mark Jerrum,et al.  Approximating the Permanent , 1989, SIAM J. Comput..

[5]  J. Besag,et al.  Generalized Monte Carlo significance tests , 1989 .

[6]  Mark Jerrum,et al.  Fast Uniform Generation of Regular Graphs , 1990, Theor. Comput. Sci..

[7]  Brendan D. McKay,et al.  Uniform Generation of Random Regular Graphs of Moderate Degree , 1990, J. Algorithms.

[8]  P. Diaconis,et al.  Geometric Bounds for Eigenvalues of Markov Chains , 1991 .

[9]  Alistair Sinclair,et al.  Improved Bounds for Mixing Rates of Markov Chains and Multicommodity Flow , 1992, Combinatorics, Probability and Computing.

[10]  P. Diaconis,et al.  Rectangular Arrays with Fixed Margins , 1995 .

[11]  Prasad Tetali,et al.  Simple Markov-Chain Algorithms for Generating Bipartite Graphs and Tournaments (Extended Abstract) , 1999, SODA.

[12]  J. Sanderson Testing Ecological Patterns , 2000, American Scientist.

[13]  Jun S. Liu,et al.  Monte Carlo strategies in scientific computing , 2001 .

[14]  Jason Schweinsberg An O(n 2 ) bound for the relaxation time of a Markov chain on cladograms , 2002 .

[15]  Van H. Vu,et al.  Generating Random Regular Graphs , 2003, STOC '03.

[16]  Yung-Pin Chen,et al.  An Application of Markov Chain Monte Carlo to Community Ecology , 2003, Am. Math. Mon..

[17]  Eric Vigoda,et al.  A polynomial-time approximation algorithm for the permanent of a matrix with nonnegative entries , 2004, JACM.

[18]  Martin E. Dyer,et al.  Sampling regular graphs and a peer-to-peer network , 2005, SODA '05.

[19]  Yuguo Chen,et al.  Sequential Monte Carlo Methods for Statistical Analysis of Tables , 2005 .

[20]  V. Vazirani,et al.  Accelerating simulated annealing for the permanent and combinatorial counting problems , 2006, SODA 2006.

[21]  Eric Vigoda,et al.  Sampling binary contingency tables with a greedy start , 2007 .