A Tale of Three Couplings: Poisson–Dirichlet and GEM Approximations for Random Permutations

For a random permutation of $n$ objects, as $n \to \infty$, the process giving the proportion of elements in the longest cycle, the second-longest cycle, and so on, converges in distribution to the Poisson–Dirichlet process with parameter 1. This was proved in 1977 by Kingman and by Vershik and Schmidt. For soft reasons, this is equivalent to the statement that the random permutations and the Poisson–Dirichlet process can be coupled so that the expected $\ell_1$ distance between the process of cycle length proportions and the Poisson–Dirichlet process tends to zero. We investigate how rapid this metric convergence can be, and in doing so, give two new proofs of the distributional convergence. One of the couplings we consider has an analogue for the prime factorizations of a uniformly distributed random integer, and these couplings rely on the ‘scale-invariant spacing lemma’ for the scale-invariant Poisson processes, proved in this paper.
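To make the notion of a coupling concrete, the following Python sketch builds one naive coupling; it is an illustration only and is not claimed to be any of the three couplings studied in the paper. It relies on two standard facts: the cycle containing a fixed element of a uniform random permutation has length uniform on $\{1, \dots, n\}$, with the rest again a uniform permutation of the remaining elements, and the GEM(1) stick-breaking sequence $U_1, (1-U_1)U_2, \dots$ with independent uniform $U_i$ has Poisson–Dirichlet(1) ranked values. The names `coupled_sample` and `l1_distance` are hypothetical, and the GEM sequence is truncated once the permutation is exhausted, so the reported distance is only an upper bound for this particular coupling.

```python
import random


def coupled_sample(n, rng):
    """Drive cycle lengths and GEM(1) sticks with the same uniforms.

    Returns (cycles, sticks, leftover): cycle length proportions and
    truncated GEM(1) pieces, each sorted in decreasing order, plus the
    GEM mass not yet broken off when the permutation runs out.
    """
    cycles, sticks = [], []
    remaining_n = n          # elements not yet assigned to a cycle
    remaining_mass = 1.0     # stick mass not yet broken off
    while remaining_n > 0:
        u = rng.random()     # uniform on [0, 1)
        # Discretize u to a cycle length uniform on 1..remaining_n.
        length = min(remaining_n, int(u * remaining_n) + 1)
        cycles.append(length / n)
        # Break the same fraction u off the remaining stick.
        sticks.append(remaining_mass * u)
        remaining_n -= length
        remaining_mass *= 1.0 - u
    cycles.sort(reverse=True)
    sticks.sort(reverse=True)
    return cycles, sticks, remaining_mass


def l1_distance(cycles, sticks, leftover):
    """Sum of |cycle_i - stick_i| plus the truncated GEM mass (an upper bound)."""
    return sum(abs(a - b) for a, b in zip(cycles, sticks)) + leftover


if __name__ == "__main__":
    rng = random.Random(2024)  # arbitrary seed, for reproducibility only
    trials = 2000
    for n in (100, 1000, 10000):
        total = 0.0
        for _ in range(trials):
            total += l1_distance(*coupled_sample(n, rng))
        print(n, total / trials)
```

Running the script prints a Monte Carlo estimate of the expected $\ell_1$ bound for several values of $n$; how fast such a quantity can be made to decay, over all possible couplings, is the question the paper addresses.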

[1]  William Feller,et al.  The fundamental limit theorems in probability , 1945 .

[2]  Steinar Engen,et al.  A note on the geometric series as a species frequency model , 1975 .

[3]  Simon Tavaré,et al.  The Poisson–Dirichlet Distribution and the Scale-Invariant Poisson Process , 1999, Combinatorics, Probability and Computing.

[4]  S. Rachev,et al.  Mass transportation problems , 1998 .

[5]  A. Vershik,et al.  Limit Measures Arising in the Asymptotic Theory of Symmetric Groups. I. , 1977 .

[6]  R. Arratia,et al.  Logarithmic Combinatorial Structures: A Probabilistic Approach , 2003 .

[7]  R. M. Dudley,et al.  Real Analysis and Probability , 1989 .

[8]  J. McCloskey,et al.  A model for the distribution of individuals by species in an environment , 1965 .

[9]  Ts. G. Ignatov.  On a Constant Arising in the Asymptotic Theory of Symmetric Groups, and on Poisson–Dirichlet Measures , 1982 .

[10]  Patrick Billingsley,et al.  On the distribution of large prime divisors , 1972 .

[11]  G. Dall'aglio.  Sugli estremi dei momenti delle funzioni di ripartizione doppia , 1956 .

[12]  R. Arratia,et al.  The Cycle Structure of Random Permutations , 1992 .

[13]  R. Arratia,et al.  Poisson Process Approximations for the Ewens Sampling Formula , 1992 .

[14]  J. Kingman.  Random Discrete Distributions , 1975 .

[15]  J. Kingman.  The population structure associated with the Ewens sampling formula , 1977, Theoretical population biology.

[16]  Robert C. Griffiths,et al.  On the distribution of allele frequencies in a diffusion model , 1979 .

[17]  W. Ewens.  The sampling theory of selectively neutral alleles , 1972, Theoretical population biology.