Consistent ordered sampling distributions : characterization and convergence

This paper is concerned with models for sampling from populations in which there exists a total order on the collection of types, but only the relative ordering of types which actually appear in the sample is known. The need for consistency between different sample sizes limits the possible models to what are here called 'consistent ordered sampling distributions'. We give conditions under which weak convergence of population distributions implies convergence of sampling distributions and conversely those under which population convergence may be inferred from convergence of sampling distributions. A central result exhibits a collection of 'ordered sampling functions', none of which is continuous, which separates measures in a certain class. More generally, we characterize all consistent ordered sampling distributions, proving an analogue of de Finetti's theorem in this context. These results are applied to an unsolved problem in genetics where it is shown that equilibrium age-ordered population allele frequencies for a wide class of exchangeable reproductive models converge weakly, as the population size becomes large, to the so-called GEM distribution. This provides an alternative characterization which is more informative and often more convenient than Kingman's (1977) characterization in terms of the Poisson-Dirichlet distribution.

[1]  Peter Donnelly,et al.  WEAK CONVERGENCE TO A MARKOV CHAIN WITH AN ENTRANCE BOUNDARY: ANCESTRAL PROCESSES IN POPULATION GENETICS' , 1991 .

[2]  W. Ewens Population Genetics Theory - The Past and the Future , 1990 .

[3]  P. Donnelly,et al.  The population genealogy of the infinitely-many neutral alleles model , 1987, Journal of mathematical biology.

[4]  P. Donnelly,et al.  Partition structures, Polya urns, the Ewens sampling formula, and the ages of alleles. , 1986, Theoretical population biology.

[5]  P. Donnelly,et al.  The ages of alleles and a coalescent , 1986, Advances in Applied Probability.

[6]  École d'été de probabilités de Saint-Flour,et al.  École d'été de probabilités de Saint-Flour XIII - 1983 , 1985 .

[7]  D. Aldous Exchangeability and related topics , 1985 .

[8]  S. Tavaré,et al.  Line-of-descent and genealogical processes, and their applications in population genetics models. , 1984, Theoretical population biology.

[9]  G. A. Watterson Lines of descent and the coalescent , 1984 .

[10]  J. Kingman On the genealogy of large populations , 1982, Journal of Applied Probability.

[11]  J. Kingman The Representation of Partition Structures , 1978 .

[12]  J. Kingman Random partitions in population genetics , 1978, Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences.

[13]  J. Kingman The population structure associated with the Ewens sampling formula. , 1977, Theoretical population biology.

[14]  C. Cannings The latent roots of certain Markov chains arising in genetics: A new approach, II. Further haploid models , 1974, Advances in Applied Probability.

[15]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[16]  E. Davies,et al.  Tensor Products, Infinite Products, and Projective Limits of Choquet Simplexes. , 1968 .