Weighted sampling without replacement

Comparing concentration properties of uniform sampling with and without replacement has a long history which can be traced back to the pioneer work of Hoeffding (1963). The goal of this short note is to extend this comparison to the case of non-uniform weights, using a coupling between the two samples. When the items' weights are arranged in the same order as their values, we show that the induced coupling for the cumulative values is a submartingale coupling. As a consequence, the powerful Chernoff-type upper-tail estimates known for sampling with replacement automatically transfer to the case of sampling without replacement. For general weights, we use the same coupling to establish a sub-Gaussian concentration inequality. We also construct another martingale coupling which allows us to answer a question raised by Luh and Pippenger (2014) on sampling in Polya urns with different replacement numbers.

[1]  Odalric-Ambrym Maillard,et al.  Concentration inequalities for sampling without replacement , 2013, 1309.4029.

[2]  J. Pitman,et al.  Size-biased permutation of a finite sequence with independent and identically distributed terms , 2012, 1210.7856.

[3]  Kyle Luh,et al.  Large-Deviation Bounds for Sampling without Replacement , 2014, Am. Math. Mon..

[4]  Yaming Yu On the inclusion probabilities in some unequal probability sampling plans without replacement , 2010, 1005.4107.

[5]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[6]  Stochastic Orders , 2008 .

[7]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[8]  Mark A. McComb Comparison Methods for Stochastic Models and Risks , 2003, Technometrics.

[9]  Gábor Lugosi,et al.  Concentration Inequalities , 2008, COLT.

[10]  J. Pitman,et al.  The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator , 1997 .

[11]  R. Szekli Stochastic Ordering and Dependence in Applied Probability , 1995 .

[12]  N. Fisher,et al.  Probability Inequalities for Sums of Bounded Random Variables , 1994 .

[13]  Kenneth S. Alexander A Counterexample to a Correlation Inequality in Finite Sampling , 1989 .

[14]  L. Gordon Successive Sampling in Large Finite Populations , 1983 .

[15]  K. Joag-dev,et al.  Negative Association of Random Variables with Applications , 1983 .

[16]  Béla Bollobás,et al.  A Probabilistic Proof of an Asymptotic Formula for the Number of Labelled Regular Graphs , 1980, Eur. J. Comb..

[17]  D. A. Edwards On the existence of probability measures with given marginals , 1978 .

[18]  R. Serfling Probability Inequalities for the Sum in Sampling without Replacement , 1974 .

[19]  Lars Holst Some Limit Theorems with Applications in Sampling Theory , 1973 .

[20]  B. Rosén Asymptotic Theory for Successive Sampling with Varying Probabilities Without Replacement, II , 1972 .

[21]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .