Threshold Rules for Online Sample Selection

We consider the following sample selection problem. We observe in an online fashion a sequence of samples, each endowed by a quality. Our goal is to either select or reject each sample, so as to maximize the aggregate quality of the subsample selected so far. There is a natural trade-off here between the rate of selection and the aggregate quality of the subsample. We show that for a number of such problems extremely simple and oblivious "threshold rules" for selection achieve optimal tradeoffs between rate of selection and aggregate quality in a probabilistic sense. In some cases we show that the same threshold rule is optimal for a large class of quality distributions and is thus oblivious in a strong sense.

[1]  Jeffrey Xu Yu,et al.  Probabilistic Skyline Operator over Sliding Windows , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[2]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[3]  Thomas S. Ferguson,et al.  Who Solved the Secretary Problem , 1989 .

[4]  R. F.,et al.  Mathematical Statistics , 1944, Nature.

[5]  Edith Cohen,et al.  Optimal oblivious routing in polynomial time , 2003, STOC '03.

[6]  Edith Cohen,et al.  Optimal oblivious routing in polynomial time , 2004, J. Comput. Syst. Sci..

[7]  D. Owen Handbook of Mathematical Functions with Formulas , 1965 .

[8]  B. K. Ghosh,et al.  Handbook of sequential analysis , 1991 .

[9]  Sergei Vassilvitskii,et al.  The hiring problem and Lake Wobegon strategies , 2008, SODA '08.

[10]  H. T. Kung,et al.  On the Average Number of Maxima in a Set of Vectors and Applications , 1978, JACM.

[11]  Mikhail J. Atallah,et al.  Computing all skyline probabilities for uncertain data , 2009, PODS.

[12]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[13]  Guevara Noubir,et al.  Universal approximations for TSP, Steiner tree, and set cover , 2005, STOC '05.

[14]  M. Abramowitz,et al.  Handbook of Mathematical Functions With Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series No. 55) , 1965 .

[15]  P. Freeman The Secretary Problem and its Extensions: A Review , 1983 .

[16]  Mohammad Taghi Hajiaghayi,et al.  Oblivious network design , 2006, SODA '06.

[17]  E. Platen,et al.  About secretary problems , 1980 .