On Learning the Optimal Waiting Time
暂无分享,去创建一个
[1] J. Kiefer,et al. Asymptotic Minimax Character of the Sample Distribution Function and of the Classical Multinomial Estimator , 1956 .
[2] D. Teneketzis,et al. Asymptotically Efficient Adaptive Allocation Schemes for Controlled I.I.D. Processes: Finite Paramet , 1988 .
[3] P. Massart. The Tight Constant in the Dvoretzky-Kiefer-Wolfowitz Inequality , 1990 .
[4] C. B. Morgan. Truncated and Censored Samples, Theory and Applications , 1993 .
[5] Philip M. Long,et al. Adaptive Disk Spindown via Optimal Rent-to-Buy in Probabilistic Environments , 1999, Algorithmica.
[6] Luc Devroye,et al. Combinatorial methods in density estimation , 2001, Springer series in statistics.
[7] Frank Thomson Leighton,et al. The value of knowing a demand curve: bounds on regret for online posted-price auctions , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..
[8] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[9] Kuzman Ganchev,et al. Censored exploration and the dark pool problem , 2009, UAI.
[10] Shie Mannor,et al. From Bandits to Experts: On the Value of Side-Observations , 2011, NIPS.
[11] Csaba Szepesvári,et al. Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments , 2011, COLT.
[12] Celso C. Ribeiro,et al. Exploiting run time distributions to compare sequential and parallel stochastic local search algorithms , 2012, J. Glob. Optim..
[13] Dean P. Foster,et al. No Internal Regret via Neighborhood Watch , 2011, AISTATS.
[14] Noga Alon,et al. From Bandits to Experts: A Tale of Domination and Independence , 2013, NIPS.
[15] Gábor Bartók,et al. A near-optimal algorithm for finite partial-monitoring games against adversarial opponents , 2013, COLT.
[16] Gábor Lugosi,et al. Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.