暂无分享,去创建一个
[1] H. S. Shapiro,et al. A Combinatory Detection Problem , 1963 .
[2] Leslie G. Valiant,et al. A theory of the learnable , 1984, CACM.
[3] Paul Erdgs,et al. ON TWO PROBLEMS OF INFORMATION THEORY bY PAUL ERDGS and ALFRJ~D RgNYI , 2001 .
[4] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[5] John N. Tsitsiklis,et al. The Sample Complexity of Exploration in the Multi-Armed Bandit Problem , 2004, J. Mach. Learn. Res..
[6] Baruch Awerbuch,et al. Adaptive routing with end-to-end feedback: distributed learning and geometric approaches , 2004, STOC '04.
[7] K. Horadam. Hadamard Matrices and Their Applications , 2006 .
[8] Shie Mannor,et al. Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems , 2006, J. Mach. Learn. Res..
[9] H. Robbins. Some aspects of the sequential design of experiments , 1952 .
[10] Thomas P. Hayes,et al. Stochastic Linear Optimization under Bandit Feedback , 2008, COLT.
[11] D. Đoković. Hadamard matrices of order 764 exist , 2008 .
[12] Nicolò Cesa-Bianchi,et al. Combinatorial Bandits , 2012, COLT.
[13] Rémi Munos,et al. Pure Exploration in Multi-armed Bandits Problems , 2009, ALT.
[14] Peter Stone,et al. Efficient Selection of Multiple Bandit Arms: Theory and Practice , 2010, ICML.
[15] Csaba Szepesvári,et al. Improved Algorithms for Linear Stochastic Bandits , 2011, NIPS.
[16] Wei Chu,et al. Contextual Bandits with Linear Payoff Functions , 2011, AISTATS.
[17] Nader H. Bshouty. On the Coin Weighing Problem with the Presence of Noise , 2012, APPROX-RANDOM.
[18] Ambuj Tewari,et al. PAC Subset Selection in Stochastic Multi-armed Bandits , 2012, ICML.
[19] Wei Chen,et al. Combinatorial Multi-Armed Bandit: General Framework and Applications , 2013, ICML.
[20] Shivaram Kalyanakrishnan,et al. Information Complexity in Bandit Subset Selection , 2013, COLT.
[21] Sébastien Bubeck,et al. Multiple Identifications in Multi-Armed Bandits , 2012, ICML.
[22] Wei Chen,et al. Combinatorial Partial Monitoring Game with Linear Feedback and Its Applications , 2014, ICML.
[23] Gábor Lugosi,et al. Mathematics of operations research , 1998 .
[24] Wei Chen,et al. Combinatorial Pure Exploration of Multi-Armed Bandits , 2014, NIPS.
[25] Zheng Wen,et al. Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits , 2014, AISTATS.
[26] Alexandre Proutière,et al. Combinatorial Bandits Revisited , 2015, NIPS.
[27] Wei Chen,et al. Combinatorial Multi-Armed Bandit with General Reward Functions , 2016, NIPS.
[28] Tamir Hazan,et al. Tight Bounds for Bandit Combinatorial Optimization , 2017, COLT.
[29] Vaneet Aggarwal,et al. Regret Bounds for Stochastic Combinatorial Multi-Armed Bandits with Linear Space Complexity , 2018, ArXiv.
[30] Aleksandrs Slivkins,et al. Introduction to Multi-Armed Bandits , 2019, Found. Trends Mach. Learn..
[31] Shie Mannor,et al. Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem , 2019, COLT.
[32] Masashi Sugiyama,et al. Polynomial-time Algorithms for Combinatorial Pure Exploration with Full-bandit Feedback , 2019, ArXiv.