暂无分享,去创建一个
[1] Abhimanyu Dubey,et al. Cooperative Multi-Agent Bandits with Heavy Tails , 2020, ICML.
[2] T. L. Lai Andherbertrobbins. Asymptotically Efficient Adaptive Allocation Rules , 2022 .
[3] Varun Kanade,et al. Distributed Non-Stochastic Experts , 2012, NIPS.
[4] Seif Haridi,et al. Distributed Algorithms , 1992, Lecture Notes in Computer Science.
[5] Varun Kanade,et al. Decentralized Cooperative Stochastic Bandits , 2018, NeurIPS.
[6] Sanjay Shakkottai,et al. Social Learning in Multi Agent Multi Armed Bandits , 2019, Proc. ACM Meas. Anal. Comput. Syst..
[7] Kamyar Azizzadenesheli,et al. Multi-Agent Multi-Armed Bandits with Limited Communication , 2021, J. Mach. Learn. Res..
[8] Aurélien Garivier,et al. The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond , 2011, COLT.
[9] Csaba Szepesvari,et al. Bandit Algorithms , 2020 .
[10] Sanjay Shakkottai,et al. The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits , 2020, AISTATS.
[11] Nicolò Cesa-Bianchi,et al. Cooperative Online Learning: Keeping your Neighbors Updated , 2019, ALT.
[12] Rémi Munos,et al. A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences , 2011, COLT.
[13] István Hegedüs,et al. Gossip-based distributed stochastic bandit algorithms , 2013, ICML.