Multi-armed bandits in the presence of side observations in social networks
暂无分享,去创建一个
Atilla Eryilmaz | Ness B. Shroff | Swapna Buccapatnam | A. Eryilmaz | N. Shroff | Swapna Buccapatnam
[1] T. L. Lai Andherbertrobbins. Asymptotically Efficient Adaptive Allocation Rules , 2022 .
[2] Csaba Szepesvári,et al. Exploration-exploitation tradeoff using variance estimates in multi-armed bandits , 2009, Theor. Comput. Sci..
[3] M. Jackson,et al. An Economic Model of Friendship: Homophily, Minorities and Segregation , 2007 .
[4] Shie Mannor,et al. From Bandits to Experts: On the Value of Side-Observations , 2011, NIPS.
[5] R. Agrawal. Sample mean based index policies by O(log n) regret for the multi-armed bandit problem , 1995, Advances in Applied Probability.
[6] D. Kandel. Homophily, Selection, and Socialization in Adolescent Friendships , 1978, American Journal of Sociology.
[7] Marc Lelarge,et al. Leveraging Side Observations in Stochastic Bandits , 2012, UAI.
[8] Nicolò Cesa-Bianchi,et al. Gambling in a rigged casino: The adversarial multi-armed bandit problem , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.
[9] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[10] Albert,et al. Emergence of scaling in random networks , 1999, Science.