Efficient bandit algorithms for online multiclass prediction
暂无分享,去创建一个
Ambuj Tewari | Sham M. Kakade | Shai Shalev-Shwartz | S. Kakade | S. Shalev-Shwartz | Ambuj Tewari | Shai Shalev-Shwartz
[1] F ROSENBLATT,et al. The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.
[2] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.
[3] N. Littlestone. Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).
[4] Nicolò Cesa-Bianchi,et al. Gambling in a rigged casino: The adversarial multi-armed bandit problem , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.
[5] Manfred K. Warmuth,et al. Exponentiated Gradient Versus Gradient Descent for Linear Predictors , 1997, Inf. Comput..
[6] Yoav Freund,et al. Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.
[7] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[8] Jason Weston,et al. Support vector machines for multi-class pattern recognition , 1999, ESANN.
[9] Jason Weston,et al. A kernel method for multi-labelled classification , 2001, NIPS.
[10] Koby Crammer,et al. Ultraconservative Online Algorithms for Multiclass Problems , 2001, J. Mach. Learn. Res..
[11] Koby Crammer,et al. Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..
[12] Robert D. Kleinberg. Nearly Tight Bounds for the Continuum-Armed Bandit Problem , 2004, NIPS.
[13] Adam Tauman Kalai,et al. Online convex optimization in the bandit setting: gradient descent without a gradient , 2004, SODA '05.
[14] Adam Tauman Kalai,et al. Online convex optimization in the bandit setting , 2005, SODA 2005.
[15] Santosh S. Vempala,et al. An algorithmic theory of learning: Robust concepts and random projection , 1999, Machine Learning.
[16] Yoram Singer,et al. Online multiclass learning by interclass hypothesis sharing , 2006, ICML.
[17] Yoram Singer,et al. A primal-dual perspective of online learning algorithms , 2007, Machine Learning.
[18] J. Langford,et al. The Epoch-Greedy algorithm for contextual multi-armed bandits , 2007, NIPS 2007.