Label Partitioning For Sublinear Ranking

We consider the problem of ranking a very large set of labels, items, or documents, which is common to information retrieval, recommendation, and large-scale annotation tasks. We present a general approach for converting an algorithm whose scoring time is linear in the size of the label set into a sublinear one via label partitioning. Our method learns both a partitioning of the input space and an assignment of labels to each partition so as to optimize precision at k, the loss function of interest in this setting. Experiments on large-scale ranking and recommendation tasks show that our method not only makes the original linear-time algorithm computationally tractable, but can also improve its accuracy.
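To make the two-stage structure concrete, below is a minimal sketch of sublinear inference under label partitioning. It assumes a generic base scorer `score(x, label)`; the partitioner here is plain k-means over inputs and the label assignment is passed in as given, whereas the paper learns the partition and assignment to optimize precision at k. All names (`PartitionedRanker`, `score`, `candidate_labels_per_partition`) are illustrative, not from the paper.

```python
import numpy as np
from sklearn.cluster import KMeans

class PartitionedRanker:
    """Sketch: rank only the labels assigned to the input's partition."""

    def __init__(self, score, n_partitions=100):
        self.score = score              # base model: score(x, label) -> float
        self.kmeans = KMeans(n_clusters=n_partitions, n_init=10)
        self.assignments = None         # partition id -> list of label ids

    def fit(self, X, candidate_labels_per_partition):
        # Step 1: partition the input space. Here this is unsupervised
        # k-means; the paper instead learns the partition for the ranking loss.
        self.kmeans.fit(X)
        # Step 2: store a short label list per partition. Here it is taken
        # as given; the paper optimizes this assignment for precision at k.
        self.assignments = candidate_labels_per_partition

    def rank(self, x, k=10):
        # Inference is sublinear in the label set: find the partition for x,
        # then run the linear-time scorer only over that partition's labels.
        p = self.kmeans.predict(x.reshape(1, -1))[0]
        candidates = self.assignments[p]
        ranked = sorted(candidates, key=lambda l: self.score(x, l), reverse=True)
        return ranked[:k]
```

With P partitions of roughly L/P labels each, scoring a query touches only the candidates in one partition rather than all L labels, which is where the sublinear speedup comes from; accuracy then hinges on how well the learned assignment keeps each input's true top-k labels inside its partition.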
