StarSpace: Embed All The Things!

We present StarSpace, a general-purpose neural embedding model that can solve a wide variety of problems: labeling tasks such as text classification, ranking tasks such as information retrieval/web search, collaborative filtering-based or content-based recommendation, embedding of multi-relational graphs, and learning word, sentence or document level embeddings. In each case the model works by embedding entities composed of discrete features and comparing them against each other, learning similarities that depend on the task. Empirical results on a number of tasks show that StarSpace is highly competitive with existing methods, whilst also being generally applicable to new cases where those methods are not.
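
To make the embed-and-compare idea concrete, here is a minimal sketch in Python/NumPy. It assumes entities are bags of discrete feature IDs, similarity is a dot product of summed feature vectors, and training uses a margin ranking loss with sampled negatives; these specific choices (names, loss, hyperparameters) are illustrative assumptions, not the paper's reference implementation.

```python
# Minimal sketch: embed entities as sums of discrete-feature vectors and
# train them so that related pairs score higher than sampled unrelated pairs.
import numpy as np

rng = np.random.default_rng(0)
VOCAB, DIM, MARGIN, LR = 1000, 50, 0.2, 0.05
E = rng.normal(scale=0.1, size=(VOCAB, DIM))  # one vector per discrete feature

def embed(feature_ids):
    """An entity (document, label, user, ...) is the sum of its feature vectors."""
    return E[feature_ids].sum(axis=0)

def train_step(lhs_ids, pos_ids, neg_ids):
    """One SGD step on a hinge (margin ranking) loss:
    push sim(lhs, pos) above sim(lhs, neg) by at least MARGIN.
    Assumes the three ID lists contain distinct indices (duplicates are not handled)."""
    lhs, pos, neg = embed(lhs_ids), embed(pos_ids), embed(neg_ids)
    loss = max(0.0, MARGIN - lhs @ pos + lhs @ neg)
    if loss > 0.0:
        # Gradients of the hinge loss w.r.t. the summed entity vectors,
        # distributed back to each contributing feature vector.
        E[lhs_ids] -= LR * (neg - pos)
        E[pos_ids] -= LR * (-lhs)
        E[neg_ids] -= LR * lhs
    return loss

# Usage: text classification framed as ranking -- rank the true label's
# embedding above a sampled incorrect label for the same input text.
doc = [3, 17, 256]                 # feature IDs, e.g. word indices of a document
true_label, wrong_label = [900], [901]
print(train_step(doc, true_label, wrong_label))
```

The same loop covers the other listed tasks by changing what the left- and right-hand entities are: (document, label) for classification, (query, document) for retrieval, (user-as-bag-of-items, item) for recommendation, or (head entity + relation, tail entity) for multi-relational graphs.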
