论文信息 - CSE: Conceptual Sentence Embeddings based on Attention Model - 字舞流文

CSE: Conceptual Sentence Embeddings based on Attention Model

Most sentence embedding models typically represent each sentence only using word surface, which makes these models indiscriminative for ubiquitous homonymy and polysemy. In order to enhance representation capability of sentence, we employ conceptualization model to assign associated concepts for each sentence in the text corpus, and then learn conceptual sentence embedding (CSE). Hence, this semantic representation is more expressive than some widely-used text representation models such as latent topic model, especially for short-text. Moreover, we further extend CSE models by utilizing a local attention-based model that select relevant words within the context to make more efficient prediction. In the experiments, we evaluate the CSE models on two tasks, text classification and information retrieval. The experimental results show that the proposed models outperform typical sentence embed-ding models.

Qiang Zhou | Heyan Huang | Yashen Wang | Chong Feng | Xiong Gao | Jiahui Gu | Heyan Huang | Jiahui Gu | Yashen Wang | Chong Feng | Qiang Zhou | Xiong Gao | Jiahui Gu

[1] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[2] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[3] Alexander J. Smola,et al. Scalable inference in latent variable models , 2012, WSDM '12.

[4] Yoshua Bengio,et al. Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.

[5] Larry P. Heck,et al. Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[6] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.

[7] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8] Craig MacDonald,et al. Overview of the TREC-2012 Microblog Track , 2012, Text Retrieval Conference.

[9] Haixun Wang,et al. Short Text Conceptualization Using a Probabilistic Knowledgebase , 2011, IJCAI.

[10] Zellig S. Harris,et al. Distributional Structure , 1954 .

[11] Ramón Fernández Astudillo,et al. Not All Contexts Are Created Equal: Better Word Representations with Variable Attention , 2015, EMNLP.

[12] Qun Liu,et al. Syntax-based Deep Matching of Short Texts , 2015, IJCAI.

[13] Jason Weston,et al. A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[14] Alexander J. Smola,et al. An architecture for parallel topic models , 2010, Proc. VLDB Endow..

[15] Zhiyuan Liu,et al. Topical Word Embeddings , 2015, AAAI.

[16] Iadh Ounis,et al. Overview of the TREC 2011 Microblog Track , 2011, TREC.

[17] Dan Roth,et al. Learning Question Classifiers , 2002, COLING.

[18] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.

[19] Rabab Kreidieh Ward,et al. Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20] Haixun Wang,et al. Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach , 2015, IJCAI.

[21] Bowen Zhou,et al. Dependency-based Convolutional Neural Networks for Sentence Embedding , 2015, ACL.

[22] Haixun Wang,et al. Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[23] Michael McGill,et al. Introduction to Modern Information Retrieval , 1983 .

[24] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[25] Kevin Gimpel,et al. Towards Universal Paraphrastic Sentence Embeddings , 2015, ICLR.

[26] Kevin Gimpel,et al. Tailoring Continuous Word Representations for Dependency Parsing , 2014, ACL.

[27] Xiaofeng Meng,et al. Query Understanding through Knowledge-Based Conceptualization , 2015, IJCAI.

[28] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[29] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[30] M. de Rijke,et al. A syntax-aware re-ranker for microblog retrieval , 2014, SIGIR.