Paper Information - Siamese Network-Based Supervised Topic Modeling

Label-specific topics are widely used to support personality psychology, aspect-level sentiment analysis, and cross-domain sentiment classification. To generate label-specific topics, several supervised topic models that adopt likelihood-driven objective functions have been proposed. However, it is hard for these models to obtain precise estimates for both topic discovery and supervised learning. In this study, we propose a supervised topic model based on the Siamese network, which can trade off label-specific word distributions against document-specific label distributions in a unified framework. Experiments on real-world datasets validate that our model performs competitively in topic discovery, both quantitatively and qualitatively. Furthermore, the proposed model can effectively predict categorical or real-valued labels for new documents by generating word embeddings from a label-specific topical space.
