Aspect and Sentiment Extraction Based on Information-Theoretic Co-clustering

In this paper, we propose an aspect and sentiment extraction method based on information-theoretic co-clustering. Unlike existing feature-based sentiment analysis methods, which process only the explicit associations between feature words and sentiment words, our method also considers the implicit associations among evaluated features, the associations among sentiment words, and the associations between evaluated features and sentiment words. First, the co-occurrence relationships between feature words and sentiment words are represented as a feature-sentiment word matrix. The information-theoretic co-clustering algorithm is then applied to this matrix to cluster evaluated features and sentiment words simultaneously. The resulting clusters of feature words are viewed as different aspects of the evaluated objects, and the clusters of sentiment words associated with each aspect are viewed as aspect-specific sentiment words. Experimental results demonstrate that this method performs well on aspect and sentiment extraction.
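The pipeline described above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy counts, the function names (`build_matrix`, `reassign`, `co_cluster`), and the hand-picked initial cluster labels are all assumptions made for illustration. The update rule follows the general scheme of information-theoretic co-clustering, in which each row (feature word) is moved to the row cluster whose prototype distribution is closest in KL divergence, and columns (sentiment words) are then updated symmetrically.

```python
import math

def build_matrix(pair_counts):
    """Turn {(feature, sentiment): count} into a feature-by-sentiment matrix."""
    features = list(dict.fromkeys(f for f, _ in pair_counts))
    sentiments = list(dict.fromkeys(s for _, s in pair_counts))
    matrix = [[pair_counts.get((f, s), 0) for s in sentiments] for f in features]
    return features, sentiments, matrix

def kl(p, q):
    """KL divergence D(p || q); infinite if q is zero where p is positive."""
    total = 0.0
    for a, b in zip(p, q):
        if a > 0:
            if b == 0:
                return math.inf
            total += a * math.log(a / b)
    return total

def reassign(p, row_labels, col_labels, k_rows, k_cols):
    """Half-step: move each row to the row cluster with the closest prototype."""
    n_rows, n_cols = len(p), len(p[0])
    # Joint probability mass of every (row cluster, column cluster) block.
    block = [[0.0] * k_cols for _ in range(k_rows)]
    for i in range(n_rows):
        for j in range(n_cols):
            block[row_labels[i]][col_labels[j]] += p[i][j]
    p_col = [sum(p[i][j] for i in range(n_rows)) for j in range(n_cols)]
    p_rc = [sum(b) for b in block]  # mass of each row cluster
    p_cc = [sum(block[r][c] for r in range(k_rows)) for c in range(k_cols)]
    # Prototype q(y | row cluster) = p(y | col cluster) * p(col cluster | row cluster).
    proto = [[0.0] * n_cols for _ in range(k_rows)]
    for r in range(k_rows):
        for j in range(n_cols):
            c = col_labels[j]
            if p_rc[r] > 0 and p_cc[c] > 0:
                proto[r][j] = (p_col[j] / p_cc[c]) * (block[r][c] / p_rc[r])
    new_labels = []
    for i in range(n_rows):
        mass = sum(p[i])
        if mass == 0:  # a word with no co-occurrences keeps its label
            new_labels.append(row_labels[i])
            continue
        cond = [v / mass for v in p[i]]  # p(Y | this row)
        new_labels.append(min(range(k_rows), key=lambda r: kl(cond, proto[r])))
    return new_labels

def co_cluster(matrix, row_labels, col_labels, k_rows, k_cols, max_iter=20):
    """Alternate row and column updates until the labels stop changing."""
    total = sum(map(sum, matrix))
    p = [[v / total for v in row] for row in matrix]
    p_t = [list(col) for col in zip(*p)]  # transpose for the column update
    for _ in range(max_iter):
        new_rows = reassign(p, row_labels, col_labels, k_rows, k_cols)
        new_cols = reassign(p_t, col_labels, new_rows, k_cols, k_rows)
        if new_rows == row_labels and new_cols == col_labels:
            break
        row_labels, col_labels = new_rows, new_cols
    return row_labels, col_labels

# Hypothetical co-occurrence counts (not data from the paper).
observed = {
    ("screen", "clear"): 3, ("screen", "bright"): 2,
    ("display", "clear"): 2, ("display", "bright"): 3,
    ("battery", "durable"): 3, ("battery", "long-lasting"): 2,
    ("power", "durable"): 2, ("power", "long-lasting"): 3,
}
features, sentiments, matrix = build_matrix(observed)
# Deliberately imperfect starting labels, chosen by hand for this example.
rows, cols = co_cluster(matrix, [0, 0, 0, 1], [0, 0, 0, 1], k_rows=2, k_cols=2)
aspects = {k: [f for f, r in zip(features, rows) if r == k] for k in set(rows)}
aspect_sentiments = {k: [s for s, c in zip(sentiments, cols) if c == k] for k in set(cols)}
print(aspects, aspect_sentiments)
```

On this toy input, "screen" and "display" never co-occur with each other, yet they end up in the same cluster because they share sentiment words, which is the kind of implicit association the abstract refers to. The procedure only reaches a local optimum, so the result depends on the starting labels; a practical implementation would likely try several initializations.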
