Learning Lexical Subjectivity Strength for Chinese Opinionated Sentence Identification

Lexical subjectivity strength has proven to be of great value to subjectivity classification. However, the quantitative calculation of lexical subjectivity strength has not yet been much explored. This paper presents a fuzzy set based approach to automatically learn lexical subjectivity strength for Chinese opinionated sentence identification. To approach this task, log-linear probabilities are employed to extract a set of subjective words from opinionated sentences, and three fuzzy sets, namely low-strength subjectivity, medium-strength subjectivity and high-strength subjectivity, are then defined to represent their respective classes of subjectivity strength. Furthermore, three membership functions are built to indicate the degrees of subjective words in different fuzzy sets. Finally, the acquired lexical subjective strength is further exploited to perform subjectivity classification. The experimental results on the NTCIR-7 MOAT data demonstrate that the introduction of lexical subjective strength is beneficial to subjectivity classification.

[1]  Ellen Riloff,et al.  Learning Extraction Patterns for Subjective Expressions , 2003, EMNLP.

[2]  Hsin-Hsi Chen,et al.  Overview of Multilingual Opinion Analysis Task at NTCIR-7 , 2008, NTCIR.

[3]  Xin Wang,et al.  Chinese Sentence-Level Sentiment Classification Based on Fuzzy Sets , 2010, COLING.

[4]  Valentin Jijkoun,et al.  Generating Focused Topic-Specific Sentiment Lexicons , 2010, ACL.

[5]  Hongfei Yan,et al.  Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid , 2010, EMNLP.

[6]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[7]  Claire Cardie,et al.  Identifying Expressions of Opinion in Context , 2007, IJCAI.

[8]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[9]  Xin Wang,et al.  Chinese Sentence-Level Sentiment Classification Based on Sentiment Morphemes , 2010, 2010 International Conference on Asian Language Processing.

[10]  Janyce Wiebe,et al.  Learning Subjective Language , 2004, CL.

[11]  Hsin-Hsi Chen,et al.  Overview of Opinion Analysis Pilot Task at NTCIR-6 , 2007, NTCIR.

[12]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[13]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[14]  Kam-Fai Wong,et al.  Coarse-Fine Opinion Mining - WIA in NTCIR-7 MOAT Task , 2008, NTCIR.

[15]  Fiona I. B. Ngô,et al.  Sense and Subjectivity , 2011 .

[16]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[17]  Mike Thelwall,et al.  A Study of Information Retrieval Weighting Schemes for Sentiment Analysis , 2010, ACL.

[18]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[19]  Janyce Wiebe Subjectivity Word Sense Disambiguation , 2009, EMNLP 2009.

[20]  Nitin Indurkhya,et al.  Handbook of Natural Language Processing , 2010 .

[21]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[22]  Rada Mihalcea,et al.  Word Sense and Subjectivity , 2006, ACL.

[23]  Ellen Riloff,et al.  Exploiting Subjectivity Classification to Improve Information Extraction , 2005, AAAI.

[24]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.