Using Bilingual Lexicon to Judge Sentiment Orientation of Chinese Words

It is a challenging task to identify sentiments (the affective parts of options) of reviews. One of the most important problems is to predict the sentiment orientation of the words. This paper proposes a new method for determining the sentiment orientation of the Chinese words by using bilingual lexicons. Given a Chinese word, we observe the occurrences of English sentiment words in its interpretations, to predict the sentiment orientation of the Chinese word. The whole process can be illustrated logically as follows: (1) translate a Chinese word into Chinese-English interpretation; (2) generate an English word sequence by parsing the interpretation; (3) calculate the sentiment vector from the English word sequence; (4) use a classifier to predict the sentiment orientation for the Chinese word. The performance of two kinds of classifiers (SVM and C4.5) is studied. The experiments show that the proposed method performed well and achieved high accuracy.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Mike Y. Chen,et al.  Yahoo! For Amazon: Sentiment Parsing from Small Talk on the Web , 2001 .

[3]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[4]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[5]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[6]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[7]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[8]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[9]  Jian Liu,et al.  Sentiment classification using phrase patterns , 2004, The Fourth International Conference onComputer and Information Technology, 2004. CIT '04..

[10]  Alistair Kennedy,et al.  Sentiment Classification of Movie and Product Reviews Using Contextual Valence Shifters , 2005 .

[11]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[12]  Janyce Wiebe,et al.  Identifying Collocations for Recognizing Opinions , 2001 .

[13]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[14]  Hiroya Takamura,et al.  Sentiment Classification Using Word Sub-sequences and Dependency Sub-trees , 2005, PAKDD.

[15]  Samy Bengio,et al.  SVMTorch: Support Vector Machines for Large-Scale Regression Problems , 2001, J. Mach. Learn. Res..

[16]  M. de Rijke,et al.  UvA-DARE ( Digital Academic Repository ) Using WordNet to measure semantic orientations of adjectives , 2004 .