A Fast and Accurate Method for Bilingual Opinion Lexicon Extraction

The explosion of multilingual opinion data has made essential the need for automatic tools to analyze and understand people's opinions in multilingual reviews. In cross-language opinion mining, bilingual opinion lexicon plays a central role. Most of the bilingual lexicon extraction methods are based on EM algorithm. However, these methods are faced with two major problems: high complexity and unsatisfying precision. In this paper, we propose a novel approach to extract bilingual opinion lexicon using collocation alignment, where a collocation is a combination of a feature word and an opinion word. There are tight association between feature words and opinion words, which can be helpful to reduce the computation space and alignment errors. Experimental results demonstrate that our solution is effective and competitive.

[1]  Claire Cardie,et al.  Topic Identification for Fine-Grained Opinion Analysis , 2008, COLING.

[2]  Ari Rappoport,et al.  Bilingual Lexicon Generation Using Non-Aligned Signatures , 2010, ACL.

[3]  Benno Stein,et al.  Cross-Language Text Classification Using Structural Correspondence Learning , 2010, ACL.

[4]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[5]  Dekai Wu,et al.  Learning an English-Chinese Lexicon from a Parallel Corpus , 1994, AMTA.

[6]  Xinying Xu,et al.  Hidden sentiment association in chinese web opinion mining , 2008, WWW.

[7]  Oren Etzioni,et al.  OPINE: Extracting Product Features and Opinions from Reviews , 2005, HLT/EMNLP.

[8]  Kenneth Ward Church,et al.  K-vec: A New Approach for Aligning Parallel Texts , 1994, COLING.

[9]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[10]  Takashi Inui,et al.  Extracting Semantic Orientations of Phrases from Dictionary , 2007, NAACL.

[11]  Lei Shi,et al.  Cross Language Text Classification by Model Translation and Semi-Supervised Learning , 2010, EMNLP.

[12]  Janyce Wiebe,et al.  Learning Subjective Adjectives from Corpora , 2000, AAAI/IAAI.

[13]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[14]  Rada Mihalcea,et al.  Learning Multilingual Subjective Language via Cross-Lingual Projections , 2007, ACL.

[15]  EstimationPeter,et al.  The Mathematics of Machine Translation : Parameter , 2004 .

[16]  John Shawe-Taylor,et al.  The use of machine translation tools for cross-lingual text mining , 2005 .

[17]  Pascale Fung,et al.  A statistical view on bilingual lexicon extraction , 1998, AMTA.

[18]  Kenneth Ward Church,et al.  A Program for Aligning Sentences in Bilingual Corpora , 1993, CL.

[19]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[20]  Zheng Lin,et al.  Language-independent sentiment classification using three common words , 2011, CIKM '11.

[21]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[22]  Pascale Fung,et al.  Mining Very-Non-Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and E , 2004, EMNLP.

[23]  Philippe Langlais,et al.  Revisiting Context-based Projection Methods for Term-Translation Spotting in Comparable Corpora , 2010, COLING.

[24]  Claire Cardie,et al.  Identifying Expressions of Opinion in Context , 2007, IJCAI.

[25]  Núria Bel,et al.  Cross-Lingual Text Categorization , 2003, ECDL.

[26]  Pascale Fung,et al.  A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-parallel Corpora , 1998, AMTA.

[27]  Songbo Tan,et al.  A novel scheme for domain-transfer problem in the context of sentiment analysis , 2007, CIKM '07.

[28]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[29]  Vasileios Hatzivassiloglou,et al.  Translating Collocations for Bilingual Lexicons: A Statistical Approach , 1996, CL.

[30]  Julian Kupiec,et al.  An Algorithm for Finding Noun Phrase Correspondences in Bilingual Corpora , 1993, ACL.

[31]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[32]  Slava M. Katz,et al.  Principled Disambiguation: Discriminating Adjective Senses with Modified Nouns , 1995, CL.

[33]  Eric Chang,et al.  Red Opal: product-feature scoring from reviews , 2007, EC '07.