Aspect-Level Cross-lingual Sentiment Classification with Constrained SMT

Most cross-lingual sentiment classification (CLSC) research so far has been performed at sentence or document level. Aspect-level CLSC, which is more appropriate for many applications, presents the additional difficulty that we consider subsentential opinionated units which have to be mapped across languages. In this paper, we extend the possible cross-lingual sentiment analysis settings to aspect-level specific use cases. We propose a method, based on constrained SMT, to transfer opinionated units across languages by preserving their boundaries. We show that cross-language sentiment classifiers built with this method achieve comparable results to monolingual ones, and we compare different cross-lingual settings.

[1]  Pushpak Bhattacharyya,et al.  Cross-Lingual Sentiment Analysis for Indian Languages using Linked WordNets , 2012, COLING.

[2]  Min Xiao,et al.  Multi-View AdaBoost for Multilingual Subjectivity Analysis , 2012, COLING.

[3]  Philipp Koehn,et al.  An Experimental Management System , 2010, Prague Bull. Math. Linguistics.

[4]  Xiaojun Wan,et al.  Co-Training for Cross-Lingual Sentiment Classification , 2009, ACL.

[5]  Rada Mihalcea,et al.  A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources , 2008, LREC.

[6]  Verónica Pérez-Rosas,et al.  Learning Sentiment Lexicons in Spanish , 2012, LREC.

[7]  Rada Mihalcea,et al.  Multilingual Subjectivity: Are More Languages Better? , 2010, COLING.

[8]  Benno Stein,et al.  Cross-Language Text Classification Using Structural Correspondence Learning , 2010, ACL.

[9]  Anthony Rousseau,et al.  XenC: An Open-Source Tool for Data Selection in Natural Language Processing , 2013, Prague Bull. Math. Linguistics.

[10]  Rada Mihalcea,et al.  Multilingual Sentiment and Subjectivity Analysis , 2011 .

[11]  Weiping Wang,et al.  A Cross-Lingual Joint Aspect/Sentiment Model for Sentiment Analysis , 2014, CIKM.

[12]  Rada Mihalcea,et al.  Learning Multilingual Subjective Language via Cross-Lingual Projections , 2007, ACL.

[13]  Xiaojun Wan,et al.  Cross-Language Opinion Target Extraction in Review Texts , 2012, 2012 IEEE 12th International Conference on Data Mining.

[14]  Claire Cardie,et al.  Joint Bilingual Sentiment Classification with Unlabeled Parallel Corpora , 2011, ACL.

[15]  Gholamreza Haffari,et al.  The Haves and the Have-Nots: Leveraging Unlabelled Corpora for Sentiment Analysis , 2013, ACL.

[16]  Houfeng Wang,et al.  Cross-Lingual Mixture Model for Sentiment Classification , 2012, ACL.

[17]  Hinrich Schütze,et al.  Sentiment Translation through Multi-Edge Graphs , 2010, COLING.

[18]  Mykola Pechenizkiy,et al.  Cross-lingual polarity detection with machine translation , 2013, WISDOM '13.

[19]  Alexandra Balahur,et al.  Comparative experiments using supervised learning and machine translation for multilingual sentiment analysis , 2014, Comput. Speech Lang..

[20]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[21]  Kam-Fai Wong,et al.  A Mixed Model for Cross Lingual Opinion Analysis , 2013, NLPCC.

[22]  Pushpak Bhattacharyya,et al.  Lost in Translation: Viability of Machine Translation for Cross Language Sentiment Analysis , 2013, CICLing.

[23]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[24]  Steven Skiena,et al.  International Sentiment Analysis for News and Blogs , 2021, ICWSM.

[25]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[26]  Dragomir R. Radev,et al.  Identifying the Semantic Orientation of Foreign Words , 2011, ACL.

[27]  Maite Taboada,et al.  Cross-Linguistic Sentiment Analysis: From English to Spanish , 2009, RANLP.

[28]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[29]  Christopher Joseph Pal,et al.  Cross Lingual Adaptation: An Experiment on Sentiment Classifications , 2010, ACL.

[30]  William D. Lewis,et al.  Intelligent Selection of Language Model Training Data , 2010, ACL.

[31]  Josef Steinberger,et al.  Creating Sentiment Dictionaries via Triangulation , 2011, Decis. Support Syst..

[32]  Yannick Versley,et al.  Subsentential Sentiment on a Shoestring: A Crosslingual Analysis of Compositional Classification , 2015, HLT-NAACL.

[33]  Xiaolong Wang,et al.  Cross-lingual Opinion Analysis via Negative Transfer Detection , 2014, ACL.