Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

Sentiment classification typically relies on a large amount of labeled data. In practice, the availability of labels is highly imbalanced among different languages, e.g., more English texts are labeled than texts in any other languages, which creates a considerable inequality in the quality of related information services received by users speaking different languages. To tackle this problem, cross-lingual sentiment classification approaches aim to transfer knowledge learned from one language that has abundant labeled examples (i.e., the source language, usually English) to another language with fewer labels (i.e., the target language). The source and the target languages are usually bridged through off-the-shelf machine translation tools. Through such a channel, cross-language sentiment patterns can be successfully learned from English and transferred into the target languages. This approach, however, often fails to capture sentiment knowledge specific to the target language, and thus compromises the accuracy of the downstream classification task. In this paper, we employ emojis, which are widely available in many languages, as a new channel to learn both the cross-language and the language-specific sentiment patterns. We propose a novel representation learning method that uses emoji prediction as an instrument to learn respective sentiment-aware representations for each language. The learned representations are then integrated to facilitate cross-lingual sentiment classification. The proposed method demonstrates state-of-the-art performance on benchmark datasets, which is sustained even when sentiment labels are scarce.

[1]  M. Csíkszentmihályi,et al.  Validity and Reliability of the Experience‐Sampling Method , 1987, The Journal of nervous and mental disease.

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[4]  Sepp Hochreiter,et al.  The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[5]  Rich Caruana,et al.  Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.

[6]  Tommi S. Jaakkola,et al.  Fast optimal leaf ordering for hierarchical clustering , 2001, ISMB.

[7]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[8]  Michael Gamon,et al.  Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis , 2004, COLING.

[9]  Taku Kudo,et al.  MeCab : Yet Another Part-of-Speech and Morphological Analyzer , 2005 .

[10]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[11]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[12]  Qiang Yang,et al.  Can chinese web pages be classified with english data source? , 2008, WWW.

[13]  Chun Chen,et al.  DASA: Dissatisfaction-oriented Advertising based on Sentiment Analysis , 2010, Expert Syst. Appl..

[14]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[15]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[16]  Natalie S. Glance,et al.  Star Quality: Aggregating Reviews to Rank Products and Merchants , 2010, ICWSM.

[17]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[18]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[19]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[20]  Benno Stein,et al.  Cross-Language Text Classification Using Structural Correspondence Learning , 2010, ACL.

[21]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[22]  Kenji Araki,et al.  Automatically Annotating A Five-Billion-Word Corpus of Japanese Blogs for Affect and Sentiment Analysis , 2012, WASSA@ACL.

[23]  Ke Xu,et al.  MoodLens: an emoticon-based sentiment analysis system for chinese tweets , 2012, KDD.

[24]  Mark S. Ackerman,et al.  The way i talk to you: sentiment expression in an organizational context , 2012, CHI.

[25]  Kiraz Candan Herdem Reactions: Twitter based mobile application for awareness of friends' emotions , 2012, UbiComp.

[26]  Minyi Guo,et al.  Emoticon Smoothed Language Models for Twitter Sentiment Analysis , 2012, AAAI.

[27]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[28]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[29]  Xiaotie Deng,et al.  Exploiting Topic based Twitter Sentiment for Stock Prediction , 2013, ACL.

[30]  Benjamin Schrauwen,et al.  Training and analyzing deep recurrent neural networks , 2013, NIPS 2013.

[31]  Min Xiao,et al.  Semi-Supervised Representation Learning for Cross-Lingual Text Classification , 2013, EMNLP.

[32]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.

[33]  Benjamin Schrauwen,et al.  Training and Analysing Deep Recurrent Neural Networks , 2013, NIPS.

[34]  Derek Ruths,et al.  Gender Inference of Twitter Users in Non-English Contexts , 2013, EMNLP.

[35]  Phil Blunsom,et al.  Multilingual Models for Compositional Distributed Semantics , 2014, ACL.

[36]  Hugo Larochelle,et al.  An Autoencoder Approach to Learning Bilingual Word Representations , 2014, NIPS.

[37]  Amy Voida,et al.  Towards personal stress informatics: comparing minimally invasive techniques for measuring daily stress in the wild , 2014, PervasiveHealth.

[38]  Manaal Faruqui,et al.  Improving Vector Space Word Representations Using Multilingual Correlation , 2014, EACL.

[39]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[40]  Jacob Eisenstein,et al.  Emoticons vs. Emojis on Twitter: A Causal Inference Approach , 2015, ArXiv.

[41]  Marie-Francine Moens,et al.  Bilingual Word Embeddings from Non-Parallel Document-Aligned Data Applied to Bilingual Lexicon Induction , 2015, ACL.

[42]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[43]  Xuanjing Huang,et al.  Long Short-Term Memory Neural Networks for Chinese Word Segmentation , 2015, EMNLP.

[44]  Christopher D. Manning,et al.  Bilingual Word Representations with Monolingual Quality in Mind , 2015, VS@HLT-NAACL.

[45]  Long Chen,et al.  Learning Bilingual Sentiment Word Embeddings for Cross-language Sentiment Classification , 2015, ACL.

[46]  Christopher D. Manning,et al.  Learning Distributed Representations for Multilingual Text Sequences , 2015, VS@HLT-NAACL.

[47]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[48]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[49]  Ivor W. Tsang,et al.  Transfer Learning for Cross-Language Text Categorization through Active Correspondences Construction , 2016, AAAI.

[50]  Karin M. Verspoor,et al.  Findings of the 2016 Conference on Machine Translation , 2016, WMT.

[51]  Michael Rohs,et al.  EmojiZoom: emoji entry via large overview maps 😄🔍 , 2016, MobileHCI.

[52]  Henriette Cramer,et al.  Sender-intended functions of emojis in US messaging , 2016, MobileHCI.

[53]  Saif Mohammad,et al.  How Translation Alters Sentiment , 2016, J. Artif. Intell. Res..

[54]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[55]  Horacio Saggion,et al.  How Cosmopolitan Are Emojis?: Exploring Emojis Usage and Meaning over Different Languages with Distributional Semantics , 2016, ACM Multimedia.

[56]  Fabio Crestani,et al.  Like It or Not , 2016, ACM Comput. Surv..

[57]  K. Robert Lai,et al.  Dimensional Sentiment Analysis Using a Regional CNN-LSTM Model , 2016, ACL.

[58]  Li Zhao,et al.  Attention-based LSTM for Aspect-level Sentiment Classification , 2016, EMNLP.

[59]  T. Kanata JAPANESE MENTAL HEALTH CARE IN HISTORICAL CONTEXT: WHY DID JAPAN BECOME A COUNTRY WITH SO MANY PSYCHIATRIC CARE BEDS? , 2016 .

[60]  Loren G. Terveen,et al.  "Blissfully Happy" or "Ready toFight": Varying Interpretations of Emoji , 2016, ICWSM.

[61]  Xiaojun Wan,et al.  Attention-based LSTM Network for Cross-Lingual Sentiment Classification , 2016, EMNLP.

[62]  Ning Wang,et al.  Learning from the ubiquitous language: an empirical analysis of emoji usage of smartphone users , 2016, UbiComp.

[63]  Xiaojun Wan,et al.  Cross-Lingual Sentiment Classification with Bilingual Document Representation Learning , 2016, ACL.

[64]  Darja Fiser,et al.  A Global Analysis of Emoji Usage , 2016, WAC@ACL.

[65]  Channary Tauch,et al.  The roles of emojis in mobile phone notifications , 2016, UbiComp Adjunct.

[66]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[67]  Gregory D. Abowd,et al.  Inferring Mood Instability on Social Media by Leveraging Ecological Momentary Assessments , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[68]  Min Yang,et al.  Attention Based LSTM for Target Dependent Sentiment Classification , 2017, AAAI.

[69]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[70]  Hongning Wang,et al.  Clustered Model Adaption for Personalized Sentiment Analysis , 2017, WWW.

[71]  Fernando Mourão,et al.  Beyond the Stars: Towards a Novel Sentiment Rating to Evaluate Applications in Web Stores of Mobile Apps , 2017, WWW.

[72]  John A. Stankovic,et al.  Distant Emotion Recognition , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[73]  Jun Yan,et al.  Sentence-level Sentiment Classification with Weak Supervision , 2017, SIGIR.

[74]  Thomas Hofmann,et al.  Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification , 2017, WWW.

[75]  Philipp Koehn,et al.  Findings of the 2017 Conference on Machine Translation (WMT17) , 2017, WMT.

[76]  HENNING POHL,et al.  Beyond Just Text , 2017, ACM Trans. Comput. Hum. Interact..

[77]  Georgios Balikas,et al.  Multitask Learning for Fine-Grained Twitter Sentiment Analysis , 2017, SIGIR.

[78]  Jiebo Luo,et al.  Spice Up Your Chat: The Intentions and Sentiment Effects of Using Emojis , 2017, ICWSM.

[79]  Loren G. Terveen,et al.  Understanding Emoji Ambiguity in Context: The Role of Text in Emoji-Related Miscommunication , 2017, ICWSM.

[80]  Neha Kumar,et al.  Goodbye Text, Hello Emoji: Mobile Communication on WeChat in China , 2017, CHI.

[81]  Xiaoyan Zhu,et al.  Linguistically Regularized LSTM for Sentiment Classification , 2016, ACL.

[82]  Claire Cardie,et al.  MPQA Opinion Corpus , 2017 .

[83]  Qiang Chen,et al.  Modeling Language Discrepancy for Cross-Lingual Sentiment Analysis , 2017, CIKM.

[84]  Ning Wang,et al.  Untangling Emoji Popularity Through Semantic Embeddings , 2017, ICWSM.

[85]  Shuai Wang,et al.  Deep learning for sentiment analysis: A survey , 2018, WIREs Data Mining Knowl. Discov..

[86]  Gabriele Bavota,et al.  Sentiment Analysis for Software Engineering: How Far Can We Go? , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[87]  Qiao Liu,et al.  Content Attention Model for Aspect Based Sentiment Analysis , 2018, WWW.

[88]  Miki Haseyama,et al.  Sentiment-aware personalized tweet recommendation through multimodal FFM , 2018, Multimedia Tools and Applications.

[89]  Xuanzhe Liu,et al.  Through a Gender Lens: Learning Usage Patterns of Emojis from Large-Scale Android Users , 2017, WWW.

[90]  Lihua Sun,et al.  Applying uncertainty theory into the restaurant recommender system based on sentiment analysis of online Chinese reviews , 2018, World Wide Web.

[91]  Oksana Smal,et al.  POLITICAL DISCOURSE CONTENT ANALYSIS: A CRITICAL OVERVIEW OF A COMPUTERIZED TEXT ANALYSIS PROGRAM LINGUISTIC INQUIRY AND WORD COUNT (LIWC) , 2020, Naukovì zapiski Nacìonalʹnogo unìversitetu «Ostrozʹka akademìâ». Serìâ «Fìlologìâ».