Borrow from rich cousin: transfer learning for emotion detection using cross lingual embedding

Abstract Performance of any natural language processing (NLP) system greatly depends on the amount of resources and tools available in a particular language or domain. Therefore, while solving any problem in low-resource setting, it is important to investigate techniques to leverage the resources and tools available in resource-rich languages. In this paper we propose an efficient technique to mitigate the problem of resource scarcity for emotion detection in Hindi by leveraging information from a resource-rich language like English. Our method follows a deep transfer learning framework which efficiently captures relevant information through the shared space of two languages, showing significantly better performance compared to the monolingual scenario that learns in the vector space of only one language. As base learning models, we use Convolution Neural Network (CNN) and Bi-Directional Long Short Term Memory (Bi-LSTM). As there are no available emotion labeled dataset for Hindi, we create a new dataset for emotion detection in disaster domain by annotating sentences of news documents with nine different classes based on Plutchikâ;;s wheel of emotions. To improve the performance of emotion classification in Hindi, we employ transfer learning to exploit the resources available in the related domains. The core of our approach lies in generating a cross-lingual word embedding representation of words in the shared embedding space. The neural networks are trained on the existing datasets, and then weights are fine-tuned following the four different transfer learning strategies for emotion classification in Hindi. We obtain a significant performance gain in our our proposed transfer learning techniques, achieving an F1-score of 0.53 (compared to 0.47)-thereby implying that knowledge from a resource-rich language can be transferred across language and domains. 1

[1]  Francisco Herrera,et al.  An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics , 2013, Inf. Sci..

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Chun Chen,et al.  DASA: Dissatisfaction-oriented Advertising based on Sentiment Analysis , 2010, Expert Syst. Appl..

[4]  Ruslan Salakhutdinov,et al.  Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks , 2016, ICLR.

[5]  Prashanth U. Nyer,et al.  The role of emotions in marketing , 1999 .

[6]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[7]  Yoshua Bengio,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[8]  Erkki Sutinen,et al.  Are They Different? Affect, Feeling, Emotion, Sentiment, and Opinion Detection in Text , 2014, IEEE Transactions on Affective Computing.

[9]  Noriko Kando,et al.  Multi-Document Summarization with Subjectivity Analysis at DUC 2005 , 2005 .

[10]  Saif Mohammad,et al.  SemEval-2018 Task 1: Affect in Tweets , 2018, *SEMEVAL.

[11]  Fredrik Johansson,et al.  Emotion classification of social media posts for estimating people’s reactions to communicated alert messages during crises , 2014, Security Informatics.

[12]  Loren Terveen,et al.  PHOAKS: a system for sharing recommendations , 1997, CACM.

[13]  Samuel L. Smith,et al.  Offline bilingual word vectors, orthogonal transformations and the inverted softmax , 2017, ICLR.

[14]  R. Plutchik Emotions : a general psychoevolutionary theory , 1984 .

[15]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[16]  Daniel Miller,et al.  The Relative Impact of Anger and Efficacy on Collective Action is Affected by Feelings of Fear , 2009 .

[17]  Quoc V. Le,et al.  Exploiting Similarities among Languages for Machine Translation , 2013, ArXiv.

[18]  Young-Bum Kim,et al.  Cross-Lingual Transfer Learning for POS Tagging without Cross-Lingual Resources , 2017, EMNLP.

[19]  C. Nass,et al.  Emotion in human-computer interaction , 2002 .

[20]  Gerard de Melo,et al.  A Helping Hand: Transfer Learning for Deep Sentiment Analysis , 2018, ACL.

[21]  Athena Vakali,et al.  Detecting variation of emotions in online activities , 2017, Expert Syst. Appl..

[22]  Elke A. Rundensteiner,et al.  Automatic emotion detection in text streams by analyzing Twitter data , 2018, International Journal of Data Science and Analytics.

[23]  Swapna Somasundaran,et al.  QA with Attitude: Exploiting Opinion Type Analysis for Improving Question Answering in On-line Discussions and the News , 2007, ICWSM.

[24]  Pushpak Bhattacharyya,et al.  A Hybrid Deep Learning Architecture for Sentiment Analysis , 2016, COLING.

[25]  Cherie D. Maestas,et al.  Cross Corpus Emotion Classification Using Survey Data , 2018 .

[26]  Hiroshi Kanayama,et al.  Multilingual Training of Crosslingual Word Embeddings , 2017, EACL.

[27]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[28]  Saif Mohammad,et al.  #Emotional Tweets , 2012, *SEMEVAL.

[29]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[30]  Athena Vakali,et al.  Sentiment analysis leveraging emotions and word embeddings , 2017 .

[31]  James N. Druckman,et al.  Emotion and the Framing of Risky Choice , 2008 .

[32]  S. Lee,et al.  A domain transferable lexicon set for Twitter sentiment analysis using a supervised machine learning approach , 2018, Expert Syst. Appl..