A Deep Learning Approach for Recognizing Textual Emotion from Bengali-English Code-Mixed Data

Emotion detection is a computational approach for finding the distinct emotion or feeling of an individual. Although Bengali is a low-resource language, the amount of Bengali-English codemixed textual data has grown significantly because of the recent widespread use of social media applications among Bengali users. Gradually, the classification of emotions in Bengali-English codemixed data has become a crucial challenge for applications in e-commerce, healthcare, suicidal attempt reduction and crime detection. Nevertheless, the lack of Bengali language processing techniques and Bengali-English dataset have made the emotion recognition more challenging. This research work offers a Deep Learning based approach for classifying emotions from Bengali-English code-mixed data into six basic categories: disgust, sadness, joy, anger, fear, and surprise. Due to the lack of required dataset, a Bengali-English code-mixed corpus consisting of 10,221 sentences is created. In order to identify the best features, this work investigates several word embedding techniques, including Word2Vec, FastText, and Keras Embedding Layer. Different types of of Machine Learning and Deep Learning based algorithms including the proposed technique using Word2Vec and BiLSTM are applied on the developed corpus. In order to find out the best technique, a comparative analysis among all the methods is demonstrated revealing that the BiLSTM with Word2Vec word embedding technique outperforms rest other models achieving the highest accuracy of 76.1%.

[1]  Iqbal H. Sarker,et al.  BEmoC: A Corpus for Identifying Emotion in Bengali Texts , 2022, SN Computer Science.

[2]  Mohammed Moshiul Hoque,et al.  Multi-class Textual Emotion Categorization using Ensemble of Convolutional and Recurrent Neural Network , 2021, SN Computer Science.

[3]  Iqbal H. Sarker,et al.  Emotion Classification in a Resource Constrained Language Using Transformer-based Approach , 2021, NAACL.

[4]  Akshita Aggarwal,et al.  Towards Emotion Recognition in Hindi-English Code-Mixed Data: A Transformer Based Approach , 2021, WASSA.

[5]  Mohamed El Bachir Menai,et al.  A survey of state-of-the-art approaches for emotion recognition in text , 2020, Knowledge and Information Systems.

[6]  Nourah Alswaidan,et al.  A survey of state-of-the-art approaches for emotion recognition in text , 2020, Knowledge and Information Systems.

[7]  Vinay Singh,et al.  Corpus Creation and Emotion Prediction for Hindi-English Code-Mixed Social Media Text , 2018, NAACL.

[8]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[9]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[10]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[11]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[12]  P. Ekman,et al.  Facial expression and emotion , 2023, Laryngo-Rhino-Otologie.

[13]  M. Hoque,et al.  CUET-NLP@TamilNLP-ACL2022: Multi-Class Textual Emotion Detection from Social Media using Transformer , 2022, DRAVIDIANLANGTECH.

[14]  MD.Asif Iqbal,et al.  BEmoD: Development of Bengali Emotion Dataset for Classifying Expressions of Emotion in Texts , 2020, ICO.

[15]  B. Premjith,et al.  Emotion Detection in Hinglish(Hindi+English) Code-Mixed Social Media Text , 2020 .