Emotion Detection using Online Machine Learning Method and TLBO on Mixed Script

Due to rapid modernization of our societies, most people, if not all, have access to online social media and mobile communication devices. These people hail from diverse cultures and ethnicity, and interact with each other more often on these social media sites. Moreover, due to their distinct backgrounds, they all have an influence on the common language in which they communicate. Also, many users employ a myriad of shorthand, emoticons and abbreviations in their statements to reduce their effort. This calls for a means to assist in better communications through social media. In our work, we have researched on understanding the underlying emotions and sentiments of these interactions and communications. Our focus was on analyzing the conversations by Indians in the code-mix of English and Hindi languages and identifying the usage patterns of various words and parts of speech. We have categorized statements into 6 groups based on emotions and improved the model using TLBO technique and online learning algorithms. These features were integrated in our application to assist the mobile device users in quickly sort and prioritize their messages based on the emotions attached with the statements and provide much more immersive communications with their friends and family.

[1]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[2]  M Damashek,et al.  Gauging Similarity with n-Grams: Language-Independent Categorization of Text , 1995, Science.

[3]  Janyce Wiebe,et al.  Effects of Adjective Orientation and Gradability on Sentence Subjectivity , 2000, COLING.

[4]  Rakesh Chandra Balabantaray,et al.  Text normalization of code mix and sentiment analysis , 2015, 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[5]  R. Venkata Rao,et al.  Teaching-learning-based optimization: A novel method for constrained mechanical design optimization problems , 2011, Comput. Aided Des..

[6]  J. Gumperz The Sociolinguistic Significance of Conversational Code-Switching , 1977 .

[7]  Shruti Pal Mother Tongue Influence on Spoken English , 2011 .

[8]  Monojit Choudhury,et al.  Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics , 2012, LREC.

[9]  Pushpak Bhattacharyya,et al.  Transliteration involving English and Hindi languages using Syllabification Approach , 2009 .

[10]  H. Isahara,et al.  Language identification based on string kernels , 2005, IEEE International Symposium on Communications and Information Technology, 2005. ISCIT 2005..

[11]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[12]  Sharvari Govilkar,et al.  A Framework for Sentiment Analysis in Hindi using HSWN , 2015 .

[13]  M. García Spanish-English Code Switching in Slam Poetry , 2014 .

[14]  Monojit Choudhury,et al.  Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System , 2014, CodeSwitch@EMNLP.

[15]  S. Patil,et al.  Use of Google Translate in medical communication: evaluation of accuracy , 2014, BMJ : British Medical Journal.

[16]  Shana Poplack,et al.  Sometimes I’ll start a sentence in Spanish Y TERMINO EN ESPAÑOL: toward a typology of code-switching1 , 1980 .

[17]  Shlomo Argamon,et al.  Using appraisal groups for sentiment analysis , 2005, CIKM '05.

[18]  Pushpak Bhattacharyya,et al.  A Fall-back Strategy for Sentiment Analysis in Hindi: a Case Study , 2010 .

[19]  Sometimes I’ll start a sentence in Spanish y termino en espanol: toward a typology of code-switching , 2003 .

[20]  Ajai Kumar Jain,et al.  AnglaHindi: an English to Hindi machine-aided translation system , 2003, MTSUMMIT.