Corpus for Emotion Detection on Roman Urdu

Language assets, like corpora, are essential for different natural language processing tasks. There are many useful applications of emotion analysis of text such as dialog systems, smart agents, mental disorder clinical diagnoses. In this research, an emotion labeled corpus for Roman Urdu is presented for evaluation of emotions in short text. We collected a sizable corpus with 10,000 manually annotated sentences in Roman Scripted Urdu to facilitate the development and evaluation of emotion detection systems for Roman Urdu. This corpus is the first of its kind to be created for the Roman Urdu.

[1]  Cecilia Ovesdotter Alm,et al.  Emotions from Text: Machine Learning for Text-based Emotion Prediction , 2005, HLT.

[2]  Fabio Previdi,et al.  Unsupervised learning of fundamental emotional states via word embeddings , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[3]  Mirza O. Beg,et al.  Domain Specific Emotion Lexicon Expansion , 2018, 2018 14th International Conference on Emerging Technologies (ICET).

[4]  Hongfei Lin,et al.  Construction and Application of Chinese Emotional Corpus , 2012, CLSW.

[5]  Shingo Kuroiwa,et al.  Japanese Emotion Corpus Analysis and its Usefor Automatic Emotion Word Identification , 2008, Eng. Lett..

[6]  Liang Yang,et al.  The Identification of the Emotionality of Metaphorical Expressions Based on a Manually Annotated Chinese Corpus , 2018, IEEE Access.

[7]  Adil Alpkocak,et al.  Emotion Extraction from Turkish Text , 2014, 2014 European Network Intelligence Conference.

[8]  Kazuki Sato,et al.  Estimation of Emotion Type and Intensity in Japanese Tweets Using Multi-task Deep Learning , 2019, AINA Workshops.

[9]  Vinay Singh,et al.  Corpus Creation and Emotion Prediction for Hindi-English Code-Mixed Social Media Text , 2018, NAACL.

[10]  Jeremy Barnes,et al.  Annotation, Modelling and Analysis of Fine-Grained Emotions on a Stance and Sentiment Detection Corpus , 2017, WASSA@EMNLP.

[11]  Stan Szpakowicz,et al.  Identifying Expressions of Emotion in Text , 2007, TSD.

[12]  Erkki Sutinen,et al.  Are They Different? Affect, Feeling, Emotion, Sentiment, and Opinion Detection in Text , 2014, IEEE Transactions on Affective Computing.

[13]  Ming Li,et al.  Automatic emotional spoken language text corpus construction from written dialogs in fictions , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[14]  Waqas Anwar,et al.  A Survey of Automatic Urdu Language Processing , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[15]  Daryl Essam,et al.  Discriminative Feature Spamming Technique for Roman Urdu Sentiment Analysis , 2019, IEEE Access.

[16]  Mary Jane C. Samonte,et al.  Emotion detection in blog posts using keyword spotting and semantic analysis , 2017, ICCIP.

[17]  Yves Bestgen,et al.  CECL at SemEval-2019 Task 3: Using Surface Learning for Detecting Emotion in Textual Conversations , 2019, *SEMEVAL.

[18]  Qi Gao,et al.  Annotation and Detection of Emotion in Text-based Dialogue Systems with CNN , 2017, ArXiv.

[19]  Mohamed H. Haggag,et al.  A Hybrid Model for Emotion Detection from Text , 2017, Int. J. Inf. Retr. Res..

[20]  Rafael A. Calvo,et al.  Natural language processing in mental health applications using non-clinical texts† , 2017, Natural Language Engineering.

[21]  Changqin Quan,et al.  Construction of a Blog Emotion Corpus for Chinese Emotional Expression Analysis , 2009, EMNLP.

[22]  Mohammad Abid Khan,et al.  Urdu Sentiment Analysis Using Supervised Machine Learning Approach , 2018, Int. J. Pattern Recognit. Artif. Intell..