Emotion Detection of Tweets in Indonesian Language using Non-Negative Matrix Factorization

Emotion detection is an application that is widely used in social media for industrial environment, health, and security problems. Twitter is ashort text messageknown as tweet. Based on content and purposes, the tweet can describes as information about a user"s emotion. Emotion detection by means oftweet, is a challenging problem because only a few features can be extracted. Getting features related to emotion is important at the first phase of extraction, so the appropriate features such as a hashtag, emoji, emoticon, and adjective terms are needed. We propose a new method for analyzing the linkages among features and reducedsemantically using Non- Negative Matrix Factorization (NMF). The dataset is taken from a Twitter application using Indonesian language with normalization of informal terms in advance. There are 764 tweets in corpus which have five emotions, i.e. happy (senang), angry (marah), fear (takut), sad (sedih), and surprise(terkejut). Then, the percentage of user"s emotion is computed by k- Nearest Neighbor(kNN) approach. Our proposed model achieves the problem of emotion detectionwhich is proved by the result near ground truth.

[1]  Saif Mohammad,et al.  #Emotional Tweets , 2012, *SEMEVAL.

[2]  Mitsuru Ishizuka,et al.  Textual Affect Sensing for Sociable and Expressive Online Communication , 2007, ACII.

[3]  Arifin,et al.  CLASSIFICATION OF EMOTIONS IN INDONESIAN TEXTSUSING K-NN METHOD , 2012 .

[4]  Xiaohui Yan,et al.  Learning Topics in Short Texts by Non-negative Matrix Factorization on Term Correlation Matrix , 2013, SDM.

[5]  Amit P. Sheth,et al.  Harnessing Twitter "Big Data" for Automatic Emotion Identification , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[6]  Matthew Purver,et al.  Predicting Emotion Labels for Chinese Microblog Texts , 2012, SDAD@ECML/PKDD.

[8]  Stuart Adam Battersby,et al.  Experimenting with Distant Supervision for Emotion Classification , 2012, EACL.

[9]  Agus Zainal Arifin,et al.  Topic Identification of Indonesian Language News Article Based on Theme Query , 2013 .

[10]  Hugh E. Williams,et al.  Stemming Indonesian , 2005, ACSC.

[11]  Henning Titi Ciptaningtyas,et al.  ENHANCED CONFIX STRIPPING STEMMER AND ANTS ALGORITHM FOR CLASSIFYING NEWS DOCUMENT IN INDONESIAN LANGUAGE , 2009 .

[12]  Muhammad Mahbubur Rahman Mining Social Data to Extract Intellectual Knowledge , 2012 .

[13]  Nancy Ide,et al.  Distant Supervision for Emotion Classification with Discrete Binary Values , 2013, CICLing.

[14]  Soo-Young Lee,et al.  Non-negative Matrix Factorization Based Text Mining: Feature Extraction and Classification , 2006, ICONIP.

[15]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..