Emotion detection of tweets in Indonesian language using LDA and expression symbol conversion

Twitter is one of the social networks that attract many Indonesian people because it is considered as a medium to express opinions and feelings about certain topic. Twitter popularity can be used as an efficient source of sentiment data for marketing or social studies. Social studies that can be applied to the process of Twitter analysis is emotion detection. Emotion detection has a potency to be applied in a wide range of applications, ranging from health applications, counseling, business, to community population studies. This research utilizes one of the most popular and simplest topic modeling models, that is Latent Dirichlet Allocation (LDA), as well as conversion expression symbol (emoticon/ emoji), which shows the emotion or topic in a tweet to multiply the vocabulary that represents emotion. The advantage of the LDA method proposed is that it can detect some emotions on the tweet because the detection is not rigid and is able to show the proportion of emotion in the tweet. This research compares emotional detection using LDA and conversion expression symbol with emotional detection using LDA without conversion expression symbol. The result shows that emotional detection using LDA with conversion expression symbol is better with the reached average difference of accuracy 14.096%.