Topic Summarization of Microblog Document in Bahasa Indonesia using the Phrase Reinforcement Algorithm

Abstract Microblog topic summarization is a part of the challenges to automatically find a topic of any group of microblog posts. This study focused on summarizing Twitter data in Bahasa Indonesia. The main algorithm used in this research is The Phrase Reinforcement Algorithm. This algorithm summarized a group of tweets discussing similar topics using a semi-abstractive approach. As a result of some initial experiments during this study, there are some variations applied in order to obtain summary with a better quality. The evaluation is conducted using human assessment and more than 60% agreed that the summaries have the good grammatical, readability, and informative quality

[1]  Jugal K. Kalita,et al.  Experiments in Microblog Summarization , 2010, 2010 IEEE Second International Conference on Social Computing.

[2]  Wei Xu,et al.  A Preliminary Study of Tweet Summarization using Information Extraction , 2013 .

[3]  Jeffrey Nichols,et al.  Summarizing sporting events using twitter , 2012, IUI '12.

[4]  Andrei Olariu Clustering to Improve Microblog Stream Summarization , 2012, 2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[5]  Jugal K. Kalita,et al.  Summarizing Microblogs Automatically , 2010, NAACL.

[6]  Ophir Frieder,et al.  Information Retrieval: Algorithms and Heuristics , 1998 .