Named entity recognition on Indonesian Twitter posts using long short-term memory networks

The task of Named-Entity Recognition (NER) can support the higher-level tasks such as question answering, text summarization, and information retrieval. This work views NER on Indonesian Twitter posts as a sequence labeling problem using supervised machine learning approach. The architecture used is Long Short-Term Memory Networks (LSTMs), with word embedding and POS tag as the model features. As the result, our model can give a performance with an F1 score of 77.08%.

[1]  Wei Xu,et al.  Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[2]  Ralph Grishman,et al.  Message Understanding Conference- 6: A Brief History , 1996, COLING.

[3]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[4]  Mirna Adriani,et al.  Named entity recognition on Indonesian microblog messages , 2016, 2016 International Conference on Asian Language Processing (IALP).

[5]  George R. Krupka,et al.  IsoQuest Inc.: Description of the NetOwl™ Extractor System as Used for MUC-7 , 1998, MUC.

[6]  Martin Jansche,et al.  Information Extraction from Voicemail Transcripts , 2002, EMNLP.

[7]  Suresh Pabboju,et al.  Sentiment Analysis on Twitter Using Streaming API , 2017, 2017 IEEE 7th International Advance Computing Conference (IACC).

[8]  Ameya Prabhu,et al.  Towards Deep Learning in Hindi NER: An approach to tackle the Labelled Data Sparsity , 2016, ICON.

[9]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[10]  Yutaka Matsuo,et al.  Tweet Analysis for Real-Time Event Detection and Earthquake Reporting System Development , 2013, IEEE Transactions on Knowledge and Data Engineering.

[11]  Wei Xu,et al.  End-to-end learning of semantic role labeling using recurrent neural networks , 2015, ACL.

[12]  Christopher D. Manning,et al.  Nested Named Entity Recognition , 2009, EMNLP.

[13]  Ming Zhou,et al.  Recognizing Named Entities in Tweets , 2011, ACL.

[14]  Sampo Pyysalo,et al.  Attending to Characters in Neural Sequence Labeling Models , 2016, COLING.