Extraction of Person Entities Affiliated with Telkom University using Long Short-Term Memory (LSTM) on Related News Articles

News text is text that contains information about an incident that is submitted in writing or not. In the news text there is a lot of unstructured information and makes it difficult for people to know what entities are contained in the news text. To overcome these problems, information extraction is carried out which is useful for adapting information entities in the form of structured text. In this study, information extraction was carried out using the Long Short-Term Memory (LSTM) method in a news data related to Telkom University. Therefore, in this study, the development of an information extraction system was carried out by focusing on the Person entity using the LSTM method. The test results obtained an F1 Score of 81% with a tuning parameter 256 word embedding size, 32 batch, 150 epochs, 0.1 dropout, 150 units, and 1 layer on LSTM.

[1]  I. Budi,et al.  Combination of LSTM and CNN for Article-Level Propaganda Detection in News Articles , 2020, 2020 Fifth International Conference on Informatics and Computing (ICIC).

[2]  Lin Zhuo,et al.  Chinese Data Extraction and Named Entity Recognition , 2020, 2020 5th IEEE International Conference on Big Data Analytics (ICBDA).

[3]  Bernie S. Fabito,et al.  Named-Entity Recognition for Disaster Related Filipino News Articles , 2018, TENCON 2018 - 2018 IEEE Region 10 Conference.

[4]  Ajib Susanto,et al.  Indonesian news classification based on NaBaNA , 2017, 2017 International Seminar on Application for Technology of Information and Communication (iSemantic).

[5]  Zainab Abu Bakar,et al.  Information extraction: Evaluating named entity recognition from classical Malay documents , 2016, 2016 Third International Conference on Information Retrieval and Knowledge Management (CAMP).

[6]  Kai Chen,et al.  A LSTM-based method for stock returns prediction: A case study of China stock market , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[7]  Driss Aboutajdine,et al.  Amazighe Named Entity Recognition using a A rule based approach , 2014, 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA).

[8]  Dan Roth,et al.  Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.

[9]  R. Greiner,et al.  Comparing Bayesian Network Classifiers , 1999, UAI.

[10]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[11]  Derwin Suhartono,et al.  Named-Entity Recognition for Indonesian Language using Bidirectional LSTM-CNNs , 2018 .

[12]  Hermann Ney,et al.  LSTM Neural Networks for Language Modeling , 2012, INTERSPEECH.