Semantic Role Labeling for Generating Template of Indonesian News Sentences

Template-based approach is commonly used in automated journalism, including for Indonesian news article generation based on hand-crafted templates. Nowadays, there is still no research in automated template generation. This paper describes a method to generate templates automatically using semantic role labeling (SRL). Since there is no public dataset for Indonesian SRL, we also create the dataset for Indonesian SRL using PropBank annotation (Palmer et al., 2005). For the SRL model, we adapt the model that was introduced by He et al. (2017). Result from the model will be saved and compared to the templates that were manually defined by Indrayani and Khodra (2018). Our 2-layer BiLSTM model achieves 0.92 F1 on token level and 0.84 on sentence level. The automatically generated templates have similar quality to the ones that Indrayani and Khodra (2018) defined.

[1]  Luke S. Zettlemoyer,et al.  Deep Semantic Role Labeling: What Works and What’s Next , 2017, ACL.

[2]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[3]  Pierre Nugues,et al.  Multilingual Semantic Role Labeling , 2009, CoNLL Shared Task.

[4]  Wei Xu,et al.  End-to-end learning of semantic role labeling using recurrent neural networks , 2015, ACL.

[5]  Jürgen Schmidhuber,et al.  Training Very Deep Networks , 2015, NIPS.

[6]  Samuel Louvan,et al.  Indosum: A New Benchmark Dataset for Indonesian Text Summarization , 2018, 2018 International Conference on Asian Language Processing (IALP).

[7]  Masayu Leylia Khodra,et al.  Data-Driven News Generation for Indonesian Municipal Election , 2018, 2018 5th International Conference on Advanced Informatics: Concept Theory and Applications (ICAICTA).

[8]  Ayu Purwarianti,et al.  An initial study of Indonesian semantic role labeling and its application on event extraction , 2016, 2016 International Conference on Asian Language Processing (IALP).

[9]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[10]  Zoubin Ghahramani,et al.  A Theoretically Grounded Application of Dropout in Recurrent Neural Networks , 2015, NIPS.