A Practical Approach to Chinese Emergency Event Extraction using BiLSTM-CRF

For enhancing situation awareness in the Internet Era, it is of great significance to automatically extract emergency events from online news reports. However, existing approaches to this task are mainly rule- or pattern-based, and hence suffer from poor generalization. Though recent researches on event extraction (EE) have explored deep neural models (DNM) and achieve impressing results, they are mostly tested using the benchmark dataset ACE2005, which contains no emergency events, and not well suited for practical applications as take as input the entity labels of candidate arguments provided by that dataset. Also, these methods ignore the inter-dependency between triggers and arguments, which could be useful for EE. Hence, this article proposes a DNM for Chinese emergency EE, which uses BiLSTM (bidirectional Long Short Term Memory), pretrains domain-specific word embeddings, and adopts CRF (Conditional Random Field) to capture the interplay between triggers and arguments. Extensive experiments, conducted on the Chinese Emergency Corpus, prove the effectiveness of the proposed model, and show that it outperforms other state-of-the-art methods substantially.

[1]  Chen Chen,et al.  Joint Modeling for Chinese Event Extraction with Rich Linguistic Features , 2012, COLING.

[2]  Roland Vollgraf,et al.  Contextual String Embeddings for Sequence Labeling , 2018, COLING.

[3]  Aravind Srinivasan,et al.  'Beating the news' with EMBERS: forecasting civil unrest using open source indicators , 2014, KDD.

[4]  Gary King,et al.  An Automated Information Extraction Tool for International Conflict Data with Performance as Good as Human Coders: A Rare Events Evaluation Design , 2003, International Organization.

[5]  Jun Zhao,et al.  Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks , 2015, ACL.

[6]  Xiang Ren,et al.  Empower Sequence Labeling with Task-Aware Neural Language Model , 2017, AAAI.

[7]  Lifu Huang,et al.  Zero-Shot Transfer Learning for Event Extraction , 2017, ACL.

[8]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[9]  Liu Zongtian,et al.  A Text Event Elements Extraction Method Based on Event Ontology , 2016 .

[10]  Xiaoyong Du,et al.  Analogical Reasoning on Chinese Morphological and Semantic Relations , 2018, ACL.

[11]  Jakub Piskorski,et al.  Online News Event Extraction for Global Crisis Surveillance , 2011, Trans. Comput. Collect. Intell..

[12]  Jiang De-liang Research on extraction of emergency event information based on rules matching , 2010 .

[13]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[14]  Ralph Grishman,et al.  Using Document Level Cross-Event Inference to Improve Event Extraction , 2010, ACL.

[15]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[16]  Heng Ji,et al.  Joint Event Extraction via Structured Prediction with Global Features , 2013, ACL.

[17]  Xiao Liu,et al.  Jointly Multiple Events Extraction via Attention-based Graph Information Aggregation , 2018, EMNLP.

[18]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[19]  Jakub Piskorski,et al.  Real-Time News Event Extraction for Global Crisis Monitoring , 2008, NLDB.

[20]  Ralph Grishman,et al.  Real-time event extraction for infectious disease outbreaks , 2002 .

[21]  Ralph Grishman,et al.  Joint Event Extraction via Recurrent Neural Networks , 2016, NAACL.

[22]  Wei Xu,et al.  Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[23]  Jun Zhao,et al.  How to Generate a Good Word Embedding , 2015, IEEE Intelligent Systems.

[24]  Chandra Bhagavatula,et al.  Semi-supervised sequence tagging with bidirectional language models , 2017, ACL.