A deep neural network model for speakers coreference resolution in legal texts

Abstract Coreference resolution is one of the fundamental tasks in natural language processing (NLP), and is of great significance to understand the semantics of texts. Meanwhile, resolving coreference is essential for many NLP downstream applications. Existing methods largely focus on pronouns, possessives and noun phrases resolution in the general domain, while little work is proposed for professional domains such as the legal field. Different from general texts, how to code legal texts and capture the relationship between entities in the text, and then resolve coreference is a challenging problem. For better understanding the legal text, and facilitating a series of downstream tasks in legal text mining, we propose a deep neural network model for coreference resolution in court record documents. Specifically, the pre-trained language model and bi-directional long short-term memory networks are first utilized to encode legal texts. Second, graph neural networks are applied to incorporate reference relations between entities. Finally, two distinct classifiers are used to score the candidate pairs. Results on the dataset show that our model achieves 87.53% F1 score on court record documents, outperforming neural baseline models by a large margin. Further analysis shows that the proposed method can effectively identify the reference relations between entities and model the entity dependencies.

[1]  Bo Gao,et al.  A novel intelligent classification model for breast cancer diagnosis , 2019, Inf. Process. Manag..

[2]  Christopher D. Manning,et al.  Improving Coreference Resolution by Learning Entity-Level Distributed Representations , 2016, ACL.

[3]  Yijia Liu,et al.  Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation , 2018, CoNLL.

[4]  Luke S. Zettlemoyer,et al.  Higher-Order Coreference Resolution with Coarse-to-Fine Inference , 2018, NAACL.

[5]  Hong Chen,et al.  PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution , 2018, EMNLP.

[6]  Ruihong Huang,et al.  Event Coreference Resolution by Iteratively Unfolding Inter-dependencies among Events , 2017, EMNLP.

[7]  Peng Zhou,et al.  Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme , 2017, ACL.

[8]  Massimiliano Giacalone,et al.  Big Data and forensics: An innovative approach for a predictable jurisprudence , 2018, Inf. Sci..

[9]  Serena Villata,et al.  Ontology Population and Alignment for the Legal Domain: YAGO, Wikipedia and LKIF , 2017, International Semantic Web Conference.

[10]  Serena Villata,et al.  A low-cost, high-coverage legal named entity recognizer, classifier and linker , 2017, ICAIL.

[11]  Penghua Li,et al.  Law text classification using semi-supervised convolutional neural networks , 2018, 2018 Chinese Control And Decision Conference (CCDC).

[12]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[13]  Alessandro Moschitti,et al.  A Practical Perspective on Latent Structured Prediction for Coreference Resolution , 2017, EACL.

[14]  Rakesh Nagi,et al.  An incremental graph-partitioning algorithm for entity resolution , 2019, Inf. Fusion.

[15]  Marie-Francine Moens,et al.  Binary and Multitask Classification Model for Dutch Anaphora Resolution: Die/Dat Prediction , 2020, ArXiv.

[16]  Sadao Kurohashi,et al.  Entity-Centric Joint Modeling of Japanese Coreference Resolution and Predicate Argument Structure Analysis , 2018, ACL.

[17]  Grigorios Tsoumakas,et al.  Local word vectors guiding keyphrase extraction , 2018, Inf. Process. Manag..

[18]  Christopher Dozier,et al.  Automatic Extraction and Linking of Person Names In Legal Text , 2000, RIAO.

[19]  Sophia Ananiadou,et al.  Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts , 2018, BioNLP.

[20]  Weijia Jia,et al.  Legal Judgment Prediction via Multi-Perspective Bi-Feedback Network , 2019, IJCAI.

[21]  Pushpak Bhattacharyya,et al.  Identifying Participant Mentions and Resolving Their Coreferences in Legal Court Judgements , 2018, TSD.

[22]  Min Yang,et al.  A novel approach for entity resolution in scientific documents using context graphs , 2018, Inf. Sci..

[23]  Yan Song,et al.  Knowledge-aware Pronoun Coreference Resolution , 2019, ACL.

[24]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[25]  Nelleke Oostdijk,et al.  The Construction of a 500-Million-Word Reference Corpus of Contemporary Written Dutch , 2013, Essential Speech and Language Technology for Dutch.

[26]  Claire Cardie,et al.  Identifying Anaphoric and Non-Anaphoric Noun Phrases to Improve Coreference Resolution , 2002, COLING.

[27]  Siddhartha Jonnalagadda,et al.  Coreference analysis in clinical notes: a multi-pass sieve with alternate anaphora resolution modules , 2012, J. Am. Medical Informatics Assoc..

[28]  Mari Ostendorf,et al.  Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction , 2018, EMNLP.

[29]  Dan Roth,et al.  Understanding the Value of Features for Coreference Resolution , 2008, EMNLP.

[30]  Josef van Genabith,et al.  Exploring the Use of Text Classification in the Legal Domain , 2017, ASAIL@ICAIL.

[31]  Chen Chen,et al.  Chinese Zero Pronoun Resolution with Deep Neural Networks , 2016, ACL.

[32]  Michael Strube,et al.  Latent Structures for Coreference Resolution , 2015, TACL.

[33]  Ion Androutsopoulos,et al.  Extracting contract elements , 2017, ICAIL.

[34]  Shu Zhang,et al.  Using Case Facts to Predict Penalty with Deep Learning , 2019, ICPCSEE.

[35]  Yan Song,et al.  Incorporating Context and External Knowledge for Pronoun Coreference Resolution , 2019, NAACL.

[36]  Ion Androutsopoulos,et al.  Neural Legal Judgment Prediction in English , 2019, ACL.

[37]  Hui Wang,et al.  Case Facts Analysis Method Based on Deep Learning , 2019, WISA.

[38]  Timothy Dozat,et al.  Deep Biaffine Attention for Neural Dependency Parsing , 2016, ICLR.

[39]  Xiaoyong Du,et al.  Analogical Reasoning on Chinese Morphological and Semantic Relations , 2018, ACL.

[40]  P. Santhi Thilagam,et al.  Crime base: Towards building a knowledge base for crime entities and their relationships from online news papers , 2019, Inf. Process. Manag..

[41]  Chen Lin,et al.  Towards generalizable entity-centric clinical coreference resolution , 2017, J. Biomed. Informatics.

[42]  Iris Hendrickx,et al.  Cross-Domain Dutch Coreference Resolution , 2011, RANLP.

[43]  Serena Villata,et al.  Legal NERC with ontologies, Wikipedia and curriculum learning , 2017, EACL.

[44]  Liangliang Cao,et al.  Focal Visual-Text Attention for Visual Question Answering , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[45]  Min Zhang,et al.  Chinese Zero Pronoun Resolution , 2020, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[46]  Donghong Ji,et al.  An end-to-end joint model for evidence information extraction from court record document , 2020, Inf. Process. Manag..

[47]  Rui Zhang,et al.  Neural Coreference Resolution with Deep Biaffine Attention by Joint Mention Detection and Mention Clustering , 2018, ACL.

[48]  Laura Dietz,et al.  UNH at SemEval-2019 Task 12: Toponym Resolution in Scientific Papers , 2019, SemEval@NAACL-HLT.

[49]  Akiko Aizawa,et al.  Corpus for Coreference Resolution on Scientific Papers , 2014, LREC.

[50]  Luke S. Zettlemoyer,et al.  End-to-end Neural Coreference Resolution , 2017, EMNLP.

[51]  Ion Androutsopoulos,et al.  Large-Scale Multi-Label Text Classification on EU Legislation , 2019, ACL.

[52]  Bei Yu,et al.  HClaimE: A tool for identifying health claims in health news headlines , 2019, Inf. Process. Manag..

[53]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[54]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[55]  Kaiz Merchant,et al.  NLP Based Latent Semantic Analysis for Legal Text Summarization , 2018, 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[56]  Christopher D. Manning,et al.  Entity-Centric Coreference Resolution with Model Stacking , 2015, ACL.

[57]  B. L. William Wong,et al.  An interactive human centered data science approach towards crime pattern analysis , 2019, Inf. Process. Manag..

[58]  Karima Meftouh,et al.  Machine translation for Arabic dialects (survey) , 2017, Inf. Process. Manag..

[59]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.