Annotating Entity and Causal Relationships on Japanese Vehicle Recall Information

A vehicle recall system is a process of recall-ing and repairing vehicles with defective de-signs or potential for accidents and failures. The recall document concisely explains the circumstances and causes of product defects. This paper presents two types of annotations on public vehicle recall reports, part entities and their relations, and causality. We annotated 6,394 car-recall text documents. Named entity and relation annotation suggests a relationship between the elements of an auto-mobile, and causality annotation indicates the cause of a malfunction. The entity and relation annotation and causality annotation allow the system to automatically extract knowledge in the automotive design domain. Subsequently, we present the experimental results for named entity recognition and relation extraction and causality extraction of our annotated corpus to verify the feasibility of building a system for extracting part information and causality. Fi-nally, the experimental results show that employing named entity and relation information as the external knowledge improves causality extraction.

[1]  S. Waqar Jaffry,et al.  Named Entity Recognition and Relation Extraction , 2021, ACM Comput. Surv..

[2]  Danqi Chen,et al.  A Frustratingly Easy Approach for Joint Entity and Relation Extraction , 2020, ArXiv.

[3]  Naoaki Okazaki,et al.  Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations , 2020, Journal of Natural Language Processing.

[4]  Jaime G. Carbonell,et al.  The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations , 2017, LAW@ACL.

[5]  Nathanael Chambers,et al.  CaTeRS: Causal and Temporal Relation Scheme for Semantic Annotation of Event Structures , 2016, EVENTS@HLT-NAACL.

[6]  Nathanael Chambers,et al.  A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories , 2016, NAACL.

[7]  Jaime G. Carbonell,et al.  Annotating Causal Language Using Corpus Lexicography of Constructions , 2015, LAW@NAACL-HLT.

[8]  Makoto Miwa,et al.  Modeling Joint Entity and Relation Extraction with Table Representation , 2014, EMNLP.

[9]  Paramita Mirza,et al.  Annotating Causality in the TempEval-3 Corpus , 2014, EACL 2014.

[10]  Sampo Pyysalo,et al.  BioCause: Annotating and analysing causality in the biomedical domain , 2013, BMC Bioinformatics.

[11]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[12]  W. Chapman,et al.  What can natural language processing do for clinical decision support? , 2009, J. Biomed. Informatics.

[13]  Dan Roth,et al.  Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.

[14]  Preslav Nakov,et al.  SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations Between Pairs of Nominals , 2009, SEW@NAACL-HLT.

[15]  Preslav Nakov,et al.  SemEval-2007 Task 04: Classification of Semantic Relations between Nominals , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[16]  Jari Björne,et al.  BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[17]  Nancy Chinchor,et al.  Overview of MUC-7 , 1998, MUC.

[18]  Livio Robaldo,et al.  The Penn Discourse Treebank 2.0 Annotation Manual , 2007 .

[19]  Taku Kudo,et al.  MeCab : Yet Another Part-of-Speech and Morphological Analyzer , 2005 .

[20]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.