Applying a Convolutional Neural Network to Legal Question Answering

Our legal question answering system combines legal information retrieval and textual entailment, and we describe a legal question answering system that exploits a deep convolutional neural network. We have evaluated our system using the training/test data from the competition on legal information extraction/entailment (COLIEE). The competition focuses on the legal information processing related to answering yes/no questions from Japanese legal bar exams, and it consists of three phases: ad-hoc legal information retrieval, textual entailment, and a learning model-driven combination of the two phases. Phase 1 requires the identification of Japan civil law articles relevant to a legal bar exam query. For that phase, we have implemented a combined TF-IDF and Ranking SVM information retrieval component. Phase 2 requires the system to answer “Yes” or “No” to previously unseen queries, by comparing extracted meanings of queries with relevant articles. Our training of an entailment model focuses on features based on word embeddings, syntactic similarities and identification of negation/antonym relations. We augment our textual entailment component with a convolutional neural network with dropout regularization and Rectified Linear Units. To our knowledge, our study is the first to adapt deep learning for textual entailment. Experimental evaluation demonstrates the effectiveness of the convolutional neural network and dropout regularization. The results show that our deep learning-based method outperforms our baseline SVM-based supervised model and K-means clustering.

[1]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[2]  Mi-Young Kim,et al.  Resolving Ambiguity in Inter-chunk Dependency Parsing , 2001, NLPRS.

[3]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[4]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[5]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[6]  Alessandro Moschitti,et al.  Learning textual entailment from examples , 2006 .

[7]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[8]  Tara N. Sainath,et al.  Improving deep neural networks for LVCSR using rectified linear units and dropout , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Christopher Meek,et al.  Semantic Parsing for Single-Relation Question Answering , 2014, ACL.

[10]  Lei Yu,et al.  Deep Learning for Answer Sentence Selection , 2014, ArXiv.

[11]  Natheer K. Gharaibeh,et al.  Development of Yes/No Arabic Question Answering System , 2013, ArXiv.

[12]  Mi-Young Kim,et al.  Legal Question Answering Using Ranking SVM and Syntactic/Semantic Similarity , 2014, JSAI-isAI Workshops.

[13]  Li Deng,et al.  A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[15]  W. Bruce Croft,et al.  Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2013 .

[16]  W. Bruce Croft,et al.  Feature-Based Selection of Dependency Paths in Ad Hoc Information Retrieval , 2013, ACL.

[17]  Stefan Harmeling An Extensible Probabilistic Transformation-based Approach to the Third Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[18]  Rodney D. Nielsen,et al.  Toward Dependency Path based Entailment , 2006 .

[19]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[20]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[21]  Phil Blunsom,et al.  Multilingual Models for Compositional Distributed Semantics , 2014, ACL.

[22]  Emiel Krahmer,et al.  Dependency-based paraphrasing for recognizing textual entailment , 2007, ACL-PASCAL@ACL.

[23]  B. Magnini,et al.  Tree Edit Distance for Recognizing Textual Entailment : Estimating the Cost of Insertion , 2006 .