Learning Fine-Grained Fact-Article Correspondence in Legal Cases

Automatically recommending relevant law articles to a given legal case has attracted much attention as it can greatly release human labor from searching over the large database of laws. However, current researches only support coarse-grained recommendation where all relevant articles are predicted as a whole without explaining which specific fact each article is relevant with. Since one case can be formed of many supporting facts, traversing over them to verify the correctness of recommendation results can be time-consuming. We believe that learning fine-grained correspondence between each single fact and law articles is crucial for an accurate and trustworthy AI system. With this motivation, we perform a pioneering study and create a corpus with manually annotated fact-article correspondences. We treat the learning as a text matching task and propose a multi-level matching network to address it. To help the model better digest the content of law articles, we parse articles in form of premise-conclusion pairs with random forest. Experiments show that the parsed form yielded better performance and the resulting model surpassed other popular text matching baselines. Furthermore, we compare with previous researches and find that establishing the fine-grained fact-article correspondences can improve the recommendation accuracy by a large margin. Our best system reaches an F1 score of 96.3%, making it of great potential for practical use. It can also significantly boost the downstream task of legal decision prediction, increasing the F1 score by up to 12.7%. Code and dataset are available at https://github.com/gjdnju/MLMN.

[1]  Hongying Zan,et al.  Charge Prediction with Legal Attention , 2019, NLPCC.

[2]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[3]  Xueqi Cheng,et al.  Text Matching as Image Recognition , 2016, AAAI.

[4]  Larry P. Heck,et al.  Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[5]  Xiaoyan Wang,et al.  Distinguish Confusing Law Articles for Legal Judgment Prediction , 2020, ACL.

[6]  Bin Luo,et al.  Information Extraction from Chinese Judgment Documents , 2017, 2017 14th Web Information Systems and Applications Conference (WISA).

[7]  Xiang Ji,et al.  MatchZoo: A Learning, Practicing, and Developing System for Neural Text Matching , 2019, SIGIR.

[8]  Zhiyuan Liu,et al.  Few-Shot Charge Prediction with Discriminative Legal Attributes , 2018, COLING.

[9]  Yang Zhao,et al.  A comprehensive study: Sentence compression with linguistic knowledge-enhanced gated neural network , 2018, Data Knowl. Eng..

[10]  Yann LeCun,et al.  Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..

[11]  Rich Caruana,et al.  Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.

[12]  Zhiyuan Liu,et al.  Legal Judgment Prediction via Topological Learning , 2018, EMNLP.

[13]  Jie Zhou,et al.  Diversifying Dialogue Generation with Non-Conversational Text , 2020, ACL.

[14]  Nikolaos Aletras,et al.  Predicting judicial decisions of the European Court of Human Rights: a Natural Language Processing perspective , 2016, PeerJ Comput. Sci..

[15]  Gerard de Melo,et al.  Data Augmentation for Multiclass Utterance Classification – A Systematic Study , 2020, COLING.

[16]  Xueqi Cheng,et al.  A Deep Architecture for Semantic Matching with Multiple Positional Sentence Representations , 2015, AAAI.

[17]  Dietrich Klakow,et al.  NEXUS Network: Connecting the Preceding and the Following in Dialogue Generation , 2018, EMNLP.

[18]  Maria Medvedeva,et al.  Judicial decisions of the European Court of Human Rights: looking into the crystall ball , 2018 .

[19]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[20]  Yun Chen,et al.  Dialogue Generation With GAN , 2018, AAAI.

[21]  Shou-De Lin,et al.  Exploiting Machine Learning Models for Chinese Legal Documents Labeling, Case Classification, and Sentencing Prediction , 2012, Int. J. Comput. Linguistics Chin. Lang. Process..

[22]  Feng Ji,et al.  Simple and Effective Text Matching with Richer Alignment Features , 2019, ACL.

[23]  Ting Liu,et al.  Attention-over-Attention Neural Networks for Reading Comprehension , 2016, ACL.

[24]  Zhen-Hua Ling,et al.  Neural Natural Language Inference Models Enhanced with External Knowledge , 2017, ACL.

[25]  Dongyan Zhao,et al.  Learning to Predict Charges for Criminal Cases with Legal Basis , 2017, EMNLP.

[26]  Pengfei Wang,et al.  Hierarchical Matching Network for Crime Classification , 2019, SIGIR.

[27]  Chao-Lin Liu,et al.  Classifying Criminal Charges in Chinese for Web-Based Legal Services , 2005, APWeb.

[28]  W. Bruce Croft,et al.  A Deep Relevance Matching Model for Ad-hoc Retrieval , 2016, CIKM.

[29]  Weijia Jia,et al.  Legal Judgment Prediction via Multi-Perspective Bi-Feedback Network , 2019, IJCAI.

[30]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[31]  Dietrich Klakow,et al.  Improving Latent Alignment in Text Summarization by Generalizing the Pointer Generator , 2019, EMNLP.

[32]  Zhoujun Li,et al.  Knowledge Enhanced Hybrid Neural Network for Text Matching , 2018, AAAI.

[33]  Ion Androutsopoulos,et al.  Neural Legal Judgment Prediction in English , 2019, ACL.

[34]  Hang Li,et al.  Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[35]  Xiaoyu Shen,et al.  DART: A Lightweight Quality-Suggestive Data-to-Text Annotation Tool , 2020, COLING.

[36]  Michael Backes,et al.  Simulating the Large-Scale Erosion of Genomic Privacy Over Time , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[37]  Dietrich Klakow,et al.  Estimation of Gap Between Current Language Models and Human Performance , 2017, INTERSPEECH.

[38]  Xiaoyu Shen,et al.  Neural Data-to-Text Generation with LM-based Text Augmentation , 2021, EACL.

[39]  Xiaoyu Shen,et al.  Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training , 2020, ArXiv.

[40]  Yang Zhao,et al.  Gated Neural Network for Sentence Compression Using Linguistic Knowledge , 2017, NLDB.

[41]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[42]  Jie Zhou,et al.  MovieChats: Chat like Humans in a Closed Domain , 2020, EMNLP.

[43]  Chao-Lin Liu,et al.  Exploring Phrase-Based Classification of Judicial Documents for Criminal Charges in Chinese , 2006, ISMIS.

[44]  Yang Zhao,et al.  Unsupervised Rewriter for Multi-Sentence Compression , 2019, ACL.

[45]  Jan M. Smits,et al.  Elgar Encyclopedia of Comparative Law , 2008 .

[46]  Pengfei Wang,et al.  Modeling Dynamic Pairwise Attention for Crime Classification over Legal Articles , 2018, SIGIR.

[47]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[48]  Xiaolong Li,et al.  EasyAug: An Automatic Textual Data Augmentation Platform for Classification Tasks , 2020, WWW.

[49]  Kentaro Inui,et al.  Select and Attend: Towards Controllable Content Selection in Text Generation , 2019, EMNLP.

[50]  Ting Liu,et al.  Gaussian Transformer: A Lightweight Approach for Natural Language Inference , 2019, AAAI.

[51]  Dietrich Klakow,et al.  Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence , 2020, ACL.