论文信息 - Interpretable Charge Predictions for Criminal Cases: Learning to Generate Court Views from Fact Descriptions - 字舞流文

Interpretable Charge Predictions for Criminal Cases: Learning to Generate Court Views from Fact Descriptions

In this paper, we propose to study the problem of COURT VIEW GENeration from the fact description in a criminal case. The task aims to improve the interpretability of charge prediction systems and help automatic legal document generation. We formulate this task as a text-to-text natural language generation (NLG) problem. Sequenceto-sequence model has achieved cutting-edge performances in many NLG tasks. However, due to the non-distinctions of fact descriptions, it is hard for Seq2Seq model to generate charge-discriminative court views. In this work, we explore charge labels to tackle this issue. We propose a label-conditioned Seq2Seq model with attention for this problem, to decode court views conditioned on encoded charge labels. Experimental results show the effectiveness of our method.

Xin Jiang | Hai Ye | Zhunchen Luo | Wen-Han Chao | Xin Jiang | Wen-Han Chao | Zhunchen Luo | Hai Ye

[1] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[2] Xinya Du,et al. Learning to Ask: Neural Question Generation for Reading Comprehension , 2017, ACL.

[3] Mirella Lapata,et al. Paraphrasing Revisited with Neural Machine Translation , 2017, EACL.

[4] Mirella Lapata,et al. Learning to Generate Product Reviews from Attributes , 2017, EACL.

[5] Xiaojun Wan,et al. Recent advances in document summarization , 2017, Knowledge and Information Systems.

[6] Chao-Lin Liu,et al. Classifying Criminal Charges in Chinese for Web-Based Legal Services , 2005, APWeb.

[7] Minh-Tien Nguyen,et al. Lexical-Morphological Modeling for Legal Text Analysis , 2015, JSAI-isAI Workshops.

[8] Chao-Lin Liu,et al. Case Instance Generation and Refinement for Case-Based Criminal Summary Judgments in Chinese , 2004, J. Inf. Sci. Eng..

[9] Yi-Hung Liu,et al. Predicting associated statutes for legal problems , 2015, Inf. Process. Manag..

[10] Wang Ling,et al. Latent Predictor Networks for Code Generation , 2016, ACL.

[11] Stephen E. Robertson,et al. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[12] Regina Barzilay,et al. Rationalizing Neural Predictions , 2016, EMNLP.

[13] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[14] Dongyan Zhao,et al. Learning to Predict Charges for Criminal Cases with Legal Basis , 2017, EMNLP.

[15] Chao-Lin Liu,et al. Exploring Phrase-Based Classification of Judicial Documents for Criminal Charges in Chinese , 2006, ISMIS.

[16] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[17] Yi-Hung Liu,et al. A text mining approach to assist the general public in the retrieval of legal documents , 2013, J. Assoc. Inf. Sci. Technol..

[18] Mirella Lapata,et al. Neural Summarization by Extracting Sentences and Words , 2016, ACL.

[19] Mirella Lapata,et al. Language to Logical Form with Neural Attention , 2016, ACL.

[20] Mi-Young Kim,et al. Legal Question Answering Using Ranking SVM and Syntactic/Semantic Similarity , 2014, JSAI-isAI Workshops.

[21] Carlos Guestrin,et al. "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[22] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[23] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[24] Zachary Chase Lipton. The mythos of model interpretability , 2016, ACM Queue.

[25] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[26] Bowen Zhou,et al. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[27] Wang Ling,et al. Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems , 2017, ACL.

[28] Joelle Pineau,et al. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation , 2016, EMNLP.

[29] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[30] Shou-De Lin,et al. Exploiting Machine Learning Models for Chinese Legal Documents Labeling, Case Classification, and Sentencing Prediction , 2012, Int. J. Comput. Linguistics Chin. Lang. Process..

[31] Xiaojun Wan,et al. Abstractive Document Summarization with a Graph-Based Attentional Neural Model , 2017, ACL.

[32] Trevor Darrell,et al. Generating Visual Explanations , 2016, ECCV.

[33] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[34] Philipp Koehn,et al. Scalable Modified Kneser-Ney Language Model Estimation , 2013, ACL.

[35] Maria T. Pazienza,et al. Information Extraction , 2002, Lecture Notes in Computer Science.

[36] Emiel Krahmer,et al. Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation , 2017, J. Artif. Intell. Res..

[37] V. Balakista Reddy,et al. Analyzing the Extraction of Relevant Legal Judgments using Paragraph-level and Citation Information , 2016 .

[38] Alexander M. Rush,et al. Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.

[39] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[40] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.