Automatic Summarization of Legal Decisions using Iterative Masking of Predictive Sentences

We report on a pilot experiment in automatic, extractive summarization of legal cases concerning Post-traumatic Stress Disorder from the US Board of Veterans' Appeals. We hypothesize that length-constrained extractive summaries benefit from choosing among sentences that are predictive for the case outcome. We develop a novel train-attribute-mask pipeline using a CNN classifier to iteratively select predictive sentences from the case, which measurably improves prediction accuracy on partially masked decisions. We then select a subset for the summary through type classification, maximum marginal relevance, and a summarization template. We use ROUGE metrics and a qualitative survey to evaluate generated summaries along with expert-extracted and expert-drafted summaries. We show that sentence predictiveness does not reliably cover all decision-relevant aspects of a case, illustrate that lexical overlap metrics are not well suited for evaluating legal summaries, and suggest that future work should focus on case-aspect coverage.

[1]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[2]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[3]  Vern R. Walker,et al.  Semantic types for computational legal reasoning: propositional connectives and sentence roles in the veterans' claims dataset , 2017, ICAIL.

[4]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[5]  Claire Grover,et al.  Extractive summarisation of legal texts , 2006, Artificial Intelligence and Law.

[6]  Kevin D. Ashley,et al.  Sentence Boundary Detection in Adjudicatory Decisions in the United States , 2017, TAL.

[7]  Guy Lapalme,et al.  Legal Text Summarization by Exploration of the Thematic Structure and Argumentative Roles , 2004 .

[8]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[9]  Claire Grover,et al.  Automatic summarisation of legal documents , 2003, ICAIL.

[10]  Elena Lloret,et al.  The role of statistical and semantic features in single-document extractive summarization , 2013, Artif. Intell. Res..

[11]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[12]  Karl Branting,et al.  Inducing Predictive Models for Decision Support in Administrative Adjudication , 2017, AICOL.

[13]  Jiawei Han,et al.  Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions , 2010, COLING.

[14]  Frank Schilder,et al.  Evaluating a summarizer for legal text with a large text collection , 2006 .

[15]  Nan Hua,et al.  Universal Sentence Encoder for English , 2018, EMNLP.

[16]  Ankur Taly,et al.  Axiomatic Attribution for Deep Networks , 2017, ICML.

[17]  Iryna Gurevych,et al.  Identifying Argumentative Discourse Structures in Persuasive Essays , 2014, EMNLP.

[18]  Mi-Young Kim,et al.  Summarization of Legal Texts with High Cohesion and Automatic Compression Rate , 2012, JSAI-isAI Workshops.

[19]  Guy Lapalme,et al.  Supervised Machine Learning for Summarizing Legal Documents , 2010, Canadian Conference on AI.

[20]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[21]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[22]  Claire Grover,et al.  Summarising Legal Texts: Sentential Tense and Argumentative Roles , 2003, HLT-NAACL 2003.

[23]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[24]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.