Identification of Rhetorical Roles of Sentences in Indian Legal Judgments

Automatically understanding the rhetorical roles of sentences in a legal case judgment is an important problem, since it can help in several downstream tasks such as summarization of legal judgments and legal search. The task is challenging because legal case documents are usually not well structured, and the rhetorical roles can be subjective, as is evident from the variation of opinions between legal experts. In this paper, we address this task for judgments from the Supreme Court of India. We label the sentences in 50 documents using multiple human annotators and perform an extensive analysis of the human-assigned labels. We also attempt automatic identification of the rhetorical roles of sentences. While prior approaches to this task used Conditional Random Fields over manually handcrafted features, we explore deep neural models, which do not require handcrafted features. Experiments show that the neural models perform much better on this task than baseline methods that use handcrafted features.
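As an illustration of how variation between annotators can be quantified, the following minimal sketch computes Cohen's kappa for two annotators who labelled the same sentences. This is an assumption for illustration only: the abstract does not state which agreement measure the analysis uses, and the function name `cohen_kappa` and the role labels ("Facts", "Argument", "Ruling") are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same sentences.

    Illustrative only: the agreement measure and the label names used
    below are assumptions, not taken from the paper.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equally long")

    n = len(labels_a)

    # Observed agreement: fraction of sentences given the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: probability that both annotators pick the same
    # label if each labels independently with their own label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for four sentences of one judgment.
annotator_1 = ["Facts", "Facts", "Argument", "Ruling"]
annotator_2 = ["Facts", "Argument", "Argument", "Ruling"]

print(round(cohen_kappa(annotator_1, annotator_2), 3))
```

A value near 1 indicates strong agreement, while a value near 0 indicates agreement no better than chance. With more than two annotators, one option is to average the pairwise scores.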
