TULSI: an NLP system for extracting legal modificatory provisions

In this work we present the TULSI system (so named after Turin University Legal Semantic Interpreter), a system to produce automatic annotations of normative documents through the extraction of modificatory provisions. TULSI relies on a deep syntactic analysis and a shallow semantic interpreter that are illustrated in detail. We report the results of an experimental evaluation of the system and discuss them, also suggesting future directions for further improvement.

[1]  Stanley L. Paulson,et al.  Normativity and Norms: Critical Perspectives on Kelsenian Themes , 1999 .

[2]  Peter Jackson,et al.  Natural Language Processing of Online Applications , 2002 .

[3]  Cristina Bosco,et al.  Evalita'09 Parsing Task: comparing dependency parsers and treebanks , 2009 .

[4]  Tom M. van Engers,et al.  Automated Detection of Reference Structures in Law , 2006, JURIX.

[5]  Simonetta Montemagni,et al.  NLP-based metadata extraction for legal text consolidation , 2009, ICAIL.

[6]  Robert C. Berwick,et al.  Principle-Based Parsing: Computation and Psycholinguistics , 1991 .

[7]  Vincenzo Lombardo,et al.  Transformed Subcategorization Frames in Chunk Parsing , 2002, LREC.

[8]  Radboud Winkels,et al.  Machine Learning versus Knowledge Based Classification of Legal Texts , 2010, JURIX.

[9]  Marco Baroni,et al.  Morph-it! A free corpus-based morphological resource for the Italian language , 2005 .

[10]  Steven Abney,et al.  Parsing By Chunks , 1991 .

[11]  Monica Palmirani Legislative Change Management with Akoma-Ntoso , 2011 .

[12]  Bart Verheij,et al.  About the logical relations between cases and rules , 2008, JURIX.

[13]  Vito Pirrelli,et al.  Semantic Mark-up of Italian Legal Texts Through NLP-based Techniques , 2004, LREC.

[14]  Cristina Bosco,et al.  A treebank-based study on the influence of Italian word order on parsing performance , 2012, LREC.

[15]  Monica Palmirani,et al.  Model Regularity of Legal Language in Active Modifications , 2009, AICOL Workshops.

[16]  Yasuhiro Ogawa,et al.  Automatic Consolidation of Japanese Statutes Based on Formalization of Amendment Sentences , 2007, JSAI.

[17]  Luca Dini,et al.  For the Automated Mark-Up of Italian Legislative Texts in XML , 2002 .

[18]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[19]  Pedro M. Domingos The Role of Occam's Razor in Knowledge Discovery , 1999, Data Mining and Knowledge Discovery.

[20]  Monica Palmirani,et al.  Legal text analysis of the modification provisions: a pattern oriented approach , 2009, ICAIL.

[21]  Paulo Quaresma,et al.  A Methodology to Create Legal Ontologies in a Logic Programming Based Web Information Retrieval System , 2004, Artificial Intelligence and Law.

[22]  Claudia Soria,et al.  Automatic semantics extraction in law documents , 2005, ICAIL '05.

[23]  L. Thorne McCarty Deep semantic interpretations of legal texts , 2007, ICAIL.

[24]  Andrew Y. Ng,et al.  Robust Textual Inference via Graph Matching , 2005, HLT.

[25]  Ronald Leenes,et al.  Legal knowledge and information systems : JURIX 2000 : the thirteenth annual conference , 2000 .

[26]  Dan Roth,et al.  An Inference Model for Semantic Entailment in Natural Language , 2005, IJCAI.

[27]  Enrico Francesconi,et al.  Standards and tools for legislative drafting and legal document web publication , 2003 .

[28]  Peter Jackson,et al.  Natural language processing for online applications : text retrieval, extraction and categorization , 2002 .

[29]  Douglas E. Appelt,et al.  Introduction to Information Extraction Technology , 1999, IJCAI 1999.

[30]  Monica Palmirani,et al.  An XML Editor for Legal Information Management , 2003, EGOV.

[31]  Timothy Arnold-Moore Automatically processing amendments to legislation , 1995, ICAIL '95.

[32]  Monica Palmirani,et al.  Time Model for Managing the Dynamic of Normative System , 2006, EGOV.

[33]  Monica Palmirani,et al.  Processing normative references on the basis of natural language questions , 2004, Proceedings. 15th International Workshop on Database and Expert Systems Applications, 2004..

[34]  Montemagni,et al.  Automatic extraction of semantics in law documents , 2007 .

[35]  Timothy Arnold-Moore Automatic generation of amendment legislation , 1997, ICAIL '97.

[36]  Adam Wyner,et al.  Towards Annotating and Extracting Textual Legal Case Elements , 2010 .