A Bootstrapping Approach for Semi-Automated Legal Knowledge Extraction and Enrichment

In this paper, we propose a bootstrapping approach for semiautomated legal knowledge extraction. The approach is characterized by the use of a reference legal ontology that is progressively enriched with relevant concepts and related terms extracted from a corpus of legal documents (i.e., Court Decision documents). Supervised, multi-label classification techniques and black-box model explanation techniques are the core components of the bootstrapping approach i) to associate CD documents with appropriate concepts in the ontology and ii) to choose the terms that are decisive for determining the association between a document and a certain ontology concept, respectively. The goal of the proposed approach is to reduce the manual involvement of legal experts as much as possible and to improve the accuracy of document classification, by progressively enriching the term sets associated with ontology concepts. Preliminary experimental results are finally provided to show the contribution of the proposed approach on a corpus of real Court Decision documents.

[1]  Franco Turini,et al.  A Survey of Methods for Explaining Black Box Models , 2018, ACM Comput. Surv..

[2]  Daniela Tiscornia,et al.  The Lois Project: Lexical Ontologies for Legal Information Sharing , 2007 .

[3]  Rupali Sunil Wagh,et al.  Knowledge Discovery from Legal Documents Dataset using Text Mining Techniques , 2013 .

[4]  Silvana Castano,et al.  Crime Knowledge Extraction: an Ontology-driven Approach for Detecting Abstract Terms in Case Law Decisions , 2019, ICAIL.

[5]  Kevin D. Ashley,et al.  Extracting Case Law Sentences for Argumentation about the Meaning of Statutory Terms , 2016, ArgMining@ACL.

[6]  Pompeu Casanovas,et al.  Law and the Semantic Web: Legal Ontologies, Methodologies, Legal Information Retrieval, and Applications , 2005, Law and the Semantic Web.

[7]  Adeline Nazarenko,et al.  Legal NLP Introduction , 2017, TAL.

[8]  Chen Wang,et al.  Introducing LUIMA: an experiment in legal conceptual retrieval of vaccine injury decisions using a UIMA type system and tools , 2015, ICAIL.

[9]  Pompeu Casanovas,et al.  Approaches to Legal Ontologies: Theories, Domains, Methodologies , 2013 .

[10]  Antoine Isaac,et al.  SKOS Simple Knowledge Organization System Primer , 2009 .

[11]  Kevin D. Ashley Book review: Artificial Intelligence and Legal Analytics: New Tools for Law Practice in the Digital Age , 2015 .

[12]  Kevin D. Ashley,et al.  Mining Information from Statutory Texts in Multi-Jurisdictional Settings , 2014, JURIX.

[13]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[14]  Monica Palmirani Legislative Change Management with Akoma-Ntoso , 2011 .

[15]  Kevin D. Ashley,et al.  Facilitating case comparison using value judgments and intermediate legal concepts , 2011, ICAIL.

[16]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.