Concept Hierarchy Extraction from Legal Literature

Due to the ever-increasing amount of legal regulations, it became an interest of scholars to find ways of capturing domain-relevant knowledge and facilitate the navigation in legal text corpora. Furthermore, the contextual nature of legislation requires enhanced semantic capabilities to identify relevant regulations for specific user needs. This work aims for collecting concept hierarchies from German literature in the legal domain which are then integrated into a knowledge base with multiple clusters, allowing for different perspectives and efficient lookups. Having references to regulations in the leaves of the concept tree and higher levels with an increasingly abstract context, the resulting hierarchies provide the basis for creating legal domain knowledge in German law. Starting with rule-based annotation, we cluster extracted references, given their context features derived from tables of contents and reasons for citing from various textbook formats. We study the expressiveness of the obtained reference context features. Since different authors have their own notion of hierarchy given by the table of contents, we propose a heterogeneous lightweight ontology allowing for the coexistence of similar, yet diverse concept hierarchies to dynamically determine the best fit for a user in a semisupervised setting. This approach is novel, since state-of-the-art ontologies are conventionally modeled under full integration and in a top-down manner, often not accounting for perspectives in knowledge representation.

[1]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.

[2]  Benjamin Piwowarski,et al.  Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis , 2005, EWMF/KDO.

[3]  M. Cugmas,et al.  On comparing partitions , 2015 .

[4]  Steffen Staab,et al.  Clustering Concept Hierarchies from Text , 2004, LREC.

[5]  Nuanwan Soonthornphisaj,et al.  ATOB algorithm: an automatic ontology construction for Thai legal sentences retrieval , 2012, J. Inf. Sci..

[6]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[7]  Livio Robaldo,et al.  The European Taxonomy Syllabus: A multi-lingual, multi-level ontology framework to untangle the web of European legal terminology , 2017, Appl. Ontology.

[8]  Angelo Di Iorio,et al.  Integration of legal datasets: from meta-model to implementation , 2013, IIWAS '13.

[9]  Bo Zhao,et al.  PTR: Phrase-Based Topical Ranking for Automatic Keyphrase Extraction in Scientific Publications , 2016, ICONIP.

[10]  Frank van Harmelen,et al.  C-OWL: Contextualizing Ontologies , 2003, SEMWEB.

[11]  Radboud Winkels,et al.  Towards a Legal Recommender System , 2014, JURIX.

[12]  Marcie Hamilton Limitations , 2020, Terrorist Minds.

[13]  J. Meigs,et al.  WHO Technical Report , 1954, The Yale Journal of Biology and Medicine.

[14]  Erich Schweighofer,et al.  Legal Query Expansion using Ontologies and Relevance Feedback , 2007, LOAIT.

[15]  Paul Zhang,et al.  Semantics-based legal citation network , 2007, ICAIL.

[16]  J. Program Chair-Tsujii Proceedings of the 16th conference on Computational linguistics - Volume 2 , 1996 .

[17]  Mark J. Embrechts,et al.  On the Use of the Adjusted Rand Index as a Metric for Evaluating Supervised Classification , 2009, ICANN.

[18]  Christian Floerkemeier,et al.  Recognizing Products: A Per-exemplar Multi-label Image Classification Approach , 2014, ECCV.

[19]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[20]  Florian Matthes,et al.  Rule-based Information Extraction: Advantages, Limitations, and Perspectives , 2018 .

[21]  Andreas Nürnberger,et al.  An Experimental Comparison of Similarity Adaptation Approaches , 2011, Adaptive Multimedia Retrieval.

[22]  Derek Greene,et al.  Stability of topic modeling via matrix factorization , 2017, Expert Syst. Appl..

[23]  Livio Robaldo,et al.  Linking legal open data: breaking the accessibility and language barrier in european legislation and case law , 2015, ICAIL.

[24]  Rinke Hoekstra,et al.  The LKIF Core Ontology of Basic Legal Concepts , 2007, LOAIT.

[25]  Korhan Günel,et al.  Extracting learning concepts from educational texts in intelligent tutoring systems automatically , 2010, Expert Syst. Appl..

[26]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[27]  Simonetta Montemagni,et al.  Integrating a Bottom-Up and Top-Down Methodology for Building Semantic Resources for the Multilingual Legal Domain , 2010, Semantic Processing of Legal Texts.

[28]  Pepijn R. S. Visser,et al.  Heterogeneous Ontology Structures for Distributed Architectures , 1998 .

[29]  Florian Matthes,et al.  Differentiation and Empirical Analysis of Reference Types in Legal Documents , 2016, JURIX.

[30]  Carlos Bobed,et al.  The AIS Project: Boosting Information Extraction from Legal Documents by using Ontologies , 2016, ICAART.

[31]  D. Steinley Journal of Classification , 2004, Vegetatio.

[32]  C. MacLellan,et al.  TRESTLE: A Model of Concept Formation in Structured Domains , 2016 .

[33]  Huang-Cheng Kuo,et al.  Building a Concept Hierarchy by Hierarchical Clustering with Join/Merge Decision , 2006, JCIS.

[34]  Fabio Vitali,et al.  Towards a country-independent data format: the Akoma Ntoso experience , 2007 .

[35]  Anette Hulth,et al.  Improved Automatic Keyword Extraction Given More Linguistic Knowledge , 2003, EMNLP.

[36]  S. Asharaf,et al.  Unsupervised Concept Hierarchy Learning: A Topic Modeling Guided Approach , 2016 .

[37]  Douglas H. Fisher,et al.  Knowledge Acquisition Via Incremental Conceptual Clustering , 1987, Machine Learning.

[38]  K. Karteeka Pavan,et al.  An Automatic Clustering Technique for Optimal Clusters , 2011, ArXiv.

[39]  Paul Buitelaar,et al.  Automatic Taxonomy Generation - A Use-Case in the Legal Domain , 2017, LCT.

[40]  Radboud Winkels,et al.  Proceedings of the 11th international conference on Artificial intelligence and law , 2007 .

[41]  Benjamin Bräutigam,et al.  Concept Hierarchy Extraction from Textbooks , 2015, DocEng.