Ten Ways of Leveraging Ontologies for Rapid Natural Language Processing Customization for Multiple Use Cases in Disjoint Domains

With the ever-growing adoption of AI technologies by large enterprises, purely data-driven approaches have dominated the field in recent years. For a single use case, the development process looks simple: agree on an annotation schema, label the data, and train the models. As the number of use cases and their complexity increase, development teams face issues with collective governance of the models and with the scalability and reusability of data and models. These issues are widely addressed on the engineering side, but much less so on the knowledge side. Ontologies are a well-researched approach for capturing knowledge and can be used to augment a data-driven methodology. In this paper, we discuss 10 ways of leveraging ontologies for Natural Language Processing (NLP) and its applications. We use ontologies for rapid customization of an NLP pipeline, and ontology-related standards to power a rule engine and to provide a standard output format. We also discuss various use cases for the medical, enterprise, financial, legal, and security domains, centered around three NLP-based applications: semantic search, question answering, and natural language querying.
