Knowledge Graphs and Semantic Web

Several annotation models have been proposed to enable a multilingual Semantic Web. Such models hone in on the word and its morphology and assume the language tag and URI comes from external resources. These resources, such as ISO 639 and Glottolog, have limited coverage of the world’s languages and have a very limited thesaurus-like structure at best, which hampers language annotation, hence constraining research in Digital Humanities and other fields. To resolve this ‘outsourced’ task of the current models, we developed a model for representing information about languages, the Model for Language Annotation (MoLA), such that basic language information can be recorded consistently and therewith queried and analyzed as well. This includes the various types of languages, families, and the relations among them. MoLA is formalized in OWL so that it can integrate with Linguistic Linked Data resources. Sufficient coverage of MoLA is demonstrated with the use case of French.

[1]  Markus Krötzsch,et al.  Reifying RDF: What Works Well With Wikidata? , 2015, SSWS@ISWC.

[2]  Pierre Nugues,et al.  Entity Extraction: From Unstructured Text to DBpedia RDF triples , 2012, WoLE@ISWC.

[3]  Ivan Lopez-Arevalo,et al.  Information extraction meets the Semantic Web: A survey , 2020, Semantic Web.

[4]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[5]  Sören Auer,et al.  Linked Open Data -- Creating Knowledge Out of Interlinked Data: Results of the LOD2 Project , 2014 .

[6]  Jens Lehmann,et al.  Introduction to Linked Data and Its Lifecycle on the Web , 2013, Reasoning Web.

[7]  Isabelle Augenstein,et al.  LODifier: Generating Linked Data from Unstructured Text , 2012, ESWC.

[8]  Nicholas Ross Milton,et al.  Knowledge Acquisition in Practice: A Step-by-step Guide , 2007 .

[9]  Isabelle Augenstein,et al.  Distantly supervised Web relation extraction for knowledge base population , 2016, Semantic Web.

[10]  Dunja Mladenic,et al.  Automatically Annotating Text with Linked Open Data , 2011, LDOW.

[11]  Douglas C. Schmidt,et al.  Model-Driven Engineering , 2006 .

[12]  Harald Sack,et al.  Enabled Generalized Vector Space Model to Improve Document Retrieval , 2015, NLP-DBPEDIA@ISWC.

[13]  Ryutaro Ichise,et al.  An Automatic Knowledge Graph Creation Framework from Natural Language Text , 2018, IEICE Trans. Inf. Syst..

[14]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[15]  Paula Severi,et al.  A Description Logic for Unifying Different Points of View , 2019, KGSWC.

[16]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[17]  Luciano Del Corro,et al.  ClausIE: clause-based open information extraction , 2013, WWW.

[18]  Justus J. Randolph Free-Marginal Multirater Kappa (multirater K[free]): An Alternative to Fleiss' Fixed-Marginal Multirater Kappa. , 2005 .

[19]  Heiner Stuckenschmidt,et al.  Enriching Structured Knowledge with Open Information , 2015, WWW.

[20]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[21]  Amal Zouaq,et al.  Collective Disambiguation and Semantic Annotation for Entity Linking and Typing , 2016, SemWebEval@ESWC.

[22]  Hugo Gonçalo Oliveira,et al.  Comparing the Performance of Different NLP Toolkits in Formal and Social Media Text , 2016, SLATE.

[23]  Paul T. Groth,et al.  A Semantic Web Primer. - 3rd ed. , 2012, CoopIS 2012.

[24]  Ana B. Rios-Alvarado,et al.  A Strategy for the Integration of Named Entity Extraction and Linking Results , 2018, IWSW.

[25]  Jos de Bruijn,et al.  Enabling Semantic Web Services , 2007 .

[26]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[27]  Diego Reforgiato Recupero,et al.  Semantic Web Machine Reading with FRED , 2017, Semantic Web.