Topic-OPA: A Topic Ontology for Modeling Topics of Old Press Articles

This article introduces Topic-OPA, a general topic ontology for modeling the topics of old press articles. In Topic-OPA, topics are represented as nodes in a structure that combines two schemes: hierarchical and non-hierarchical. The hierarchical scheme is expressed by taxonomic (is-a) edges among topics. The non-hierarchical scheme is expressed by cross-references that relate different topics. The topic hierarchy is extracted from the open knowledge graph Wikidata using SPARQL queries, and a curation process is then applied to refine and enrich the results. Topic-OPA is designed to be small enough to maintain and curate while covering the most relevant topics in the domain of old press articles. An experimental use case demonstrates the utility of Topic-OPA for topic labeling of old press articles.
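The two schemes and the Wikidata extraction step can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class `TopicOntology`, the function `subclass_query`, and the topic names are hypothetical, and the query assumes the hierarchy is read from Wikidata's "subclass of" property (P279) with the `wd:` and `wdt:` prefixes predefined by the query endpoint. The root identifier is left as a placeholder.

```python
from collections import defaultdict


class TopicOntology:
    """Hypothetical container: topics as nodes, is-a edges plus cross-references."""

    def __init__(self):
        self.parents = defaultdict(set)  # topic -> broader topics (is-a edges)
        self.related = defaultdict(set)  # topic -> cross-referenced topics

    def add_is_a(self, child, parent):
        self.parents[child].add(parent)
        self.parents[parent]  # register the parent as a node as well

    def add_cross_reference(self, a, b):
        # Cross-references are stored symmetrically (an assumption of this sketch).
        self.related[a].add(b)
        self.related[b].add(a)

    def ancestors(self, topic):
        """Return every broader topic reachable through is-a edges."""
        seen, stack = set(), list(self.parents.get(topic, ()))
        while stack:
            node = stack.pop()
            if node not in seen:
                seen.add(node)
                stack.extend(self.parents.get(node, ()))
        return seen


def subclass_query(root_qid, limit=100):
    """Build a SPARQL query listing (topic, parent) subclass edges under a root.

    Assumes P279 ("subclass of") carries the hierarchy; root_qid is a placeholder.
    """
    return (
        "SELECT ?topic ?parent WHERE { "
        f"?topic wdt:P279 ?parent . ?parent wdt:P279* wd:{root_qid} . "
        f"}} LIMIT {limit}"
    )


# Hypothetical topics, for illustration only.
ontology = TopicOntology()
ontology.add_is_a("football", "sport")
ontology.add_is_a("sport", "leisure")
ontology.add_cross_reference("football", "stadium")
```

Each `(topic, parent)` row returned by such a query could be passed to `add_is_a`, while cross-references would be added during curation. Keeping the two edge types in separate maps lets the hierarchy be traversed without following non-taxonomic links.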
