Exploiting Non-Taxonomic Relations for Measuring Semantic Similarity and Relatedness in WordNet

Various applications in the areas of computational linguistics and artificial intelligence employ semantic similarity to solve challenging tasks, such as word sense disambiguation, text classification, information retrieval, machine translation, and document clustering. Previous work on semantic similarity followed a mono-relational approach using mostly the taxonomic relation "ISA". This paper explores the benefits of using all types of non-taxonomic relations in large linked data, such as WordNet knowledge graph, to enhance existing semantic similarity and relatedness measures. We propose a holistic poly-relational approach based on a new relation-based information content and non-taxonomic-based weighted paths to devise a comprehensive semantic similarity and relatedness measure. To demonstrate the benefits of exploiting non-taxonomic relations in a knowledge graph, we used three strategies to deploy non-taxonomic relations at different granularity levels. We conducted experiments on four well-known gold standard datasets, and the results demonstrated the robustness and scalability of the proposed semantic similarity and relatedness measure, which significantly improves existing similarity measures.

[1]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[2]  Junzhong Gu,et al.  A New Model of Information Content for Semantic Similarity in WordNet , 2008, 2008 Second International Conference on Future Generation Communication and Networking Symposia.

[3]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[4]  Muhammad Jawad Hussain,et al.  An approach for measuring semantic similarity between Wikipedia concepts using multiple inheritances , 2020, Inf. Process. Manag..

[5]  Rongrong Li,et al.  Semantic Similarity Computation in Knowledge Graphs: Comparisons and Improvements , 2019, 2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW).

[6]  David Sánchez,et al.  Ontology-based semantic similarity: A new feature-based approach , 2012, Expert Syst. Appl..

[7]  Laith Mohammad Abualigah,et al.  Hybrid clustering analysis using improved krill herd algorithm , 2018, Applied Intelligence.

[8]  Tina Eliassi-Rad,et al.  Using Ontological Information to Accelerate Path-Finding in Large Semantic Graphs: A Probabilistic Approach , 2005 .

[9]  Laith Mohammad Abualigah,et al.  Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering , 2017, The Journal of Supercomputing.

[10]  Matthew Fisher,et al.  Semantic Web Programming , 2009 .

[11]  Edmond Chow,et al.  Knowledge Representation Issues in Semantic Graphs for Relationship Detection , 2005, AAAI Spring Symposium: AI Technologies for Homeland Security.

[12]  Palash Goyal,et al.  Pykg2vec: A Python Library for Knowledge Graph Embedding , 2019, J. Mach. Learn. Res..

[13]  Saravanan Muthaiyah,et al.  Improving Gloss Vector Semantic Relatedness Measure by Integrating Pointwise Mutual Information: Optimizing Second-Order Co-occurrence Vectors Computed from Biomedical Corpus and UMLS , 2013, 2013 International Conference on Informatics and Creative Multimedia.

[14]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[15]  Felix Hill,et al.  SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation , 2014, CL.

[16]  David Sánchez,et al.  Ontology-based information content computation , 2011, Knowl. Based Syst..

[17]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[18]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[19]  Jens Lehmann,et al.  DBpedia - A crystallization point for the Web of Data , 2009, J. Web Semant..

[20]  Junzhong Gu,et al.  New model of semantic similarity measuring in wordnet , 2008, 2008 3rd International Conference on Intelligent System and Knowledge Engineering.

[21]  Abdelmajid Ben Hamadou,et al.  Ontology-based approach for measuring semantic similarity , 2014, Eng. Appl. Artif. Intell..

[22]  Giuseppe Pirrò,et al.  A semantic similarity metric combining features and intrinsic information content , 2009, Data Knowl. Eng..

[23]  Eneko Agirre,et al.  A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art , 2019, Eng. Appl. Artif. Intell..

[24]  Ganggao Zhu,et al.  Computing Semantic Similarity of Concepts in Knowledge Graphs , 2017, IEEE Transactions on Knowledge and Data Engineering.

[25]  Wolfram Wöß,et al.  Towards a Definition of Knowledge Graphs , 2016, SEMANTiCS.

[26]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[27]  De Xu,et al.  Concept vector for semantic similarity and relatedness based on WordNet structure , 2012, J. Syst. Softw..

[28]  Vijay Mago,et al.  Evolution of Semantic Similarity—A Survey , 2020, ACM Comput. Surv..

[29]  Yong Tang,et al.  Feature-based approaches to semantic similarity assessment of concepts using Wikipedia , 2015, Inf. Process. Manag..

[30]  Laith Mohammad Abualigah,et al.  Feature Selection and Enhanced Krill Herd Algorithm for Text Document Clustering , 2018, Studies in Computational Intelligence.

[31]  Ana M. García-Serrano,et al.  A new family of information content models with an experimental survey on WordNet , 2015, Knowl. Based Syst..

[32]  Wei Lu,et al.  A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet , 2017, Journal of Intelligent Information Systems.

[33]  Carlos Angel Iglesias,et al.  Sematch: Semantic similarity framework for Knowledge Graphs , 2017, Knowl. Based Syst..

[34]  Roberto Navigli,et al.  Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities , 2016, Artif. Intell..

[35]  J. Akilandeswari,et al.  A Survey on Semantic Similarity Measure , 2014 .

[36]  William A. Perez,et al.  Semantic encoding and the stimulus prefix effect , 1981 .

[37]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[38]  Evgeniy Gabrilovich,et al.  Large-scale learning of word relatedness with constraints , 2012, KDD.

[39]  Junzhong Gu,et al.  A New Model of Information Content Based on Concept ’ s Topology for Measuring Semantic Similarity in WordNet , 2012 .

[40]  Ehud Rivlin,et al.  Placing search in context: the concept revisited , 2002, TOIS.

[41]  Shouqian Sun,et al.  An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet , 2018, Wireless Personal Communications.

[42]  Yuanyuan Cai,et al.  Measuring distance-based semantic similarity using meronymy and hyponymy relations , 2018, Neural Computing and Applications.

[43]  Ahmad Abdollahzadeh Barforoush,et al.  A new word sense similarity measure in wordnet , 2008, 2008 International Multiconference on Computer Science and Information Technology.