Scale-Free Distribution in Chinese Semantic Field Network: A Main Cause of Using the Shortest Path Length for Representing Semantic Distance Between Terms

In large-scale network, using the weighted shortest path length to evaluate the relatedness between two terms is factually infeasible because of the actual time and space consumption, despite the fact that the related classic algorithm runs at complexity of \( o\left( {n^{3} } \right) \). However, in many natural language processing tasks, what we need to do is merely obtain the terms which are most related to a given term rather than obtain relatedness between every pair of terms, which makes it possible for using shortest path length between two terms within a large-scale complex network to evaluate semantic relatedness between two terms. Furthermore, one of the semantic field network’s important properties—scale-free distribution of node degree makes it much more feasible to use the shortest path length to evaluate semantic distance between two terms.

[1]  Ted Pedersen,et al.  Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts , 2006 .

[2]  Pedro M. Domingos,et al.  Learning to map between ontologies on the semantic web , 2002, WWW '02.

[3]  Qun Liu,et al.  Semantic computation in a Chinese Question-Answering system , 2002, Journal of Computer Science and Technology.

[4]  Graeme Hirst,et al.  Distributional Measures as Proxies for Semantic Relatedness , 2012, ArXiv.

[5]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[6]  Ferdinand de Saussure Course in General Linguistics , 1916 .

[7]  Eneko Agirre,et al.  A Proposal for Word Sense Disambiguation using Conceptual Distance , 1995, ArXiv.

[8]  Ted Pedersen,et al.  Extended Gloss Overlaps as a Measure of Semantic Relatedness , 2003, IJCAI.

[9]  Robert L. Mercer,et al.  Word-Sense Disambiguation Using Statistical Methods , 1991, ACL.

[10]  Ido Dagan,et al.  Similarity-Based Models of Word Cooccurrence Probabilities , 1998, Machine Learning.

[11]  SeungJin Lim,et al.  A Graph Modeling of Semantic Similarity between Words , 2007 .

[12]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[13]  Thad Hughes,et al.  Lexical Semantic Relatedness with Random Graph Walks , 2007, EMNLP.

[14]  David J. Weir,et al.  Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity , 2005, CL.