Semantic similarity measure for graph-based sentences

Graph-based text representation methods attempt to capture the syntactic structure and semantics of documents. As such, they are a preferred text representation approach for a wide range of problems in natural language processing, information retrieval, and text mining. In a number of these applications, it is necessary to measure the similarity between the knowledge represented in the graphs. In this paper, we present a semantic similarity measure for comparing graph-based representations of sentences. The proposed method uses a computational linguistic method to obtain syntactic information before the sentence is represented as a graph. Word synonyms are embedded in the graph representation to support semantic matching. We present our idea and initial results on the feasibility of the proposed similarity measure.
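To make the idea concrete, the following is a minimal sketch of one way such a comparison could work. It is not the method of the paper, whose details the abstract does not give. It assumes that the syntactic links of each sentence are already available as (head, dependent) word pairs, that synonyms come from a small hand-written table (the `SYNONYMS` dictionary here is hypothetical; a real system might use a lexical database such as WordNet), and that similarity is a Dice-style overlap of matching edges. All function names are illustrative.

```python
# Hypothetical toy synonym table; a real system might draw these from a
# lexical database instead of a hand-written dictionary.
SYNONYMS = {
    "car": {"car", "automobile"},
    "automobile": {"car", "automobile"},
    "buy": {"buy", "purchase"},
    "purchase": {"buy", "purchase"},
}


def synset(word):
    """Return the word together with its known synonyms."""
    return SYNONYMS.get(word, {word})


def build_graph(links):
    """Build a sentence graph from (head, dependent) word pairs.

    Each edge stores the synonym sets of both endpoints, so synonyms
    are embedded in the graph representation itself.
    """
    return [(frozenset(synset(h)), frozenset(synset(d))) for h, d in links]


def edges_match(e1, e2):
    """Two edges match when both endpoints share at least one synonym."""
    return bool(e1[0] & e2[0]) and bool(e1[1] & e2[1])


def similarity(g1, g2):
    """Dice-style score: fraction of edges in either graph that have a
    matching edge in the other graph (assumed measure, for illustration)."""
    if not g1 or not g2:
        return 0.0
    matched1 = sum(any(edges_match(e, f) for f in g2) for e in g1)
    matched2 = sum(any(edges_match(f, e) for e in g1) for f in g2)
    return (matched1 + matched2) / (len(g1) + len(g2))


# Usage: the links below are written by hand; in practice they would
# come from a syntactic parser.
s1 = build_graph([("buy", "john"), ("buy", "car")])
s2 = build_graph([("purchase", "john"), ("purchase", "automobile")])
print(similarity(s1, s2))
```

Because nodes carry synonym sets rather than bare words, the two example sentences match edge for edge even though they share only one surface word. A graded word-level similarity could replace the binary synonym test without changing the overall structure.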
