Information retrieval system based semantique and big data
Abstract: In traditional word-based information retrieval systems, a document is treated as a set of words, represented as graphs without semantics. In this paper, we focus on enriching the similarity measure with synonymy and on evaluating the performance of semantic indexing approaches on a document corpus. We also present comparisons showing that using synonymy together with the Leacock and Chodorow measure increases semantic similarity, which makes search more efficient.
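To make the idea concrete, the following is a minimal sketch of a Leacock and Chodorow style similarity combined with synonym lookup. It is an illustration under stated assumptions, not the system evaluated in the paper: the toy taxonomy `PARENT`, the synonym table `SYNONYMS`, and all function names are hypothetical. Path length and depth are counted in nodes, and the score is -log(len / (2 * D)), where len is the shortest is-a path between two concepts and D is the maximum depth of the taxonomy.

```python
import math

# Hypothetical toy is-a taxonomy (child -> parent); not taken from the paper.
PARENT = {
    "entity": None,
    "vehicle": "entity",
    "animal": "entity",
    "car": "vehicle",
    "truck": "vehicle",
    "dog": "animal",
}

# Hypothetical synonym table mapping surface words to taxonomy concepts.
SYNONYMS = {"automobile": "car", "lorry": "truck"}


def ancestors(concept):
    """Return the chain from the concept up to the root, concept first."""
    chain = []
    while concept is not None:
        chain.append(concept)
        concept = PARENT[concept]
    return chain


def max_depth():
    """Maximum depth of the taxonomy, counted in nodes."""
    return max(len(ancestors(c)) for c in PARENT)


def path_length(a, b):
    """Number of nodes on the shortest is-a path between a and b."""
    up_a = ancestors(a)
    up_b = ancestors(b)
    for i, node in enumerate(up_a):
        if node in up_b:
            # Steps up from a, steps up from b, plus the shared ancestor.
            return i + up_b.index(node) + 1
    raise ValueError("concepts share no ancestor")


def lch_similarity(word_a, word_b, use_synonyms=True):
    """Leacock-Chodorow style score, or None if a word is not in the taxonomy."""
    a = SYNONYMS.get(word_a, word_a) if use_synonyms else word_a
    b = SYNONYMS.get(word_b, word_b) if use_synonyms else word_b
    if a not in PARENT or b not in PARENT:
        return None
    return -math.log(path_length(a, b) / (2.0 * max_depth()))


if __name__ == "__main__":
    print(lch_similarity("automobile", "truck"))
    print(lch_similarity("automobile", "truck", use_synonyms=False))
```

In this sketch, a query term such as "automobile" has no score against "truck" when synonyms are ignored, because it is missing from the taxonomy, but it receives the same score as "car" once it is mapped through the synonym table. That is the kind of gain from synonymy the abstract describes, shown here only on invented data.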