Model of text representation based on concept
暂无分享,去创建一个
The information processing of text is advancing towards semantic direction,but nowadays the dominating model of text representation,which is called the Vector Space Model uses a single word to be the characteristic item.It neglects the lexical relation between words,thereby leading to a low precision of text information processing due to the fact that synonymy and polysemy exist in large numbers in natural languages.This paper uses the techniques and results of natural language processing,and introduces concept and distance of concept into the Vector Space Model.An improved model of text representation is then built based on concept as a characteristic item of the text from the perspective of semantics and concept.Proved by experiments,this method can resolve the synonymous and polysemantic problems commendably,improve the precision and recall to a great extent.