Semantic Disambiguation in Automatic Semantic Annotation

To generate metadata for the Semantic Web, semantic information must be extracted from web documents. Given the massive scale of web documents, automatic semantic annotation is more feasible than manual or semi-automatic annotation. To recognize candidate named entities, a semantic dictionary is designed, and the semantic distance between entities is calculated from the semantic relevance path between them. The most complex problem in semantic annotation is semantic disambiguation, and a disambiguation method based on the shortest path and n-grams is proposed. Experiments were conducted on a news corpus, and the results show that the method is effective for automatic semantic annotation.
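The idea of shortest-path disambiguation can be illustrated with a minimal sketch. It is not the method evaluated in the experiments: the toy ontology, the entity names, the `disambiguate` function, and the use of the n-gram as a context window around the ambiguous mention are all assumptions made for illustration. Semantic distance is taken here to be the number of edges on the shortest path between two entities, and the candidate with the smallest total distance to the unambiguous entities in the window is selected.

```python
from collections import deque

# Hypothetical toy ontology: undirected relations between entity identifiers.
EDGES = [
    ("Apple_Inc", "Company"), ("Company", "Microsoft"),
    ("Apple_Inc", "iPhone"), ("Company", "Organization"),
    ("Apple_fruit", "Fruit"), ("Fruit", "Orange_fruit"),
    ("Fruit", "Food"), ("Organization", "Thing"), ("Food", "Thing"),
]

# Hypothetical semantic dictionary: surface form -> candidate entities.
DICTIONARY = {
    "Microsoft": ["Microsoft"],
    "orange": ["Orange_fruit"],
    "Apple": ["Apple_fruit", "Apple_Inc"],
}


def build_graph(edges):
    """Build an undirected adjacency map from a list of edges."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_path_length(graph, source, target):
    """Breadth-first search; returns the edge count, or None if unreachable."""
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        node, dist = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt == target:
                return dist + 1
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


def disambiguate(graph, tokens, index, dictionary, n=3, penalty=1000):
    """Choose the candidate for tokens[index] that is closest, in total
    shortest-path distance, to unambiguous entities within n-1 tokens."""
    candidates = dictionary[tokens[index]]
    lo = max(0, index - n + 1)
    hi = min(len(tokens), index + n)
    context = []
    for i in range(lo, hi):
        senses = dictionary.get(tokens[i], [])
        if i != index and len(senses) == 1:
            context.append(senses[0])

    def score(candidate):
        total = 0
        for entity in context:
            d = shortest_path_length(graph, candidate, entity)
            total += penalty if d is None else d  # assumed penalty for no path
        return total

    # With an empty context every score is 0 and the first candidate is kept.
    return min(candidates, key=score)


GRAPH = build_graph(EDGES)
print(disambiguate(GRAPH, ["Microsoft", "and", "Apple", "compete"], 2, DICTIONARY))
print(disambiguate(GRAPH, ["orange", "and", "Apple", "juice"], 2, DICTIONARY))
```

In this toy setting the first call selects `Apple_Inc` and the second selects `Apple_fruit`, because the shortest path to the neighbouring unambiguous entity is 2 edges in each case versus 6 for the competing sense. A real system would need weighted relations and a far larger dictionary.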