Word Sense Disambiguation Method Based on Improved Mutual Information with Wikipedia Extend

In view of word sense disambiguation shortcomings of the previous methods, they generally do not consider on word distance for computing semantic correlation of the influence of context, as well as the context is limited for ambiguous word sense disambiguation, and the use of part ambiguous context words make word senses more ambiguous. Therefore, this paper proposes the use of dependency parse tree and rules for feature word selection, then the ambiguous word is mapped to the Wikipedia pages to expand the feature words of the ambiguous word. Feature words expansion of word sense and feature words will eliminate the limitation of context words, by calculating improved mutual information between feature words of ambiguous context words senses and feature word of ambiguous word senses, then finally obtain the most suitable items of ambiguous word with context words in this sentence. Experimental results show that the proposed method compares with the previous method improves the accuracy of Chinese word sense disambiguation of 11.4%, with good scalability and practicality.