Word Sense Disambiguation of Semantic Documents

This paper proposes a Max-Probability Density based Clustering (MPDC) algorithm to address the problem of Word Sense Disambiguation in semantic documents. MPDC takes the WordNet-based context information of a keyword into account and selects the maximum-probability sense by measuring the density of the corresponding concept. We also run experiments on semantic documents retrieved from Swoogle and Watson, two well-known Semantic Web search engines. The results show that MPDC achieves good efficiency.
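The abstract gives no formulas, so the following is only a minimal sketch of the general idea of context-based sense selection, not the MPDC algorithm itself. It rests on several assumptions: a tiny hand-built taxonomy (`PARENT`) and sense inventory (`SENSES`) stand in for WordNet, the similarity is a simple path-length measure, and the "density" of a candidate sense is taken to be the sum of its best similarities to the context words. All names and data here are hypothetical.

```python
# Illustrative only: a toy density-style sense selector, NOT the paper's MPDC.
# Hypothetical taxonomy standing in for WordNet: child concept -> parent concept.
PARENT = {
    "finance": "entity",
    "nature": "entity",
    "bank#finance": "finance",
    "money#currency": "finance",
    "loan#debt": "finance",
    "bank#river": "nature",
    "river#stream": "nature",
    "water#liquid": "nature",
}

# Hypothetical sense inventory: word -> candidate concepts.
SENSES = {
    "bank": ["bank#finance", "bank#river"],
    "money": ["money#currency"],
    "loan": ["loan#debt"],
    "river": ["river#stream"],
    "water": ["water#liquid"],
}


def ancestors(node):
    """Return the path from node up to the root, inclusive."""
    path = [node]
    while node in PARENT:
        node = PARENT[node]
        path.append(node)
    return path


def path_length(a, b):
    """Number of edges between a and b via their lowest common ancestor."""
    depth_in_b = {n: i for i, n in enumerate(ancestors(b))}
    for i, n in enumerate(ancestors(a)):
        if n in depth_in_b:
            return i + depth_in_b[n]
    raise ValueError("concepts share no common ancestor")


def similarity(a, b):
    """Path-based similarity in (0, 1]; an assumed measure, not the paper's."""
    return 1.0 / (1.0 + path_length(a, b))


def disambiguate(word, context):
    """Score each sense of `word` by its assumed density around the context,
    normalise the scores to probabilities, and return the most probable sense."""
    known = [c for c in context if c in SENSES and c != word]
    density = {
        sense: sum(max(similarity(sense, o) for o in SENSES[c]) for c in known)
        for sense in SENSES[word]
    }
    total = sum(density.values())
    if total == 0:  # no usable context: fall back to a uniform distribution
        probs = {s: 1.0 / len(density) for s in density}
    else:
        probs = {s: d / total for s, d in density.items()}
    best = max(probs, key=probs.get)
    return best, probs


best, probs = disambiguate("bank", ["money", "loan"])
print(best, probs)
```

In this toy setup a financial context pulls "bank" toward its finance sense, while a context such as `["river", "water"]` pulls it toward the river sense. A real implementation would replace the hand-built dictionaries with WordNet and use whatever similarity and density definitions the paper specifies.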
