Classification of Semantic Documents Based on WordNet

There are a lot benefits to enable intelligent agent understanding the information from semantic web. It enhances the efficiency of information usage and at the same time, suffices the need of users. Semantic documents contain adequate semantic information which helps understanding. However, discrepancy between ontology which is an interpreter of semantic document prevents the share of knowledge. In this paper, we proposed a uniform representation for the content, which include concepts and relations, of semantic documents based on WordNet. First, disambiguation is preceded within the key words in a document for the purpose of mapping them to concepts. Then we present the whole document in the form of concept graph that Levenshtein Distance could be applied for making a classification of documents. We have empirical result that this methodology makes a promising raise in accuracy.

[1]  Hong-Gee Kim,et al.  Exploiting noun phrases and semantic relationships for text document clustering , 2009, Inf. Sci..

[2]  Susan Gauch,et al.  Document similarity based on concept tree distance , 2008, Hypertext.

[3]  Li Zhanhuai,et al.  A Concept-Relation Vector Model Based Method for Web Document Retrieval , 2008, 2008 International Symposium on Knowledge Acquisition and Modeling.

[4]  Chao Zeng,et al.  A New Approach to Email Classification Using Concept Vector Space Model , 2008, 2008 Second International Conference on Future Generation Communication and Networking Symposia.

[5]  Amy J. C. Trappey,et al.  A Fuzzy Ontological Knowledge Document Clustering Methodology , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[6]  Maurizio Vincini,et al.  TUCUXI: The InTelligent Hunter Agent for Concept Understanding and LeXical ChaIning , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[7]  Steffen Staab,et al.  WordNet improves text document clustering , 2003, SIGIR 2003.

[8]  James Z. Wang,et al.  Concept Forest: A New Ontology-assisted Text Document Similarity Measurement Method , 2007 .

[9]  Steffen Staab,et al.  Towards the self-annotating web , 2004, WWW '04.

[10]  Dik Lun Lee,et al.  Document Ranking and the Vector-Space Model , 1997, IEEE Softw..