The Improved Ontology kNN Algorithm and its Application

With the advances of the Web, more and more people, especially business people, use emails to communicate with each other. Hence, how to deal with business emails is becoming more and more important for decision makers, because among these emails, there hides valuable information such as the customer's complaints about a product or the interests of customers to a product. These are important information for a manager to propose marketing policies. In this paper, we develop an improved kNN algorithm - fkNN (fuzzy kNN) algorithm based on ontology ideology to classify the emails. After classifying the emails into different classes, we can mine knowledge more easily based on the classified emails. Therefore, the classification effect is very important for mining knowledge further. Fortunately, our improved algorithm behaves much better than other algorithms in classification performance for our email datasets and other datasets

[1]  Haibin Zhu,et al.  WebCom Miner - a system of trends analysis for company products , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[2]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[3]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[4]  Harry Chen,et al.  ITTALKS: A Case Study in the Semantic Web and DAML , 2001, SWWS.

[5]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[6]  Hussein A. Abbass,et al.  Learning text classifier using the domain concept hierarchy , 2002, IEEE 2002 International Conference on Communications, Circuits and Systems and West Sino Expositions.

[7]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[8]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[9]  Yiming Yang,et al.  An Evaluation of Statistical Approaches to Text Categorization , 1999, Information Retrieval.

[10]  Vijayan Sugumaran,et al.  Ontologies for conceptual modeling: their creation, use, and management , 2002, Data Knowl. Eng..

[11]  Songbo Tan,et al.  Neighbor-weighted K-nearest neighbor for unbalanced text corpus , 2005, Expert Syst. Appl..