An Architecture of Active Learning SVMs with Relevance Feedback for Classifying E-mail

In this paper, we have proposed an architecture of active learning SVMs with relevance feedback (RF)for classifying e-mail. This architecture combines both active learning strategies where instead of using a randomly selected training set, the learner has access to a pool of unlabeled instances and can request the labels of some number of them and relevance feedback where if any mail misclassified then the next set of support vectors will be different from the present set otherwise the next set will not change. Our proposed architecture will ensure that a legitimate e-mail will not be dropped in the event of overflowing mailbox. The proposed architecture also exhibits dynamic updating characteristics making life as difficult for the spammer as possible.

[1]  Yixin Zhong,et al.  Statistical learning theory and state of the art in SVM , 2003, The Second IEEE International Conference on Cognitive Informatics, 2003. Proceedings..

[2]  Shu-Xia Lu,et al.  A comparison among four SVM classification methods: LSVM, NLSVM, SSVM and NSVM , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[3]  Tom Downs,et al.  Exact Simplification of Support Vector Solutions , 2002, J. Mach. Learn. Res..

[4]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[5]  William W. Cohen Learning Rules that Classify E-Mail , 1996 .

[6]  Harris Drucker,et al.  Support vector machines for spam categorization , 1999, IEEE Trans. Neural Networks.

[7]  David C. Gibbon,et al.  Relevance Feedback using Support Vector Machines , 2001, ICML.

[8]  Huang Houkuan,et al.  An architecture of active learning SVMs for spam , 2002, 6th International Conference on Signal Processing, 2002..