Comparative Analysis of Bayes and Lazy Classification Algorithms

Data mining is the non-trivial extraction of implicit, earlier unknown and potentially useful information about data. There are several data mining techniques have been developed and used in data mining projects which includes classification, clustering, association rules, prediction, and sequential patterns. Data mining applications are used in various areas such as sales, marketing, banking, finance, health care, insurance and medicine. There are various research domains in data mining namely web mining, text mining, image mining, sequence mining, privacy preserving data mining, etc. Text mining is a technique which extracts information from both structured and unstructured data and also finding patterns which is novel and not known earlier. It is also known as knowledge discovery from text (KDT), deals with the machine supported analysis of text. Text mining is used in various areas such as information retrieval, document similarity, natural language processing and so on. Searching for similar documents is an important problem in text mining. The first and essential step of document similarity is to classify the documents based on their category. In this research work, we have analysed the performance of Bayesian and Lazy classifiers for classifying the files which are stored in the computer hard disk. There are two algorithms in Bayesian classifier namely BayesNet, and Naive Bayes. In lazy classifier has three algorithms namely IBL, IBK and Kstar. The performances of Bayesian and lazy classifiers are analysed by applying various performance factors. From the experimental results, it is observed that the lazy classifier is more efficient than Bayesian classifier.

[1]  Mahendra Tiwari,et al.  Performance analysis of Data Mining algorithms in Weka , 2012 .

[2]  Sunila Godara,et al.  PERFORMANCE ANALYSIS OF CLUSTERING ALGORITHMS FOR CHARACTER RECOGNITION USING WEKA TOOL , 2013 .

[3]  Anshul Goyal,et al.  Performance Comparison of Naïve Bayes and J 48 Classification Algorithms , 2012 .

[4]  D. S. Guru,et al.  Representation and Classification of Text Documents: A Brief Review , 2010 .

[5]  Ian H. Witten,et al.  Learning a concept-based document similarity measure , 2012, J. Assoc. Inf. Sci. Technol..

[6]  Trilok Chand Sharma,et al.  WEKA Approach for Comparative Study of Classification Algorithm , 2013 .

[7]  Mohammed N. Al-Kabi,et al.  Comparative Assessment of the Performance of Three WEKA Text Classifiers Applied to Arabic Text , 2012 .

[8]  Khairullah Khan,et al.  A Review of Machine Learning Algorithms for Text-Documents Classification , 2010 .

[9]  Kaushik H. Raviya,et al.  Performance Evaluation of Different Data Mining Classification Algorithm Using WEKA , 2012 .

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[11]  Samir Kumar Bandyopadhyay,et al.  A tutorial review on Text Mining Algorithms , 2012 .

[12]  Mohd Fauzi Othman,et al.  Comparison of different classification techniques using WEKA for breast cancer , 2007 .

[13]  M. Phil,et al.  Comparative Analysis of Classification Function Techniques for Heart Disease Prediction , 2013 .

[14]  Dr.S.Selvarajan S.Deepajothi A Comparative Study of Classification Techniques On Adult Data Set , 2012 .

[15]  Markus Reischl,et al.  Data mining tools , 2011, WIREs Data Mining Knowl. Discov..

[16]  John Akhilomen Data Mining Application for Cyber Credit-Card Fraud Detection System , 2013, ICDM.