Students Performance Prediction System Using Multi Agent Data Mining Technique

A high prediction accuracy of the students’ performance is more helpful to identify the low performance students at the beginning of the learning process. Data mining is used to attain this objective. Data mining techniques are used to discover models or patterns of data, and it is much helpful in the decision-making. Boosting technique is the most popular techniques for constructing ensembles of classifier to improve the classification accuracy. Adaptive Boosting (AdaBoost) is a generation of boosting algorithm. It is used for the binary classification and not applicable to multiclass classification directly. SAMME boosting technique extends AdaBoost to a multiclass classification without reduce it to a set of sub-binary classification. In this paper, students’ performance prediction system using Multi Agent Data Mining is proposed to predict the performance of the students based on their data with high prediction accuracy and provide help to the low students by optimization rules. The proposed system has been implemented and evaluated by investigate the prediction accuracy of Adaboost.M1 and LogitBoost ensemble classifiers methods and with C4.5 single classifier method. The results show that using SAMME Boosting technique improves the prediction accuracy and outperformed C4.5 single classifier and LogitBoost.

[1]  Pat Langley,et al.  Induction of One-Level Decision Trees , 1992, ML.

[2]  Daniel Neagu,et al.  Assessing information quality of e-learning systems: a web mining approach , 2011, Comput. Hum. Behav..

[3]  Ying Liu,et al.  Drug design by machine learning: ensemble learning for QSAR modeling , 2005, Fourth International Conference on Machine Learning and Applications (ICMLA'05).

[4]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[5]  Azuraliza Abu Bakar,et al.  Agent based data classification approach for data mining , 2008, 2008 International Symposium on Information Technology.

[6]  Kamal Ali Albashiri An investigation into the issues of multi-agent data mining , 2010 .

[7]  D. Edwards Data Mining: Concepts, Models, Methods, and Algorithms , 2003 .

[8]  H. Gupta,et al.  Optimization of C4.5 Decision Tree Algorithm for Data Mining Application , 2013 .

[9]  Luis Baumela,et al.  Multi-class boosting with asymmetric binary weak-learners , 2014, Pattern Recognit..

[10]  R. Geetha Ramani,et al.  Gender specific classification of road accident patterns through data mining techniques , 2012, IEEE-International Conference On Advances In Engineering, Science And Management (ICAESM -2012).

[11]  Yanchun Zhang,et al.  AdaBoost algorithm with random forests for predicting breast cancer survivability , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[12]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[13]  Edin Osmanbegović,et al.  DATA MINING APPROACH FOR PREDICTING STUDENT PERFORMANCE , 2012 .

[14]  Chandan K. Reddy,et al.  Boosting Methods for Protein Fold Recognition: An Empirical Comparison , 2008, 2008 IEEE International Conference on Bioinformatics and Biomedicine.

[15]  Vivek Kumar,et al.  Multi-agent based decision Support System using Data Mining and Case Based Reasoning , 2011 .

[16]  Rui Jiang,et al.  Comparative study of ensemble learning approaches in the identification of disease mutations , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[17]  Mehmed Kantardzic,et al.  Data-Mining Concepts , 2011 .

[18]  Aik Choon Tan,et al.  Ensemble machine learning on gene expression data for cancer classification. , 2003, Applied bioinformatics.

[19]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[20]  Y. Freund,et al.  Discussion of the Paper \additive Logistic Regression: a Statistical View of Boosting" By , 2000 .

[21]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[22]  Erhard Rahm,et al.  Data Cleaning: Problems and Current Approaches , 2000, IEEE Data Eng. Bull..

[23]  Robert E. Schapire,et al.  The strength of weak learnability , 1990, Mach. Learn..

[24]  M. N. Vijayalakshmi,et al.  Mining of student academic evaluation records in higher education , 2012, 2012 International Conference on Recent Advances in Computing and Software Systems.

[25]  Saurabh Pal,et al.  Data Mining: A prediction for performance improvement using classification , 2012, ArXiv.

[26]  Lilly Suriani Affendey,et al.  Improving Academic Performance Prediction using Voting Technique in Data Mining , 2010 .

[27]  William F. Punch,et al.  Enhancing Online Learning Performance: An Application of Data Mining Methods , 2004, CATE.

[28]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[29]  Chih-Ming Chen,et al.  Learning Performance Assessment Approach Using Web-Based Learning Portfolios for E-learning Systems , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[30]  Robin Prakash Mathur,et al.  Survey on Decision Tree Classification algorithms for the Evaluation of Student Performance , 2013, BIOINFORMATICS 2013.

[31]  Ching Wei Wang,et al.  New Ensemble Machine Learning Method for Classification and Prediction on Gene Expression Data , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[32]  John Elder,et al.  Handbook of Statistical Analysis and Data Mining Applications , 2009 .

[33]  Ch. Rupa,et al.  Data Mining: Performance Improvement In Education Sector Using Classification And Clustering Algorithm , 2013 .

[34]  Surjeet Kumar Yadav,et al.  Data Mining Applications: A comparative Study for Predicting Student's performance , 2012, ArXiv.

[35]  Barbara Messing,et al.  An Introduction to MultiAgent Systems , 2002, Künstliche Intell..

[36]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[37]  Ashfaqur Rahman,et al.  Cluster-Oriented Ensemble Classifier: Impact of Multicluster Characterization on Ensemble Classifier Learning , 2012, IEEE Transactions on Knowledge and Data Engineering.

[38]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[39]  Nazmul Huda,et al.  A Comparative Study of Bagging, Boosting and C4.5: The Recent Improvements in Decision Tree Learning Algorithm , 2010 .