Analysis of Educational Data Mining

This research paper aims to compare the performance of various clustering and classification algorithms which are applied on the same educational dataset. Educational Data Mining (EDM) uses these algorithms to explore educational statistics to discover patterns and predictions in data that illustrate learner’s performance. Various design challenges such as accuracy, objective and functionality, and overheads when the data set is extremely large, etc., have been highlighted. The algorithms discussed here can be classified as centroid-based clustering, graph-based clustering, and various supervised classification algorithms. Further, after comparison of these algorithms, this paper aims at using the cited literature survey to determine the most suited algorithm according to the need of EDM clustering or classification. The regular search is on to understand students better and to know the patterns in which they learn to make it more efficient for them. Nowadays, many educational institutions have educational databases that can be utilized in various ways to make it effective for students but are unutilized. Powerful tools are required to get benefits from these educational databases. EDM is one of those emerging tools that analyzes the data collected from learning and teaching and then applies the techniques from machine learning and data mining for predicting student’s future behavior by learning detailed information such as student’s grades, knowledge, achievements, motivation, and attitude.

[1]  Vinod Kumar Yadav,et al.  Enterprise Architecture for Semantic Web Mining in Education , 2009, 2009 Second International Conference on Computer and Electrical Engineering.

[2]  Jason M. Harley,et al.  Clustering and Profiling Students According to their Interactions with an Intelligent Tutoring System Fostering Self-Regulated Learning , 2013, EDM 2013.

[3]  Chin-Hwa Kuo,et al.  Concept-Based Pages Recommendation by Using Cluster Algorithm , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[4]  Safwan Shatnawi,et al.  Applying a novel clustering technique based on FP-tree to university timetabling problem: A case study , 2010, The 2010 International Conference on Computer Engineering & Systems.

[5]  Hasan Hani,et al.  Identifying the factors affecting the success and failure of e-learning students using cluster analysis , 2013, 7th International Conference on e-Commerce in Developing Countries:with focus on e-Security.

[6]  Vasile Paul Bresfelean,et al.  Determining students’ academic failure profile founded on data mining methods , 2008, ITI 2008 - 30th International Conference on Information Technology Interfaces.

[7]  J LoboL.M.R.,et al.  Combination of Clustering, Classification & Association Rule based Approach for Course Recommender System in E-learning , 2012 .

[8]  Qing Shen,et al.  The analysis of student's grade based on Rough Sets , 2010, 2010 3rd IEEE International Conference on Ubi-Media Computing.

[9]  Ryan S. Baker,et al.  The State of Educational Data Mining in 2009: A Review and Future Visions. , 2009, EDM 2009.

[10]  Tang Van To,et al.  Clustering approach to examination scheduling , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[11]  S. Saranya,et al.  Student Progress Analysis and Educational Institutional Growth Prognosis Using Data Mining , 2014 .

[12]  S. Parack,et al.  Application of data mining in educational databases for predicting academic trends and patterns , 2012, 2012 IEEE International Conference on Technology Enhanced Education (ICTEE).

[13]  Lin Shi,et al.  Mining Association Rules Based on Apriori Algorithm and Application , 2009, 2009 International Forum on Computer Science-Technology and Applications.

[14]  Michael D. Cooper,et al.  Using clustering techniques to detect usage patterns in a Web-based information system , 2001, J. Assoc. Inf. Sci. Technol..

[15]  Yoshitaka Sakurai,et al.  Personalizing Learning Processes by Data Mining , 2010, 2010 10th IEEE International Conference on Advanced Learning Technologies.

[16]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[17]  Paul Prinsloo,et al.  Benchmarking the habits and behaviours of successful students: A case study of academic-business collaboration , 2014 .

[18]  Asma A. Al-shargabi,et al.  Discovering vital patterns from UST students data by applying data mining techniques , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[19]  Norshahriah Wahab,et al.  Predicting NDUM Student's Academic Performance Using Data Mining Techniques , 2009, 2009 Second International Conference on Computer and Electrical Engineering.

[20]  Gong Youping,et al.  The application of data mining technology in analysis of college student's performance , 2010, The 2nd International Conference on Information Science and Engineering.

[21]  Edin Osmanbegović,et al.  DATA MINING APPROACH FOR PREDICTING STUDENT PERFORMANCE , 2012 .

[22]  Feng Wang,et al.  E-learning Behavior Analysis Based on Fuzzy Clustering , 2009, 2009 Third International Conference on Genetic and Evolutionary Computing.

[23]  Zlatko J. Kovacic,et al.  Early Prediction of Student Success: Mining Students Enrolment Data , 2010 .

[24]  Gao Ningning Proposing Data Warehouse and Data Mining in Teaching Management Research , 2010, 2010 International Forum on Information Technology and Applications.

[25]  Ellen Jansen,et al.  Predicting first-year achievement by pedagogy and skill development in the first weeks at university , 2011 .

[26]  T. Soni Madhulatha,et al.  An Overview on Clustering Methods , 2012, ArXiv.

[27]  Zhiwu Liu,et al.  Prediction and Analysis for Students' Marks Based on Decision Tree Algorithm , 2010, 2010 Third International Conference on Intelligent Networks and Intelligent Systems.

[28]  K. Iyakutti,et al.  Data mining techniques for teaching result analysis using rough set theory , 2008, 2008 International Conference on Computing, Communication and Networking.

[29]  Vive Kumar,et al.  Particle Swarm Optimization (PSO)-Based Clustering for Improving the Quality of Learning using Cloud Computing , 2013, 2013 IEEE 13th International Conference on Advanced Learning Technologies.

[30]  Shunli Zhang,et al.  Application of Data Mining on Students' Quality Evaluation , 2011, 2011 3rd International Workshop on Intelligent Systems and Applications.

[31]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[32]  Yoshitaka Sakurai,et al.  Success Chances Estimation of University Curricula Based on Educational History, Self-Estimated Intellectual Traits and Vocational Ambitions , 2011, 2011 IEEE 11th International Conference on Advanced Learning Technologies.

[33]  Z. Alimazighi,et al.  A decisional tool for quality improvement in higher education , 2008, 2008 3rd International Conference on Information and Communication Technologies: From Theory to Applications.

[34]  Xiang Li,et al.  Research on Early-Warning Model of Students' Academic Records Based on Association Rules , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[35]  Ch. Rupa,et al.  Data Mining: Performance Improvement In Education Sector Using Classification And Clustering Algorithm , 2013 .

[36]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[37]  Kamel Barkaoui,et al.  A Two-Step Clustering Approach for Improving Educational Process Model Discovery , 2016, 2016 IEEE 25th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE).