An Ensemble-Based Decision Tree Approach for Educational Data Mining

Nowadays, data mining and machine learning techniques are applied to a variety of different topics (e. g., healthcare and disease, security, decision support, sentiment analysis, education, etc.). Educational data mining investigates the performance of students and gives solutions to enhance the quality of education. The aim of this study is to use different data mining and machine learning algorithms on actual data sets related to students. To this end, we apply two decision tree methods. The methods can create several simple and understandable rules. Moreover, the performance of a decision tree is optimized by using an ensemble technique named Rotation Forest algorithm. Our findings indicate that the Rotation Forest algorithm can enhance the performance of decision trees in terms of different metrics. In addition, we found that the size of tree generated by decision trees ensemble were bigger than simple ones. This means that the proposed methodology can reveal more information concerning simple rules.

[1]  Raymond Y. K. Lau,et al.  A two-stage decision model for information filtering , 2012, Decis. Support Syst..

[2]  Moloud Abdar,et al.  Design of A Universal User Model for Dynamic Crowd Preference Sensing and Decision-Making Behavior Analysis , 2017, IEEE Access.

[3]  Moloud Abdar,et al.  Using Decision Trees in Data Mining for Predicting Factors Influencing of Heart Disease , 2015 .

[4]  Moloud Abdar,et al.  Impact of Patients’ Gender on Parkinson’s disease using Classification Algorithms , 2018 .

[5]  Rommel N. Carvalho,et al.  Educational data mining: Predictive analysis of academic performance of public school students in the capital of Brazil , 2019, Journal of Business Research.

[6]  K. Mandl,et al.  Associations Between Exposure to and Expression of Negative Opinions About Human Papillomavirus Vaccines on Social Media: An Observational Study , 2015, Journal of medical Internet research.

[7]  Moloud Abdar,et al.  Understanding regional characteristics through crowd preference and confidence mining in P2P accommodation rental service , 2017, Libr. Hi Tech.

[8]  Soumya K. Ghosh,et al.  Data mining based analysis to explore the effect of teaching on student performance , 2018, Education and Information Technologies.

[9]  Raymond Y. K. Lau,et al.  Utilizing Search Intent in Topic Ontology-Based User Profile for Web Mining , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[10]  E. Coiera,et al.  Citations alone were enough to predict favorable conclusions in reviews of neuraminidase inhibitors. , 2015, Journal of clinical epidemiology.

[11]  Lara Fontanella,et al.  Identifying Students at Risk of Academic Failure Within the Educational Data Mining Framework , 2019 .

[12]  Ching-Chieh Kiu Supervised Educational Data Mining to Discover Students’ Learning Process to Improve Students’ Performance , 2018 .

[13]  Sadiq Hussain,et al.  Educational Data Mining and Analysis of Students’ Academic Performance Using WEKA , 2018 .

[14]  S. Anupama Kumar,et al.  Efficiency of Multi-instance Learning in Educational Data Mining , 2018 .

[15]  Zhenyu Yang,et al.  Sentiment analysis on tweets for social events , 2013, Proceedings of the 2013 IEEE 17th International Conference on Computer Supported Cooperative Work in Design (CSCWD).

[16]  Tshilidzi Marwala,et al.  An efficient educational data mining approach to support e-learning , 2017, Wirel. Networks.

[17]  Moloud Abdar,et al.  Performance analysis of classification algorithms on early detection of liver disease , 2017, Expert Syst. Appl..

[18]  Ji Zhang,et al.  Sentiment Analysis for Depression Detection on Social Networks , 2016, ADMA.

[19]  Ji Zhang,et al.  Coupling topic modelling in opinion mining for social media analysis , 2017, WI.

[20]  Moloud Abdar,et al.  Using PSO Algorithm for Producing Best Rules in Diagnosis of Heart Disease , 2017, 2017 International Conference on Computer and Applications (ICCA).