Ordinal regression by a gravitational model in the field of educational data mining

Educational data mining EDM is a research area where the goal is to develop data mining methods to examine data critically from educational environments. Traditionally, EDM has addressed the following problems: clustering, classification, regression, anomaly detection and association rule mining. In this paper, the ordinal regression OR paradigm, is introduced in the field of EDM. The goal of OR problems is the classification of items in an ordinal scale. For instance, the prediction of students' performance in categories where the different grades could be ordered according to Ai¾?Bi¾?Ci¾?D is a classical example of an OR problem. The EDM community has not yet explored this paradigm despite the importance of these problems in the field of EDM. Furthermore, an amenable and interpretable OR model based on the concept of gravitation is proposed. The model is an extension of a recently proposed gravitational model that tackles imbalanced nominal classification problems. The model is carefully adapted to the ordinal scenario and validated with four EDM datasets. The results obtained were compared with state-of-the-art OR algorithms and nominal classification ones. The proposed models can be used to better understand the learning-teaching process in higher education environments.

[1]  Ernestina Menasalvas Ruiz,et al.  Web Usage Mining Project for Improving Web-Based Learning Sites , 2005, EUROCAST.

[2]  Josep Domingo-Ferrer,et al.  Regression for ordinal variables without underlying continuous variables , 2006, Inf. Sci..

[3]  Àngela Nebot,et al.  Identification of fuzzy models to predict students performance in an e-learning environment , 2006 .

[4]  César Hervás-Martínez,et al.  Evolutionary Generalized Radial Basis Function neural networks for improving prediction accuracy in gene classification using feature selection , 2012, Appl. Soft Comput..

[5]  Graham Gibbs,et al.  The Impact Of Training Of University Teachers on their Teaching Skills, their Approach to Teaching and the Approach to Learning of their Students , 2004 .

[6]  Elena Barberà,et al.  SYSTEMIC MULTICULTURAL MODEL FOR ONLINE EDUCATION: Tracing Connections among Learner Inputs, Instructional Processes, and Outcomes , 2011 .

[7]  Sebastián Ventura,et al.  Weighted Data Gravitation Classification for Standard and Imbalanced Data , 2013, IEEE Transactions on Cybernetics.

[8]  Dowming Yeh,et al.  What drives a successful e-Learning? An empirical investigation of the critical factors influencing learner satisfaction , 2008, Comput. Educ..

[9]  Sebastián Ventura,et al.  Knowledge Discovery with Genetic Programming for Providing Feedback to Courseware Authors , 2004, User Modeling and User-Adapted Interaction.

[10]  Roger A. Smith,et al.  Effectiveness of Personal Interaction in a Learner-Centered Paradigm Distance Education Class Based on Student Satisfaction , 2008 .

[11]  Bernard Monjardet,et al.  Concordance between two linear orders: The Spearman and Kendall coefficients revisited , 1997 .

[12]  Chen Wang,et al.  Improving Nearest Neighbor Classification with Simulated Gravitational Collapse , 2005, ICNC.

[13]  George Siemens,et al.  Learning analytics and educational data mining: towards communication and collaboration , 2012, LAK.

[14]  Eibe Frank,et al.  Logistic Model Trees , 2003, Machine Learning.

[15]  César Hervás-Martínez,et al.  Data Mining Algorithms to Classify Students , 2008, EDM.

[16]  Juan D. Velásquez,et al.  Text mining applied to plagiarism detection: The use of words for detecting deviations in the writing style , 2013, Expert Syst. Appl..

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[19]  Bo Yang,et al.  Data gravitation based classification , 2009, Inf. Sci..

[20]  Bruno Martins,et al.  Learning to rank academic experts in the DBLP dataset , 2015, Expert Syst. J. Knowl. Eng..

[21]  M. Yay,et al.  Application of Ordinal Logistic Regression and Artifical Neural Networks in a Study of Student Satistaction , 2009 .

[22]  Matjaz Gams,et al.  Combining domain knowledge and machine learning for robust fall detection , 2014, Expert Syst. J. Knowl. Eng..

[23]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[24]  Wei Chu,et al.  New approaches to support vector ordinal regression , 2005, ICML.

[25]  Terry Anderson,et al.  E-Learning in the 21st Century: A Community of Inquiry Framework for Research and Practice , 2016 .

[26]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[27]  Pedro Antonio Gutiérrez,et al.  Negative Correlation Ensemble Learning for Ordinal Regression , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Peter Shea,et al.  Community of inquiry as a theoretical framework to foster "epistemic engagement" and "cognitive presence" in online education , 2009, Comput. Educ..

[29]  Jiawei Han,et al.  Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[30]  Alejandro Peña-Ayala Review: Educational data mining: A survey and a data mining-based analysis of recent works , 2014 .

[31]  Erman Yukselturk,et al.  Do Entry Characteristics of Online Learners Affect Their Satisfaction , 2009 .

[32]  A. A. Akiri,et al.  Teachers’ Effectiveness and Students’ Academic Performance in Public Secondary Schools in Delta State, Nigeria , 2009 .

[33]  Xing Liu,et al.  Journal of Modern Applied Statistical Methods Ordinal Regression Analysis: Fitting the Proportional Odds Model Using Stata, Sas and Spss Statistical Software Applications & Review Ordinal Regression Analysis: Fitting the Proportional Odds Model Using Stata, Sas and Spss , 2022 .

[34]  William F. Punch,et al.  Using Genetic Algorithms for Data Mining Optimization in an Educational Web-Based System , 2003, GECCO.

[35]  Sebastián Ventura,et al.  Association rule mining using genetic programming to provide feedback to instructors from multiple‐choice quiz data , 2012, Expert Syst. J. Knowl. Eng..

[36]  Yang Zong-chang A vector gravitational force model for classification , 2007, Pattern Analysis and Applications.

[37]  Rynson W. H. Lau,et al.  Personalized courseware construction based on Web data mining , 2000, Proceedings of the First International Conference on Web Information Systems Engineering.

[38]  María Pérez-Ortiz,et al.  An Experimental Study of Different Ordinal Regression Methods and Measures , 2012, HAIS.

[39]  Jaime S. Cardoso,et al.  The unimodal model for the classification of ordinal data , 2008, Neural Networks.

[40]  L. Atay,et al.  Determining the factors that affect the satisfaction of students having undergraduate tourism education with the department by means of the method of classification tree. , 2010 .

[41]  Ray-I Chang,et al.  Data mining for providing a personalized learning path in creativity: An application of decision trees , 2013, Comput. Educ..

[42]  Maomi Ueno On-Line Statistical Outlier Detection of irregular learning processes for e-learning , 2003 .

[43]  Ian Witten,et al.  Data Mining , 2000 .

[44]  Arthur Bangert,et al.  The influence of social presence and teaching presence on the quality of online critical inquiry , 2008, J. Comput. High. Educ..

[45]  C. Tanner,et al.  Effects of school design on student outcomes , 2009 .

[46]  Nikolaus Hansen,et al.  Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[47]  Sebastián Ventura,et al.  Multiple instance learning for classifying students in learning management systems , 2011, Expert Syst. Appl..

[48]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[49]  Ernest Fokoue,et al.  Data Mining and Machine Learning Techniques for Extracting Patterns in Students’ Evaluations of Instructors , 2013 .

[50]  Stuart R. Palmer,et al.  Examining student satisfaction with wholly online learning , 2009, J. Comput. Assist. Learn..

[51]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[52]  Wei Chu,et al.  Support Vector Ordinal Regression , 2007, Neural Computation.