Using Data Mining for Predicting Relationships between Online Question Theme and Final Grade

Introduction According to a recent survey conducted by Campus Computing (campuscomputing.net) and WCET (wcet.info), almost 88% of the surveyed institutions reported having used an LMS (Learning Management System) as a medium for course delivery for both on-campus and online offerings. In addition to various student information management systems (SISs), LMSs are providing the educational community with a goldmine of unexploited data about students' learning characteristics, behaviours, and patterns. The turning of such raw data into useful information and knowledge will enable institutes of higher education (HEIs) to rethink and improve students' learning experiences by using the data to streamline their teaching and learning processes, to extract and analyse students' learning and navigation patterns and behaviours, to analyse threaded discussion and interaction logs, and to provide feedback to students and to faculty about the unfolding of their students' learning experiences (Hung & Crooks, 2009; Garcia, Romero, Ventura, & de Castro, 2011). To this end, data mining has emerged as a powerful analytical and exploratory tool supported by faster multi-core 64 CPUs with larger memories, and by powerful database reporting tools. Originating in corporate business practices, data mining is multidisciplinary by nature and springs from several different disciplines including computer science, artificial intelligence, statistics, and biometrics. Using various approaches (such as classification, clustering, association rules, and visualization), data mining has been gaining momentum in higher education, which is now using a variety of applications, most notably in enrolment, learning patterns, personalization, and threaded discussion analysis. By discovering hidden relationships, patterns, and interdependencies, and by correlating raw/unstructured institutional data, data mining is beginning to facilitate the decision-making process in higher educational institutions. This interest in data mining is timely and critical, particularly as universities are diversifying their delivery modes to include more online and mobile learning environments. EDM has the potential to help HEIs understand the dynamics and patterns of a variety of learning environments and to provide insightful data for rethinking and improving students' learning experiences. This paper is focused on understanding live video streaming (LVS) students' learning behaviours, their interactions, and their learning outcomes. More specifically, this study explores how the interaction of students with each other and with their instructors predicts their learning outcomes (as measured by their final grades). By investigating these interrelated dimensions, this study aims to enrich the existing body of literature, while augmenting the understanding of effective learning strategies across a variety of new delivery modes. This paper is divided into four sections. It begins by reviewing the literature dealing with the use of data mining in administrative and academic environments, followed by a short discussion of the way in which data mining is used to understand various dimensions of learning. The second section explains the purpose and the research questions explored in this paper. The third section describes the background of the study and details its methodological approach (sampling, data collection, and analysis). The paper concludes by highlighting key findings, by discussing the study's limitations, and by proposing several recommendations for distance education administrators and practitioners. Data mining applications in administrative and academic environments At the intersection of several disciplines including computer science, statistics, psychometrics (Garcia et al., 2011), data mining has thrived in business practices as a knowledge discovery tool intended to transform raw data into highlevel knowledge for decision support (Hen & Lee, 2008). …

[1]  Osmar R. Zaïane,et al.  Building a Recommender Agent for e-Learning Systems , 2002, ICCE.

[2]  Osmar R. Za ¨ õane Building a Recommender Agent for e-Learning Systems , 2002 .

[3]  Robert Faulkner,et al.  The value of data mining in music education research and some findings from its application to a study of instrumental learning during childhood , 2010 .

[4]  Kara Dawson,et al.  Data for free: Using LMS activity logs to measure community in online courses , 2008, Internet High. Educ..

[5]  Jun Luo,et al.  Towards evaluating learners' behaviour in a Web-based distance learning environment , 2001, Proceedings IEEE International Conference on Advanced Learning Technologies.

[6]  Tung-Ching Lin,et al.  A design to promote group learning in e-learning: Experiences from the field , 2008, Comput. Educ..

[7]  Andrew Kusiak,et al.  Data Mining in Manufacturing: A Review , 2006 .

[8]  Ryan S. Baker,et al.  The State of Educational Data Mining in 2009: A Review and Future Visions. , 2009, EDM 2009.

[9]  Xiaohui Liu,et al.  The contribution of data mining to information science , 2004, J. Inf. Sci..

[10]  W. F. Punch,et al.  Predicting student performance: an application of data mining methods with an educational Web-based system , 2003, 33rd Annual Frontiers in Education, 2003. FIE 2003..

[11]  Paul Baepler,et al.  Academic Analytics and Data Mining in Higher Education , 2010 .

[12]  Shane Dawson,et al.  Mining LMS data to develop an "early warning system" for educators: A proof of concept , 2010, Comput. Educ..

[13]  Sebastián Ventura,et al.  A collaborative educational association rule mining tool , 2011, Internet High. Educ..

[14]  Àngela Nebot,et al.  Applying Data Mining Techniques to e-Learning Problems , 2007 .

[15]  Lin Chang,et al.  Applying Data Mining to Predict College Admissions Yield: A Case Study. , 2006 .

[16]  Dirk H. R. Spennemann,et al.  Patterns of user behavior in University on-line forums , 2004 .

[17]  Michael A. Herbert Staying the Course: A Study in Online Student Satisfaction and Retention , 2006 .

[18]  Gail Fitzgerald,et al.  An Investigation of Communicative Competence of ESL Students Using Electronic Discussion Boards , 2006 .

[19]  Steven M. Crooks,et al.  Examining Online Learning Patterns with Data Mining Techniques in Peer-Moderated and Teacher-Moderated Courses , 2009 .

[20]  Luis Martínez,et al.  Orieb, A Crs For Academic Orientation Using Qualitative Assessments , 2008, e-Learning.

[21]  Stanley Lemeshow,et al.  Applied Logistic Regression, Second Edition , 1989 .

[22]  Somnuk Phon-Amnuaisuk,et al.  Data Mining Application in Higher Learning Institutions , 2008, Informatics Educ..

[23]  J. Osborne Best Practices in Quantitative Methods , 2009 .

[24]  Laurie P. Dringus,et al.  Using data mining as a strategy for assessing asynchronous discussion forums , 2005, Comput. Educ..

[25]  Sai Peck Lee,et al.  Performance analysis of data mining tools cumulating with a proposed data mining middleware , 2008 .

[26]  Wu He,et al.  Using text mining to uncover students' technology-related problems in live video streaming , 2011, Br. J. Educ. Technol..

[27]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[28]  William F. Punch,et al.  PREDICTING STUDENT PERFORMANCE: AN APPLICATION OF DATA MINING METHODS WITH THE EDUCATIONAL WEB-BASED SYSTEM LON-CAPA , 2003 .

[29]  Jafar Habibi,et al.  Using Educational Data Mining Methods to Study the Impact of Virtual Classroom in E-Learning , 2010, EDM.

[30]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[31]  Ke Zhang,et al.  Revealing Online Learning Behaviors and Activity Patterns and Making Predictions with Data Mining Techniques in Online Teaching , 2008 .

[32]  Cherng-Jyh Yen,et al.  A predictive study of learner satisfaction and outcomes in face-to-face, satellite broadcast, and live video-streaming learning environments , 2010, Internet High. Educ..

[33]  Jacob Cohen,et al.  Applied multiple regression/correlation analysis for the behavioral sciences , 1979 .

[34]  Fu-Ren Lin,et al.  Discovering genres of online discussion threads via text mining , 2009, Comput. Educ..

[35]  Alvaro Ortigosa,et al.  A data mining approach to guide students through the enrollment process based on academic performance , 2011, User Modeling and User-Adapted Interaction.

[36]  Sebastián Ventura,et al.  Predicting Student Grades in Learning Management Systems with Multiple Instance Genetic Programming. , 2009, EDM 2009.

[37]  Sebastián Ventura,et al.  Data mining in course management systems: Moodle case study and tutorial , 2008, Comput. Educ..

[38]  Judy Kay,et al.  Clustering and Sequential Pattern Mining of Online Collaborative Learning Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[39]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[40]  J. E. King Binary Logistic Regression , 2008 .

[41]  M. Norusis IBM SPSS Statistics 19 Advanced Statistical Procedures Companion , 2011 .

[42]  Ilias Petrounias,et al.  A Framework for Using Web Usage Mining to Personalise E-learning , 2007, Seventh IEEE International Conference on Advanced Learning Technologies (ICALT 2007).

[43]  A. O'connell Logistic Regression Models for Ordinal Response Variables , 2005 .

[44]  Lakhmi C. Jain,et al.  Evolution of Teaching and Learning Paradigms in Intelligent Environment , 2007 .

[45]  Sebastián Ventura,et al.  Web usage mining for predicting final marks of students that use Moodle courses , 2013, Comput. Appl. Eng. Educ..