A Clustering Methodology of Web Log Data for Learning Management Systems

Learning Management Systems (LMS) collect large amounts of data. Data mining techniques can be applied to analyse their web data log files. The instructors may use this data for assessing and measuring their courses. In this respect, we have proposed a methodology for analysing LMS courses and students’ activity. This methodology uses a Markov CLustering (MCL) algorithm for clustering the students’ activity and a SimpleKMeans algorithm for clustering the courses. Additionally we provide a visualisation of the results using scatter plots and 3D graphs. We propose specific metrics for the assessment of the courses based on the course usage. These metrics applied to data originated from the LMS log files of the Information Management Department of the TEI of Kavala. The results show that these metrics, if combined properly, can quantify quality characteristics of the courses. Furthermore, the application of the MCL algorithm to students’ activities provides useful insights to their usage of the LMS platform.

[1]  Jaideep Srivastava,et al.  Web Mining , 2004, Data Mining and Knowledge Discovery.

[2]  J. Beck,et al.  An Educational Data Mining Tool to Browse Tutor-Student Interactions : Time Will Tell ! , 2005 .

[3]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[4]  Sebastián Ventura,et al.  Data mining in course management systems: Moodle case study and tutorial , 2008, Comput. Educ..

[5]  V. Komis,et al.  Logging of fingertip actions is not enough for analysis of learning activities , 2005 .

[6]  D. Edwards Data Mining: Concepts, Models, Methods, and Algorithms , 2003 .

[7]  Stavros Valsamidis,et al.  Proposed framework for data mining in e-learning: The case of open e-class , 2009, IADIS AC.

[8]  Riccardo Mazza,et al.  GISMO: a Graphical Interactive Student Monitoring Tool for Course Management Systems , 2004 .

[9]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[10]  Johannes Fürnkranz,et al.  Web Mining , 2005, Data Mining and Knowledge Discovery Handbook.

[11]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[12]  Neil T. Heffernan,et al.  Informing Teachers Live about Student Learning: Reporting in Assistment System , 2005 .

[13]  Rong Gu,et al.  Interest mining in virtual learning environments , 2008, Online Inf. Rev..

[14]  Stavros Valsamidis,et al.  Homogeneity and Enrichment: Two Metrics for Web Applications Assessment , 2010 .

[15]  K. Nagi,et al.  Research analysis of moodle reports to gauge the level of interactivity in elearning courses at Assumption University, Thailand , 2008, 2008 International Conference on Computer and Communication Engineering.

[16]  Vania Dimitrova,et al.  CourseVis: A graphical student monitoring tool for supporting instructors in web-based distance courses , 2007, Int. J. Hum. Comput. Stud..

[17]  Sebastián Ventura,et al.  Mining and Visualizing Visited Trails in Web-Based Educational Systems , 2008, EDM.

[18]  Leon Goldovsky,et al.  BioLayout(Java): versatile network visualisation of structural and functional relationships. , 2005, Applied bioinformatics.

[19]  N. R. Srinivasa Raghavan,et al.  Data mining in e-commerce: A survey , 2005 .

[20]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[21]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[22]  S. vanDongen Graph Clustering by Flow Simulation , 2000 .

[23]  Elena Álvarez,et al.  MATEP: Monitoring and Analysis Tool for E-Learning Platforms , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[24]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[25]  Haji Binali,et al.  A new significant area: Emotion detection in E-learning using opinion mining techniques , 2009, 2009 3rd IEEE International Conference on Digital Ecosystems and Technologies.

[26]  Edith Schonberg,et al.  Analysis and Visualization of Metrics for Online Merchandising , 1999, WEBKDD.

[27]  Wilhelmiina Hämäläinen,et al.  Comparison of Machine Learning Methods for Intelligent Tutoring Systems , 2006, Intelligent Tutoring Systems.

[28]  Mohamed Jemni,et al.  Automatic Recommendations for E-Learning Personalization Based on Web Usage Mining Techniques and Information Retrieval , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[29]  Myra Spiliopoulou,et al.  Data Mining for the Web , 1999, PKDD.

[30]  Ranieri Baraglia,et al.  SUGGEST: a Web usage mining system , 2002, Proceedings. International Conference on Information Technology: Coding and Computing.

[31]  L. Imhof Matrix Algebra and Its Applications to Statistics and Econometrics , 1998 .

[32]  Umeshwar Dayal,et al.  From User Access Patterns to Dynamic Hypertext Linking , 1996, Comput. Networks.

[33]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[34]  Oren Etzioni,et al.  Adaptive Web Sites: Conceptual Cluster Mining , 1999, IJCAI.

[35]  Elizabeth Chang,et al.  Usability Metrics for E-learning , 2003, OTM Workshops.

[36]  Stavros Valsamidis,et al.  Course Ranking and Automated Suggestions through Web Mining , 2010, 2010 10th IEEE International Conference on Advanced Learning Technologies.

[37]  Lefteris Angelis,et al.  PuReD-MCL: a graph-based PubMed document clustering methodology , 2008, Bioinform..

[38]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems. OTM 2018 Conferences , 2018, Lecture Notes in Computer Science.