Clustering and Sequential Pattern Mining of Online Collaborative Learning Data

Group work is widespread in education, and the growing use of online tools that support it generates large amounts of data. We aim to exploit this data to support mirroring: presenting useful high-level views of information about the group, together with desired patterns characterizing the behavior of strong groups. The goal is to enable groups and their facilitators to see relevant aspects of the group's operation, to provide feedback on whether these are more likely to be associated with positive or negative outcomes, and to indicate where the problems are. We explore how useful mirror information can be extracted via a theory-driven approach and a range of clustering and sequential pattern mining techniques. The context is a senior software development project in which students use the collaboration tool TRAC. We extract patterns distinguishing the stronger from the weaker groups and gain insight into the success factors. The results point to the importance of leadership and group interaction, and give promising indications of whether these are occurring. Patterns indicating good individual practices were also identified. We found that some key measures can be mined from early data. The results are promising for advising groups at the start of a project and for early identification of effective and poor practices, in time for remediation.
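To make the idea of patterns that distinguish stronger from weaker groups concrete, the following is a minimal sketch and not the method used in the paper. It counts contiguous event n-grams, a deliberate simplification of full sequential pattern mining, and ranks them by the difference in support between two sets of groups. The event codes (T for a ticket action, W for a wiki edit, S for a repository commit), the sample sequences, and the function names are all hypothetical.

```python
from collections import Counter


def ngram_support(sequences, n):
    """Fraction of sequences that contain each contiguous n-gram at least once."""
    counts = Counter()
    for seq in sequences:
        # A set ensures each n-gram counts once per sequence.
        seen = {tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)}
        counts.update(seen)
    total = len(sequences)
    return {gram: c / total for gram, c in counts.items()}


def contrast_patterns(strong, weak, n=2, min_gap=0.5):
    """Return n-grams whose support differs by at least min_gap between sets.

    A positive gap means the pattern is more common in the strong groups.
    """
    s = ngram_support(strong, n)
    w = ngram_support(weak, n)
    result = []
    for gram in set(s) | set(w):
        gap = s.get(gram, 0.0) - w.get(gram, 0.0)
        if abs(gap) >= min_gap:
            result.append((gram, gap))
    # Largest absolute gap first; ties broken by the n-gram itself.
    return sorted(result, key=lambda item: (-abs(item[1]), item[0]))


# Hypothetical event logs, one sequence per group (illustrative data only).
strong_groups = [["T", "W", "S", "T", "S"], ["T", "S", "W", "T", "S"]]
weak_groups = [["W", "W", "S"], ["S", "W", "W"]]

patterns = contrast_patterns(strong_groups, weak_groups)
for gram, gap in patterns:
    print(gram, gap)
```

In this toy data, a ticket action followed by a commit appears only in the strong groups, while consecutive wiki edits appear only in the weak ones. Real sequential pattern miners also allow gaps between events and prune the search by minimum support, which this sketch omits.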
