Revisiting interestingness of strong symmetric association rules in educational data

Association rules are very useful in Educational Data Mining since they extract associations between educational items and present the results in an intuitive form to the teachers. Furthermore, they require less extensive expertise in Data Mining than other methods. We have extracted association rules with data from the Logic-ITA, a web-based learning environment to practice logic formal proofs. We were interested in detecting associations of mistakes. The rules we found were symmetrical, such as X→Y and Y→X, both with a strong support and a strong confidence. Furthermore, P(X) and P(Y) are both significantly higher than P(X,Y). Such figures lead to the fact that several interestingness measures such as lift, correlation or conviction rate X and Y as independent. Does it mean that these rules are not interesting? We argue in this paper that this is not necessarily the case. We investigated other relevance measures such as Chi square, cosine and contrasting rules and found that the results were leaning towards a positive correlation between X and Y. We also argue pragmatically with our experience of using these association rules to change parts of the course and of the positive impact of these changes on students' marks. We conclude with some thoughts about the appropriateness of relevance measures for Educational data.

[1]  Kalina Yacef,et al.  Mining Student Data Captured from a Web-Based Tutoring Tool: Initial Exploration and Results , 2004 .

[2]  Kalina Yacef,et al.  The Logic-ITA in the Classroom: A Medium Scale Experiment , 2005, Int. J. Artif. Intell. Educ..

[3]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[4]  Jie Lu,et al.  A Personalized e-Learning material Recommender System , 2004 .

[5]  William F. Punch,et al.  PREDICTING STUDENT PERFORMANCE: AN APPLICATION OF DATA MINING METHODS WITH THE EDUCATIONAL WEB-BASED SYSTEM LON-CAPA , 2003 .

[6]  Feng-Hsu Wang,et al.  Effective personalized recommendation based on time-framed navigation clustering and association mining , 2004, Expert Syst. Appl..

[7]  Geert Wets,et al.  Defining interestingness for association rules , 2003 .

[8]  William F. Punch,et al.  Mining interesting contrast rules for a web-based educational system , 2004, 2004 International Conference on Machine Learning and Applications, 2004. Proceedings..

[9]  F. Wang,et al.  On Using Data Mining For Browsing Log AnalysisIn Learning Environments , 2006 .

[10]  Agathe Merceron,et al.  A Web-Based Tutoring Tool with Mining Facilities to Improve Learning and Teaching , 2003 .

[11]  J. Gerring A case study , 2011, Technology and Society.

[12]  Muan Hong Ng,et al.  Using Genetic Algorithms for Data Mining in Web-based Educational Hypermedia Systems , 2002 .

[13]  Jie Lu A Personalized e-Learning Material Recommender System , 2004 .

[14]  R. Agarwal Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[15]  Kalina Yacef,et al.  Educational Data Mining: a Case Study , 2005, AIED.