The 2010 KDD Cup Competition Dataset: Engaging the machine learning community in predictive learning analytics

In the spring of 2010, the Association for Computing Machinery (ACM) Special Interest Group on Knowledge Discovery and Data-mining (KDD) selected a dataset from an educational technology for its annual competition. The competition, titled “Educational Data Mining Challenge”, tasked participants with predicting the correctness of student answers to questions within an Intelligent Tutoring System (ITS) from The Cognitive Tutors suite of tutors. This challenge was hosted by the PSLC DataShop, and included data provided by the Carnegie Learning Inc., producers of The Cognitive Tutors. Consisting of over 9GB of student data this was the largest KDD Cup dataset up to that point in time. The competition brought in 655 competitors submitting 3,400 solutions. Five years later, we believe the competition dataset has been the most often cited from an educational technology platform.