Content-Based Collaborative Filtering for Question Difficulty Calibration

Question calibration especially on difficulty degree is important for supporting Web-based testing and assessment. Currently, Item Response Theory (IRT) has traditionally applied for question difficulty calibration. However, it is tedious and time-consuming to collect sufficient historical response information manually, and computational expensive to calibrate question difficulty especially for large-scale question datasets. In this paper, we propose an effective Content-based Collaborative Filtering (CCF) approach for automatic calibration of question difficulty degree. In the proposed approach, a dataset of questions with available user responses and knowledge features is first gathered. Next, collaborative filtering is used to predict unknown user responses from known responses of questions. With all the responses, the difficulty degree of each question in the dataset is then estimated using IRT. Finally, when a new question is queried, the content-based similarity approach is used to find similar existing questions from the dataset based on the knowledge features for estimating the new question's difficulty degree. In this paper, we will present the proposed CCF approach and its performance evaluation in comparison with other techniques.

[1]  R. Hambleton,et al.  Item Response Theory , 1984, The History of Educational Measurement.

[2]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[3]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[4]  Wim van den Noortgate,et al.  Acquiring Item Difficulty Estimates: a Collaborative Effort of Data and Judgment. Nominee for Best Paper Award , 2011, EDM.

[5]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[6]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[7]  H. Wainer Computerized Adaptive Testing , 2010 .

[8]  Ryan S. Baker,et al.  The State of Educational Data Mining in 2009: A Review and Future Visions. , 2009, EDM 2009.

[9]  Kenneth R. Koedinger,et al.  A Data Repository for the EDM Community: The PSLC DataShop , 2010 .

[10]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[11]  Seonghoon Kim A Comparative Study of IRT Fixed Parameter Calibration Methods. , 2006 .

[12]  Gwo-Jen Hwang,et al.  A test-sheet-generating algorithm for multiple assessment requirements , 2003, IEEE Trans. Educ..

[13]  Rynson W. H. Lau,et al.  Guest Editors' Introduction: Emerging Internet Technologies for E-Learning , 2009, IEEE Internet Computing.

[14]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[15]  Wim van den Noortgate,et al.  Item difficulty estimation: An auspicious collaboration between data and judgment , 2012, Comput. Educ..

[16]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[17]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.