Data Sets for Offline Evaluation of Scholar's Recommender System

In an offline evaluation of recommender systems, data sets have been extensively used to measure the performance of recommender systems through statistical analysis. However, many data sets are domain and application dependent and cannot be engaged in different domains. This paper presents the construction of data sets for the offline evaluation of a scholar's recommender system that suggests papers to scholars based on their background knowledge. We design a cross-validation approach to reduce the risk of false interpretations by relying on multiple independent sources of information. Our approach addresses four important issues including the privacy and diversity of knowledge resources, the quality of knowledge, and the timely knowledge. The resulting data sets represent the instance of scholar's background knowledge in clusters of learning themes, which can be used to measure the performance of the scholar's recommender system.

[1]  Tong Zhang,et al.  Fundamentals of Predictive Text Mining , 2010, Texts in Computer Science.

[2]  Nikos Manouselis,et al.  Developing a Recommendation Web Service for a Federation of Learning Repositories , 2009, 2009 International Conference on Intelligent Networking and Collaborative Systems.

[3]  Roelof van Zwol,et al.  Flickr: Who is Looking? , 2007, Web Intelligence.

[4]  Erik Duval,et al.  Dataset-driven research for improving recommender systems for learning , 2011, LAK.

[5]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[6]  Yu Zhenhua,et al.  Measuring Semantic Similarity between Words Using Wikipedia , 2009, 2009 International Conference on Web Information Systems and Mining.

[7]  Ryan Shaun Joazeiro de Baker,et al.  PSLC DataShop: A Data Analysis Service for the Learning Science Community , 2010, Intelligent Tutoring Systems.

[8]  Ian H. Witten,et al.  Topic indexing with Wikipedia , 2008 .

[9]  Dan Frankowski,et al.  Collaborative Filtering Recommender Systems , 2007, The Adaptive Web.

[10]  K. Jehn,et al.  Using triangulation to validate themes in qualitative studies , 2009 .

[11]  Alfred Kobsa,et al.  The Adaptive Web, Methods and Strategies of Web Personalization , 2007, The Adaptive Web.

[12]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[13]  Guy Shani,et al.  Evaluating Recommendation Systems , 2011, Recommender Systems Handbook.

[14]  Meng Li,et al.  Stream Operators for Querying Data Streams , 2005, WAIM.

[15]  Christian Wartena,et al.  Topic Detection by Clustering Keywords , 2008, 2008 19th International Workshop on Database and Expert Systems Applications.

[16]  Robin Burke,et al.  Knowledge-based recommender systems , 2000 .

[17]  Erik Duval,et al.  TEL as a Recommendation Context , 2013 .

[18]  Yurdaer N. Doganata,et al.  Glossary extraction and utilization in the information search and delivery system for IBM Technical Support , 2004, IBM Syst. J..

[19]  Rob Nadolski,et al.  Evaluating the Effectiveness of Personalised Recommender Systems in Learning Networks , 2009, Learning Network Services for Professional Development.

[20]  C. Butts Social network analysis: A methodological introduction , 2008 .

[21]  Sholom M. Weiss,et al.  Information Retrieval and Text Mining , 2015 .

[22]  Lior Rokach,et al.  Recommender Systems Handbook , 2010 .

[23]  Jie Tang,et al.  A Unified Approach to Researcher Profiling , 2007 .

[24]  Erik Duval,et al.  Recommender Systems for Learning , 2012, Springer Briefs in Electrical and Computer Engineering.

[25]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[26]  Juan-Zi Li,et al.  Keyword Extraction Using Support Vector Machine , 2006, WAIM.

[27]  Rob Koper,et al.  Learning Network Services for Professional Development , 2009 .

[28]  Nuria Oliver,et al.  Data Mining Methods for Recommender Systems , 2015, Recommender Systems Handbook.