Meta-analysis of evaluation methods and metrics used in context-aware scholarly recommender systems

With the current growth of the proposed contextual recommending algorithms, evaluating them becomes more critical. Researchers of recommender systems have expressed concerns that the evaluation quality cannot be properly judged. We carried out meta-analyses of the evaluation methods and metrics of 67 studies related to context-aware scholarly recommender systems, from the years 2000 to 2014. The analysis of variance results shows that offline evaluation methods are more commonly used compared to online and user studies, with the maximum rate of success. It also reveals the popularity order of accuracy metrics (31%) including “Recall, Precision, F-Measure”, “Mean Absolute Error, and Questionnaire studies, Reliability, Accessibility, Feasibility, Usability, Applicability and Performance”. By using factor analysis, 28 different evaluation metrics were classified into eight groups. The results of analysis have shown the difference in evaluation methods in applying different groups of metrics. This study highlights the importance of how an evaluation method should be adequately designed and implemented. Additionally, a few recommendations for future investigations on recommending evaluation are proposed.

[1]  Carlo Tasso,et al.  Personalized Access to Scientific Publications: from Recommendation to Explanation , 2013, UMAP.

[2]  Michele Gorgoglione,et al.  Context-Aware Recommender Systems: A Comparison Of Three Approaches , 2011, DART@AI*IA.

[3]  Alexander Tuzhilin,et al.  Experimental comparison of pre- vs. post-filtering approaches in context-aware recommender systems , 2009, RecSys '09.

[4]  Jon Herlocker,et al.  Collaborative Filtering for Digital Libraries , 2012 .

[5]  Yan Yang,et al.  Literature recommendation based on reference graph , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[6]  Enrique Herrera-Viedma,et al.  A quality based recommender system to disseminate information in a university digital library , 2014, Inf. Sci..

[7]  Harith Alani,et al.  Exploiting Synergy Between Ontologies and Recommender Systems , 2002, Semantic Web Workshop.

[8]  Li Chen,et al.  Evaluating recommender systems from the user’s perspective: survey of the state of the art , 2012, User Modeling and User-Adapted Interaction.

[9]  Andreas Geyer-Schulz,et al.  Others Also Use: A Robust Recommender System for Scientific Libraries , 2003, ECDL.

[10]  Giseli Rabello Lopes,et al.  A personalized recommender system for digital libraries , 2008, WebMedia.

[11]  Enrique Herrera-Viedma,et al.  A google wave-based fuzzy recommender system to disseminate information in University Digital Libraries 2.0 , 2011, Inf. Sci..

[12]  Wolfgang Wörndl,et al.  A User Interface Concept for Context-Aware Recommender Systems , 2016, MuC.

[13]  Guy Shani,et al.  A Survey of Accuracy Evaluation Metrics of Recommendation Tasks , 2009, J. Mach. Learn. Res..

[14]  Enrique Herrera-Viedma,et al.  A Filtering and Recommender System Prototype for Scholarly Users of Digital Libraries , 2009, WSKS.

[15]  Joseph A. Konstan,et al.  Introduction to recommender systems: Algorithms and Evaluation , 2004, TOIS.

[16]  Sean M. McNee,et al.  Making recommendations better: an analytic model for human-recommender interaction , 2006, CHI Extended Abstracts.

[17]  Judith Masthoff,et al.  A Survey of Explanations in Recommender Systems , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[18]  Pasquale Lops,et al.  Introducing Serendipity in a Content-Based Recommender System , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[19]  Jöran Beel,et al.  A Comparison of Offline Evaluations, Online Evaluations, and User Studies in the Context of Research-Paper Recommender Systems , 2015, TPDL.

[20]  I-En Liao,et al.  A library recommender system based on a personal ontology model and collaborative filtering technique for English collections , 2010, Electron. Libr..

[21]  Markus Franke,et al.  Recommender Services in Scientific Digital Libraries , 2008 .

[22]  Wan-Shiou Yang,et al.  A task-focused literature recommender system for digital libraries , 2013, Online Inf. Rev..

[23]  Panagiotis Adamopoulos,et al.  On Unexpectedness in Recommender Systems: Or How to Expect the Unexpected , 2011, DiveRS@RecSys.

[24]  Joseph A. Konstan,et al.  Research resources for recommender systems , 1999 .

[25]  Min-Yen Kan,et al.  Scholarly paper recommendation via user's recent research interests , 2010, JCDL '10.

[26]  Hiroshi Itsumura,et al.  Use of Library Loan Records for Book Recommendation , 2012, 2012 IIAI International Conference on Advanced Applied Informatics.

[27]  André Vellino,et al.  A Hybrid, Multi-dimensional Recommender for Journal Articles in a Scientific Digital Library , 2007, 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops.

[28]  Luis Mateus Rocha TalkMine : a soft computing approach to adaptive knowledge recommendation , 2002 .

[29]  Gediminas Adomavicius,et al.  Incorporating contextual information in recommender systems using a multidimensional approach , 2005, TOIS.

[30]  Joseph A. Konstan,et al.  Content-Independent Task-Focused Recommendation , 2001, IEEE Internet Comput..

[31]  Hiep Phuc Luong,et al.  Concept-Based Document Recommendations for CiteSeer Authors , 2008, AH.

[32]  Il Im,et al.  Search Personalization: Knowledge-Based Recommendation in Digital Libraries , 2009, AMCIS.

[33]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[34]  Rashmi R. Sinha,et al.  The role of transparency in recommender systems , 2002, CHI Extended Abstracts.

[35]  John Riedl,et al.  Recommender systems: from algorithms to user experience , 2012, User Modeling and User-Adapted Interaction.

[36]  Lior Rokach,et al.  Recommender Systems Handbook , 2010 .

[37]  Liwei Liu,et al.  The Implication of Context and Criteria Information in Recommender Systems as applied to the Service Domain , 2013 .

[38]  Maria F. Trujillo,et al.  A Recommender System Based on Multi-features , 2007, ICCSA.

[39]  Marko A. Rodriguez,et al.  A Recommender System to Support the Scholarly Communication Process , 2009, ArXiv.

[40]  Thomas E. Potok,et al.  Discovery & Refinement of Scientific Information via a Recommender System , 2012 .

[41]  Anind K. Dey,et al.  Understanding and Using Context , 2001, Personal and Ubiquitous Computing.

[42]  Po-Yao Chao,et al.  Extending e-books with contextual knowledge recommenders by analyzing personal portfolio and annotation to help learners solve problems in time , 2004, IEEE International Conference on Advanced Learning Technologies, 2004. Proceedings..

[43]  Chih-Ping Wei,et al.  Coauthorship networks and academic literature recommendation , 2010, Electron. Commer. Res. Appl..

[44]  Shuaiqiang Wang,et al.  A survey of serendipity in recommender systems , 2016, Knowl. Based Syst..

[45]  Franca Garzotto,et al.  User-Centric vs. System-Centric Evaluation of Recommender Systems , 2013, INTERACT.

[46]  Mark Levy,et al.  Offline evaluation of recommender systems: all pain and no gain? , 2013, RepSys '13.

[47]  Mohsen Kahani,et al.  SemCiR: A citation recommendation system based on a novel semantic distance measure , 2013, Program.

[48]  Guy Shani,et al.  Evaluating Recommendation Systems , 2011, Recommender Systems Handbook.

[49]  Lars Schmidt-Thieme,et al.  MyMediaLite: a free recommender system library , 2011, RecSys '11.

[50]  Zhen-Hua Dong,et al.  Research of information recommendation system based on reading behavior , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[51]  Zhang Yujie,et al.  Some challenges for context-aware recommender systems , 2010, 2010 5th International Conference on Computer Science & Education.

[52]  Eric Horvitz,et al.  Collaborative filtering by personality diagnosis , 2000, UAI 2000.

[53]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[54]  V. Loia,et al.  TalkMine : a Soft Computing Approach to Adaptive Knowledge Recommendation , 2000 .

[55]  Jöran Beel,et al.  Scienstein : A Research Paper Recommender System , 2009 .

[56]  Yannick Pouliot,et al.  SmartSearch: automated recommendations using librarian expertise and the National Center for Biotechnology Information's Entrez programming utilities. , 2010, Journal of the Medical Library Association : JMLA.

[57]  Ryohei Orihara,et al.  Metrics for Evaluating the Serendipity of Recommendation Lists , 2007, JSAI.

[58]  Kai Yu,et al.  Temporal Social Tagging Based Collaborative Filtering Recommender for Digital Library , 2012, ICADL.

[59]  Marisa R. De Giusti,et al.  An Ontology-based Context Aware System for Selective Dissemination of Information in a Digital Library , 2010, ArXiv.

[60]  Jonathan L. Herlocker,et al.  COLLABORATIVE FILTERING: A NEW APPROACH TO SEARCHING DIGITAL LIBRARIES , 2004 .

[61]  Francesco Ricci,et al.  Context-based splitting of item ratings in collaborative filtering , 2009, RecSys '09.

[62]  Sean M. McNee,et al.  Being accurate is not enough: how accuracy metrics have hurt recommender systems , 2006, CHI Extended Abstracts.

[63]  Cecilia di Sciascio Advanced User Interfaces and Hybrid Recommendations for Exploratory Search , 2017, IUI Companion.

[64]  Katrien Verbert,et al.  HCI for Recommender Systems: the Past, the Present and the Future , 2016, RecSys.

[65]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[66]  V. E. Vinzi,et al.  A global Goodness – of – Fit index for PLS structural equation modelling 1 , 2004 .

[67]  Mouzhi Ge,et al.  Beyond accuracy: evaluating recommender systems by coverage and serendipity , 2010, RecSys '10.

[68]  Joseph A. Konstan,et al.  Evaluating recommender behavior for new users , 2014, RecSys '14.

[69]  Christoph Rensing,et al.  Evaluating Recommender Systems for Technology Enhanced Learning: A Quantitative Survey , 2015, IEEE Transactions on Learning Technologies.

[70]  Mohd Shahizan Othman,et al.  Data Sets for Offline Evaluation of Scholar's Recommender System , 2013, ACIIDS.

[71]  Yueheng Sun,et al.  A Personalized Paper Recommendation Approach Based on Web Paper Mining and Reviewer's Interest Modeling , 2009, 2009 International Conference on Research Challenges in Computer Science.

[72]  Sean M. McNee,et al.  On the recommending of citations for research papers , 2002, CSCW '02.

[73]  Linda J. Ferrier,et al.  Dysarthric speakers' intelligibility and speech characteristics in relation to computer speech recognition , 1995 .

[74]  Joost de Wit,et al.  Evaluating Recommender Systems -- An evaluation framework to predict user satisfaction for recommender systems in an electronic programme guide context , 2008 .

[75]  Félix Hernández-del-Olmo,et al.  Evaluation of recommender systems: A new approach , 2008, Expert Syst. Appl..

[76]  Jöran Beel,et al.  Towards reproducibility in recommender-systems research , 2016, User Modeling and User-Adapted Interaction.

[77]  Claudio Bettini,et al.  Private context-aware recommendation of points of interest: An initial investigation , 2012, 2012 IEEE International Conference on Pervasive Computing and Communications Workshops.

[78]  Andreas Nürnberger,et al.  Research paper recommender system evaluation: a quantitative literature survey , 2013, RepSys '13.

[79]  Kevin Harris,et al.  SERF: integrating human recommendations with search , 2004, CIKM '04.

[80]  Alexander Tuzhilin,et al.  Comparing context-aware recommender systems in terms of accuracy and diversity , 2012, User Modeling and User-Adapted Interaction.

[81]  Jia Zhang,et al.  A Social Tagging Based Collaborative Filtering Recommendation Algorithm for Digital Library , 2011, ICADL.

[82]  Lisa R. Schiff,et al.  The Melvyl Recommender Project: Developing Library Recommendation Services , 2006, D Lib Mag..

[83]  Jöran Beel,et al.  A comparative analysis of offline and online evaluations and discussion of research paper recommender system evaluation , 2013, RepSys '13.

[84]  Umberto Straccia,et al.  A personalized collaborative Digital Library environment: a model and an application , 2005, Inf. Process. Manag..

[85]  Demetrios G. Sampson,et al.  Layered evaluation in recommender systems: A retrospective assessment , 2014 .

[86]  Michael D. Ekstrand Towards Recommender Engineering: tools and experiments for identifying recommender differences , 2014 .

[87]  A. Ant Ozok,et al.  Design guidelines for effective recommender system interfaces based on a usability criteria conceptual model: results from a college student population , 2010, Behav. Inf. Technol..

[88]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[89]  Gary Geisler,et al.  Developing recommendation services for a digital library with uncertain and changing data , 2001, JCDL '01.

[90]  Eric Horvitz,et al.  Collaborative Filtering by Personality Diagnosis: A Hybrid Memory and Model-Based Approach , 2000, UAI.

[91]  F. E. Croxton,et al.  Applied General Statistics. Third Edition. , 1968 .

[92]  Enrique Herrera-Viedma,et al.  A multi-disciplinar recommender system to advice research resources in University Digital Libraries , 2009, Expert Syst. Appl..

[93]  Frederick E. Croxton,et al.  Applied General Statistics. , 1940 .

[94]  Seyed Reza Shahamiri,et al.  A systematic review of scholar context-aware recommender systems , 2015, Expert Syst. Appl..

[95]  June-Jei Kuo,et al.  A Library Recommender System Using Interest Change over Time and Matrix Clustering , 2012, ICADL.

[96]  Liang He,et al.  Evaluating recommender systems , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[97]  Enrique Herrera-Viedma,et al.  Dealing with incomplete information in a fuzzy linguistic recommender system to disseminate information in university digital libraries , 2010, Knowl. Based Syst..

[98]  Stuart E. Middleton,et al.  Ontological user profiling in recommender systems , 2004, TOIS.

[99]  Daniel Kifer,et al.  Context-aware citation recommendation , 2010, WWW '10.

[100]  Takayuki Ito,et al.  An implementation of a knowledge recommendation system based on similarity among users' profiles , 2002, Proceedings of the 41st SICE Annual Conference. SICE 2002..

[101]  Mohd Shahizan Othman,et al.  Incorporating scholar's background knowledge into recommender system for digital libraries , 2011, 2011 Malaysian Conference in Software Engineering.

[102]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[103]  E. B. Andersen,et al.  Modern factor analysis , 1961 .

[104]  Mu-Yen Chen,et al.  Using adaptive resonance theory and data-mining techniques for materials recommendation based on the e-library environment , 2008, Electron. Libr..

[105]  K. Nageswara Rao,et al.  Content-Based Document Recommender System for Aerospace Grey Literature: System Design , 2011 .

[106]  Li Chen,et al.  A user-centric evaluation framework for recommender systems , 2011, RecSys '11.

[107]  Maurizio Morisio,et al.  A systematic literature review of Linked Data‐based recommender systems , 2015, Concurr. Comput. Pract. Exp..

[108]  Timo Ojala,et al.  SmartLibrary - Location-Aware Mobile Library Service , 2003, Mobile HCI.

[109]  Joongmin Choi,et al.  An Ontology-Based Recommendation System Using Long-Term and Short-Term Preferences , 2011, 2011 International Conference on Information Science and Applications.

[110]  Z. Zaier,et al.  Recommendation Quality Evolution Based on Neighborhood Size , 2007, Third International Conference on Automated Production of Cross Media Content for Multi-Channel Distribution (AXMEDIS'07).

[111]  Adrian F. Clark,et al.  Engene: A genetic algorithm classifier for content-based recommender systems that does not require continuous user feedback , 2010, 2010 UK Workshop on Computational Intelligence (UKCI).

[112]  F. Timmins Nursing Research Generating and Assessing Evidence for Nursing Practice , 2013 .

[113]  Feng-Hsu Wang,et al.  Effective personalized recommendation based on time-framed navigation clustering and association mining , 2004, Expert Syst. Appl..

[114]  Anind K. Dey,et al.  Why and why not explanations improve the intelligibility of context-aware intelligent systems , 2009, CHI.

[115]  Bart P. Knijnenburg,et al.  Explaining the user experience of recommender systems , 2012, User Modeling and User-Adapted Interaction.

[116]  Matthias Jarke,et al.  A Clustering Approach for Collaborative Filtering Recommendation Using Social Network Analysis , 2011, J. Univers. Comput. Sci..

[117]  Tao Luo,et al.  Effective personalization based on association rule discovery from web usage data , 2001, WIDM '01.

[118]  Shaghayegh Sahebi,et al.  Recommender Systems: Sources of Knowledge and Evaluation Metrics , 2013 .

[119]  San-Yih Hwang,et al.  A prototype WWW literature recommendation system for digital libraries , 2003, Online Inf. Rev..

[120]  Xiaoming Li,et al.  A Paper Recommender for Scientific Literatures Based on Semantic Concept Similarity , 2008, ICADL.

[121]  Alan Said,et al.  Comparative recommender system evaluation: benchmarking recommendation frameworks , 2014, RecSys '14.

[122]  Pádraig Cunningham,et al.  An on-line evaluation framework for recommender systems , 2002 .

[123]  Maik Thiele,et al.  Setting Goals and Choosing Metrics for Recommender System Evaluations , 2011 .

[124]  Sean M. McNee,et al.  Meeting user information needs in recommender systems , 2006 .

[125]  Z. Zaier,et al.  Evaluating Recommender Systems , 2008, 2008 International Conference on Automated Solutions for Cross Media Content and Multi-Channel Distribution.

[126]  D. Powers Evaluation: From Precision, Recall and F-Factor to ROC, Informedness, Markedness & Correlation , 2008 .

[127]  Seyed Reza Shahamiri,et al.  The Role of Context for Recommendations in Digital Libraries , 2015 .

[128]  Alan Said,et al.  Evaluating the Accuracy and Utility of Recommender Systems , 2013 .

[129]  Stuart E. Middleton,et al.  Capturing interest through inference and visualization: ontological user profiling in recommender systems , 2003, K-CAP '03.

[130]  Eric Hanson A Beginner's Guide to Creating Library Linked Data: Lessons from NCSU's Organization Name Linked Data Project , 2014 .

[131]  Bernd Ludwig,et al.  Context relevance assessment and exploitation in mobile recommender systems , 2012, Personal and Ubiquitous Computing.

[132]  Dietmar Jannach,et al.  What Recommenders Recommend - An Analysis of Accuracy, Popularity, and Sales Diversity Effects , 2013, UMAP.

[133]  Simon Wakeling The user-centered design of a recommender system for a universal library catalogue , 2012, RecSys '12.

[134]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[135]  Ebrahim Afshar,et al.  A multi-layer contextual model for recommender systems in digital libraries , 2011, Aslib Proc..