Personalised Reranking of Paper Recommendations Using Paper Content and User Behavior

Academic search engines have been widely used to access academic papers, where users’ information needs are explicitly represented as search queries. Some modern recommender systems have taken one step further by predicting users’ information needs without the presence of an explicit query. In this article, we examine an academic paper recommender that sends out paper recommendations in email newsletters, based on the users’ browsing history on the academic search engine. Specifically, we look at users who regularly browse papers on the search engine, and we sign up for the recommendation newsletters for the first time. We address the task of reranking the recommendation candidates that are generated by a production system for such users. We face the challenge that the users on whom we focus have not interacted with the recommender system before, which is a common scenario that every recommender system encounters when new users sign up. We propose an approach to reranking candidate recommendations that utilizes both paper content and user behavior. The approach is designed to suit the characteristics unique to our academic recommendation setting. For instance, content similarity measures can be used to find the closest match between candidate recommendations and the papers previously browsed by the user. To this end, we use a knowledge graph derived from paper metadata to compare entity similarities (papers, authors, and journals) in the embedding space. Since the users on whom we focus have no prior interactions with the recommender system, we propose a model to learn a mapping from users’ browsed articles to user clicks on the recommendations. We combine both content and behavior into a hybrid reranking model that outperforms the production baseline significantly, providing a relative 13% increase in Mean Average Precision and 28% in Precision@1. Moreover, we provide a detailed analysis of the model components, highlighting where the performance boost comes from. The obtained insights reveal useful components for the reranking process and can be generalized to other academic recommendation settings as well, such as the utility of graph embedding similarity. Also, recent papers browsed by users provide stronger evidence for recommendation than historical ones.

[1]  Wang-Chien Lee,et al.  CiteSeerx: an architecture and web service design for an academic document search engine , 2006, WWW '06.

[2]  Schubert Foo,et al.  Can I have more of these please?: Assisting researchers in finding similar research papers from a seed basket of papers , 2018, Electron. Libr..

[3]  Hao-Ren Ke,et al.  Exploring behavior of E-journal users in science and technology: Transaction log analysis of Elsevier's ScienceDirect OnSite in Taiwan , 2002 .

[4]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[5]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[6]  George Karypis,et al.  User-Specific Feature-Based Similarity Models for Top-n Recommendation of New Items , 2015, ACM Trans. Intell. Syst. Technol..

[7]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[8]  Tat-Seng Chua,et al.  Neural Factorization Machines for Sparse Predictive Analytics , 2017, SIGIR.

[9]  John Riedl,et al.  Automatically building research reading lists , 2010, RecSys '10.

[10]  Enhong Chen,et al.  Sparse Factorization Machines for Click-through Rate Prediction , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[11]  Mingyu Lu,et al.  Increasing Serendipity of Recommender System with Ranking Topic Model , 2014 .

[12]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[13]  Zhiyuan Liu,et al.  Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.

[14]  Anasua Mitra,et al.  On Low Overlap among Search Results of Academic Search Engines , 2017, WWW.

[15]  W. Bruce Croft,et al.  Recommending citations for academic papers , 2007, SIGIR.

[16]  Martin Ester,et al.  Collaborative Denoising Auto-Encoders for Top-N Recommender Systems , 2016, WSDM.

[17]  Evangelia Christakopoulou,et al.  Local Item-Item Models For Top-N Recommendation , 2016, RecSys.

[18]  Ann Blandford,et al.  Keeping up to date: An academic researcher's information journey , 2017, J. Assoc. Inf. Sci. Technol..

[19]  M. de Rijke,et al.  Online Exploration for Detecting Shifts in Fresh Intent , 2014, CIKM.

[20]  M. de Rijke,et al.  Information Processing and Management Investigating Queries and Search Failures in Academic Search , 2022 .

[21]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[22]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[23]  Jian Pei,et al.  Citation recommendation without author supervision , 2011, WSDM '11.

[24]  Saul Vargas,et al.  Building recommender systems for scholarly information , 2017, SWM '17.

[25]  Wenyi Huang,et al.  A Neural Probabilistic Model for Context Based Citation Recommendation , 2015, AAAI.

[26]  Jöran Beel,et al.  Mr. DLib: Recommendations-as-a-Service (RaaS) for Academia , 2017, 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL).

[27]  Eric Horvitz,et al.  Collaborative Filtering by Personality Diagnosis: A Hybrid Memory and Model-Based Approach , 2000, UAI.

[28]  Wei Chu,et al.  Refining Recency Search Results with User Click Feedback , 2011, ArXiv.

[29]  Maarten de Rijke,et al.  Academic Search in Response to Major Scientific Events , 2017, BIR@ECIR.

[30]  Yong Yu,et al.  LorSLIM: Low Rank Sparse Linear Methods for Top-N Recommendations , 2014, 2014 IEEE International Conference on Data Mining.

[31]  Ruoming Jin,et al.  A Topic Modeling Approach and Its Integration into the Random Walk Framework for Academic Search , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[32]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens+ , 2004, JCDL.

[33]  James P. Callan,et al.  Explicit Semantic Ranking for Academic Search via Knowledge Graph Embedding , 2017, WWW.

[34]  Guocong Song songgc Point-Wise Approach for Yandex Personalized Web Search Challenge , 2014 .

[35]  Carlo Tasso,et al.  A Keyphrase-Based Paper Recommender System , 2011, IRCDL.

[36]  Ümit V. Çatalyürek,et al.  Recommendation on Academic Networks using Direction Aware Citation Analysis , 2012, ArXiv.

[37]  Zhao Kang,et al.  Top-N Recommendation with Novel Rank Approximation , 2016, SDM.

[38]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[39]  M. de Rijke,et al.  Do Topic Shift and Query Reformulation Patterns Correlate in Academic Search? , 2017, ECIR.

[40]  Yu He,et al.  The YouTube video recommendation system , 2010, RecSys '10.

[41]  Hugo Larochelle,et al.  Leveraging user libraries to bootstrap collaborative filtering , 2014, KDD.

[42]  Suzanne Fricke,et al.  Semantic Scholar , 2018, Journal of the Medical Library Association : JMLA.

[43]  M. de Rijke,et al.  Click-based Hot Fixes for Underperforming Torso Queries , 2016, SIGIR.

[44]  Yi Fang,et al.  Neural Citation Network for Context-Aware Citation Recommendation , 2017, SIGIR.

[45]  M. de Rijke,et al.  Characterizing and predicting downloads in academic search , 2019, Inf. Process. Manag..

[46]  Christopher J. C. Burges,et al.  From RankNet to LambdaRank to LambdaMART: An Overview , 2010 .

[47]  George Karypis,et al.  FISM: factored item similarity models for top-N recommender systems , 2013, KDD.

[48]  Yuhong Guo,et al.  Improving Top-N Recommendation with Heterogeneous Loss , 2016, IJCAI.

[49]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[50]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[51]  Amin Mantrach,et al.  Item cold-start recommendations: learning local collective embeddings , 2014, RecSys '14.

[52]  Feng Xia,et al.  Context-Based Collaborative Filtering for Citation Recommendation , 2015, IEEE Access.

[53]  Lior Rokach,et al.  Introduction to Recommender Systems Handbook , 2011, Recommender Systems Handbook.

[54]  Fabio Aiolli A Preliminary Study on a Recommender System for the Million Songs Dataset Challenge , 2013, IIR.

[55]  Yong Yu,et al.  SVDFeature: a toolkit for feature-based collaborative filtering , 2012, J. Mach. Learn. Res..

[56]  Orland Hoeber,et al.  Supporting academic search tasks through citation visualization and exploration , 2017, International Journal on Digital Libraries.

[57]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[58]  Marcos André Gonçalves,et al.  A source independent framework for research paper recommendation , 2011, JCDL '11.

[59]  Ann Blandford,et al.  Understanding “influence:” an exploratory study of academics' processes of knowledge construction through iterative and interactive information seeking , 2015, J. Assoc. Inf. Sci. Technol..

[60]  Jie Tang,et al.  AMiner: Toward Understanding Big Scholar Data , 2016, WSDM.

[61]  George Karypis,et al.  SLIM: Sparse Linear Methods for Top-N Recommender Systems , 2011, 2011 IEEE 11th International Conference on Data Mining.

[62]  Dongyan Zhao,et al.  Recommending academic papers via users' reading purposes , 2012, RecSys.

[63]  Paul Covington,et al.  Deep Neural Networks for YouTube Recommendations , 2016, RecSys.

[64]  John Riedl,et al.  Analysis of recommendation algorithms for e-commerce , 2000, EC '00.

[65]  Zhaohui Wu,et al.  Towards better understanding of academic search , 2016, 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL).

[66]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[67]  Min-Yen Kan,et al.  Scholarly paper recommendation via user's recent research interests , 2010, JCDL '10.

[68]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[69]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[70]  Bradley M. Hemminger,et al.  A study of factors that affect the information-seeking behavior of academic scientists , 2012, J. Assoc. Inf. Sci. Technol..