Personalised PageRank for making recommendations in digital cultural heritage collections

In this paper we describe the use of Personalised PageRank (PPR) to generate recommendations from a large collection of cultural heritage items. Various methods for computing item-to-item similarities are investigated, together with representing the collection as a network over which random walks can be taken. The network can represent similarity between item metadata, item co-occurrences in search logs, and the similarity of items based on linking them to Wikipedia articles and categories. To evaluate the use of PPR, search logs from Europeana are used to simulate user interactions. PPR on each information source is compared to a standard retrieval-based baseline, resulting in higher performance.

[1]  Tsvi Kuflik,et al.  Personalization in cultural heritage: the road travelled and the one ahead , 2011, User Modeling and User-Adapted Interaction.

[2]  François Fouss,et al.  Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[3]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[4]  Lora Aroyo,et al.  The effects of transparency on trust in and acceptance of a content-based art recommender , 2008, User Modeling and User-Adapted Interaction.

[5]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[6]  Brigitte Trousse,et al.  Advanced data preprocessing for intersites Web usage mining , 2004, IEEE Intelligent Systems.

[7]  Ellen M. Voorhees,et al.  Retrieval evaluation with incomplete information , 2004, SIGIR '04.

[8]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[9]  Alan F. Smeaton,et al.  Personalisation and recommender systems in digital libraries , 2005, International Journal on Digital Libraries.

[10]  Arantxa Otegi,et al.  Implementing Recommendations in the PATHS System , 2013, TPDL Workshops.

[11]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[12]  Marco Gori,et al.  ItemRank: A Random-Walk Based Scoring Algorithm for Recommender Engines , 2007, IJCAI.

[13]  Alexander Tuzhilin,et al.  Towards the Next Generation of Recommender Systems , 2010, ICE-B 2010.

[14]  Fabio Persia,et al.  A Novel Strategy for Recommending Multimedia Objects and its Application in the Cultural Heritage Domain , 2011 .

[15]  Mukkai S. Krishnamoorthy,et al.  A random walk method for alleviating the sparsity problem in collaborative filtering , 2008, RecSys '08.

[16]  James Allan,et al.  A comparison of statistical significance tests for information retrieval evaluation , 2007, CIKM '07.

[17]  Mark Stevenson,et al.  Computing similarity between items in a digital library of cultural heritage , 2013, JOCCH.

[18]  CloughPaul,et al.  Computing similarity between items in a digital library of cultural heritage , 2013 .