REPERIO: A Flexible Architecture for Recommendation in an Industrial Context

In this chapter, the authors describe Reperio, a flexible and generic industrial recommender system able to deal with several kinds of data sources (content-based, collaborative, social network) in the same framework and to work on multi-platforms (Web service in a multi-user mode and mobile device in a mono-user mode). The item-item matrix is the keystone of the architecture for its efficiency and flexibility properties. In the first part, the authors present core functionalities and requirements of recommendation in an industrial context. In the second part, they present the architecture of the system and the main issues involved in its development. In the last part, the authors report experimental results obtained using Reperio on benchmarks extracted from the Netflix Prize with different filtering strategies. To illustrate the interest and flexibility of the architecture, they also explain how it is possible to take into account, for recommendations, external sources of information. In particular, the authors show how to exploit user generated contents posted on social networks to fill the item-item matrix. The process proposed includes a step of opinion classification.

[1]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[2]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[3]  Isabelle Tellier,et al.  Reducing the Cold-Start Problem in Content Recommendation through Opinion Classification , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[4]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[5]  Naoaki Okazaki,et al.  Opinion classification with tree kernel SVM using linguistic modality analysis , 2009, CIKM.

[6]  K. Nageswara Rao,et al.  Application Domain and Functional Classification of Recommender Systems—A Survey , 2008 .

[7]  Yu He,et al.  The YouTube video recommendation system , 2010, RecSys '10.

[8]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[9]  Diego Reforgiato Recupero,et al.  Sentiment Analysis: Adjectives and Adverbs are Better than Adjectives Alone , 2007, ICWSM.

[10]  John Riedl,et al.  Explaining collaborative filtering recommendations , 2000, CSCW '00.

[11]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[12]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[13]  Bing Liu,et al.  The utility of linguistic rules in opinion mining , 2007, SIGIR.

[14]  George Karypis,et al.  Evaluation of Item-Based Top-N Recommendation Algorithms , 2001, CIKM '01.

[15]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[16]  Sean M. McNee,et al.  Being accurate is not enough: how accuracy metrics have hurt recommender systems , 2006, CHI Extended Abstracts.

[17]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[18]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[19]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[20]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[21]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[22]  Françoise Fessant,et al.  Designing Specific Weighted Similarity Measures to Improve Collaborative Filtering Systems , 2008, ICDM.

[23]  Soo-Min Kim,et al.  Automatic Identification of Pro and Con Reasons in Online Reviews , 2006, ACL.

[24]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[25]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[26]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.