Use of discrete choice models with recommender systems

Recommender systems, also known as personalization systems, are a popular technique for reducing information overload and finding items that are of interest to the user. Increasingly, people are turning to these systems to help them find the information that is most valuable to them. A variety of techniques have been proposed for performing recommendation, including content-based, collaborative, knowledge-based and other techniques. All of the known recommendation techniques have strengths and weaknesses, and many researchers have chosen to combine techniques in different ways. In this dissertation, we investigate the use of discrete choice models as a radically new technique for giving personalized recommendations. Discrete choice modeling allows the integration of item and user specific data as well as contextual information that may be crucial in some applications. By giving a general multidimensional model that depends on a range of inputs, discrete choice subsumes other techniques used in the literature. We present a software package that allows the adaptation of generalized discrete choice models to the recommendation task. Using a generalized framework that integrates recent advances and extensions of discrete choice allows the estimation of complex models that give a realistic representation of the behavior inherent in the choice process, and consequently a better understanding of behavior and improvements in predictions. Statistical learning, an important part of personalization, is realized using Bayesian procedures to update the model as more observations are collected. As a test bed for investigating the effectiveness of this approach, we explore the application of discrete choice as a solution to the problem of recommending academic courses to students. The goal is to facilitate the course selection task by recommending subjects that would satisfy students' personal preferences and suit their abilities and interests. A generalized mixed logit model is used to analyze survey and course evaluation data. The resulting model identifies factors that make an academic subject “recommendable”. It is used as the backbone for the recommender system application. The dissertation finally presents the software architecture of this system to highlight the software package's adaptability and extensibility to other applications. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

[1]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[2]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[3]  Joan L. Walker,et al.  Integration of Choice and Latent Variable Models , 1999 .

[4]  John R. Hauser,et al.  Fast Polyhedral Adaptive Conjoint Estimation , 2002 .

[5]  Dean P. Foster,et al.  Clustering Methods for Collaborative Filtering , 1998, AAAI 1998.

[6]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[7]  Robin Cohen,et al.  Hybrid Recommender Systems for Electronic Commerce , 2000 .

[8]  S. Rabe-Hesketh,et al.  Generalized linear latent and mixed models , 2000 .

[9]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[10]  Moshe Ben-Akiva,et al.  Discrete Choice Analysis: Theory and Application to Travel Demand , 1985 .

[11]  Mark Claypool,et al.  Combining Content-Based and Collaborative Filters in an Online Newspaper , 1999, SIGIR 1999.

[12]  Robin Burke,et al.  Knowledge-based recommender systems , 2000 .

[13]  Jordan J. Louviere,et al.  Combining sources of preference data , 1998 .

[14]  C. Bhat Quasi-random maximum simulated likelihood estimation of the mixed multinomial logit model , 2001 .

[15]  D. Hensher,et al.  Stated Choice Methods: Analysis and Applications , 2000 .

[16]  Wiebke Kuklys,et al.  Stated choice methods: analysis and application, Jordan J. Louviere, David A. Hensher and Joffre D. Swait, Cambridge University Press, ISBN: 0-521-78830-7 , 2002 .

[17]  John R. Hauser,et al.  Application and Test of Web-based Adaptive Polyhedral Conjoint Analysis , 2002 .

[18]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[19]  Joan L. Walker Extended discrete choice models : integrated framework, flexible error structures, and latent variables , 2001 .

[20]  John Riedl,et al.  Analysis of recommendation algorithms for e-commerce , 2000, EC '00.

[21]  Bradley N. Miller,et al.  PocketLens: Toward a personal recommender system , 2004, TOIS.

[22]  Joshua Alspector,et al.  Comparing feature-based and clique-based user models for movie selection , 1998, DL '98.

[23]  William Gould,et al.  Maximum likelihood estimation with stata , 1999 .

[24]  Joel Huber,et al.  Customer-Specific Taste Parameters and Mixed Logit , 1999 .

[25]  D. McFadden,et al.  MIXED MNL MODELS FOR DISCRETE RESPONSE , 2000 .

[26]  D. McFadden A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration , 1989 .

[27]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[28]  Gediminas Adomavicius,et al.  Incorporating contextual information in recommender systems using a multidimensional approach , 2005, TOIS.

[29]  K. Train,et al.  On the Similarity of Classical and Bayesian Estimates of Individual Mean Partworths , 2000 .

[30]  Loriene Roy,et al.  Content-based book recommending using learning for text categorization , 1999, DL '00.

[31]  Michael J. Pazzani,et al.  A Framework for Collaborative, Content-Based and Demographic Filtering , 1999, Artificial Intelligence Review.

[32]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[33]  Michael J. Pazzani,et al.  User Modeling for Adaptive News Access , 2000, User Modeling and User-Adapted Interaction.

[34]  Christian Gourieroux,et al.  Simulation-based econometric methods , 1996 .

[35]  J. Hauser,et al.  The virtual customer , 2002 .

[36]  C. Bhat An analysis of evening commute stop-making behavior using repeated choice observations from a multi-day survey , 1999 .

[37]  John Riedl,et al.  E-Commerce Recommendation Applications , 2004, Data Mining and Knowledge Discovery.

[38]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[39]  Peter Y. Chen,et al.  Correlation: Parametric and Nonparametric Measures , 2002 .