User Profiles for Personalized Information Access

The amount of information available online is increasing exponentially. While this information is a valuable resource, its sheer volume limits its value. Many research projects and companies are exploring personalized applications that manage this deluge by tailoring the information presented to individual users. To be effective, these applications all need to gather and exploit some information about individuals, an area broadly called user profiling. This chapter surveys some of the most popular techniques for collecting information about users and for representing and building user profiles. In particular, explicit techniques for collecting user information are contrasted with implicit collection via browser caches, proxy servers, browser agents, desktop agents, and search logs. We discuss in detail user profiles represented as weighted keywords, semantic networks, and weighted concepts. We review how each of these profiles is constructed and give examples of projects that employ each technique. Finally, we briefly discuss the importance of privacy protection in profiling.
