Creating User Profiles Using Wikipedia

Creating user profiles is an important step in personalization. Many methods for user profile creation have been developed to date using different representations such as term vectors and concepts from an ontology like DMOZ. In this paper, we propose and evaluate different methods for creating user profiles using Wikipedia as the representation. The key idea in our approach is to map documents to Wikipedia concepts at different levels of resolution: words, key phrases, sentences, paragraphs, the document summary and the entire document itself. We suggest a method for evaluating profile recall by pooling the relevant results from the different methods and evaluate our results for both precision and recall. We also suggest a novel method for profile evaluation by assessing the recall over a known ontological profile drawn from DMOZ.

[1]  Carlotta Domeniconi,et al.  Building semantic kernels for text classification using wikipedia , 2008, KDD.

[2]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[3]  Wolfgang Nejdl,et al.  Using ODP metadata to personalize search , 2005, SIGIR '05.

[4]  Michael J. Pazzani,et al.  Learning and Revising User Profiles: The Identification of Interesting Web Sites , 1997, Machine Learning.

[5]  Stuart E. Middleton,et al.  Capturing interest through inference and visualization: ontological user profiling in recommender systems , 2003, K-CAP '03.

[6]  Alfred Kobsa,et al.  The Adaptive Web, Methods and Strategies of Web Personalization , 2007, The Adaptive Web.

[7]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[8]  Susan Gauch,et al.  Improving Ontology-Based User Profiles , 2004, RIAO.

[9]  Philip K. Chan,et al.  Learning implicit user interest hierarchy for context in personalization , 2003, IUI.

[10]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[11]  Ke Wang,et al.  Privacy-enhancing personalized web search , 2007, WWW '07.

[12]  Alessandro Micarelli,et al.  User Profiles for Personalized Information Access , 2007, The Adaptive Web.

[13]  Analía Amandi,et al.  User profiling for Web page filtering , 2005, IEEE Internet Computing.

[14]  Alfred Kobsa,et al.  Privacy-enhanced personalization , 2007, CACM.

[15]  Zhenya Zhang,et al.  Keywords Extracting as Text Chance Discovery , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[16]  Zhiqiang Zheng,et al.  Personalization from incomplete data: what you don't know can hurt , 2001, KDD '01.

[17]  Bamshad Mobasher,et al.  Web search personalization with ontological user profiles , 2007, CIKM '07.