A Study of User Profile Generation from Folksonomies

Recommendation systems which aim at providing relevant information to users are becoming more and more important and desirable due to the enormous amount of information available on the Web. Crucial to the performance of a recommendation system is the accuracy of the user profiles used to represent the interests of the users. In recent years, popular collaborative tagging systems such as del.icio.us have aggregated an abundant amount of user-contributed metadata which provides valuable information about the interests of the users. In this paper, we present our analysis on the personal data in folksonomies, and investigate how accurate user profiles can be generated from this data. We reveal that the majority of users possess multiple interests, and propose an algorithm to generate user profiles which can accurately represent these multiple interests. We also discuss how these user profiles can be used for recommending Web pages and organising personal data.

[1]  Christoph Meinel,et al.  Web Search Personalization Via Social Bookmarking and Tagging , 2007, ISWC/ASWC.

[2]  Tereza Iofciu,et al.  Finding Communities of Practice from User Profiles Based on Folksonomies , 2006, EC-TEL Workshops.

[3]  Paolo Zandegiacomo Rizio,et al.  Information Filtering and Retrieving of Context-Aware Applications Within the MoBe Framework , 2005 .

[4]  M. Krötzsch,et al.  Wikipedia and the Semantic Web The Missing Links ? , 2005 .

[5]  Fabio Paternò,et al.  Supporting Museum Co-visits Using Mobile Devices , 2004, Mobile HCI.

[6]  Steve Cayzer,et al.  Semantic blogging and decentralized knowledge management , 2004, CACM.

[7]  Thomas R. Gruber,et al.  Collective knowledge systems: Where the Social Web meets the Semantic Web , 2008, J. Web Semant..

[8]  Silviu Cucerzan,et al.  Large-Scale Named Entity Disambiguation Based on Wikipedia Data , 2007, EMNLP.

[9]  Steve Cayzer,et al.  Learning User Profiles from Tagging Data and Leveraging them for Personal(ized) Information Access , 2007, WWW 2007.

[10]  Andreas Hotho,et al.  FolkRank : A Ranking Algorithm for Folksonomies , 2006, LWA.

[11]  Henry Lieberman,et al.  Letizia: An Agent That Assists Web Browsing , 1995, IJCAI.

[12]  Simone Paolo Ponzetto,et al.  WikiRelate! Computing Semantic Relatedness Using Wikipedia , 2006, AAAI.

[13]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Raymond Y. K. Lau,et al.  Utilizing Search Intent in Topic Ontology-Based User Profile for Web Mining , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[15]  Jens Lehmann,et al.  What Have Innsbruck and Leipzig in Common? Extracting Semantics from Wiki Content , 2007, ESWC.

[16]  Yoav Shoham,et al.  Learning Information Retrieval Agents: Experiments with Automated Web Browsing , 1995 .

[17]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[18]  S. Wasserman,et al.  Social Network Analysis: Computer Programs , 1994 .

[19]  Jennifer Trant,et al.  Exploring the potential for social tagging and folksonomy in art museums: Proof of concept , 2006, New Rev. Hypermedia Multim..

[20]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[21]  John Scott What is social network analysis , 2010 .

[22]  Hideaki Takeda,et al.  SOCIOBIBLOG: A DECENTRALIZED PLATFORM FOR SHARING BIBLIOGRAPHIC INFORMATION , 2007 .

[23]  Michael R. Middleton,et al.  Cultural institutions and Web 2.0 , 2007 .

[24]  Bernardo A. Huberman,et al.  The Structure of Collaborative Tagging Systems , 2005, ArXiv.

[25]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[26]  Mining a Large-Scale Term-Concept Network from Wikipedia , 2006 .

[27]  Marc Ehrig,et al.  State of the art on ontology alignment , 2013 .

[28]  Andrea Marchetti,et al.  SemKey: A Semantic Collaborative Tagging System , 2007 .

[29]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[30]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Enrico Motta,et al.  Revyu.com: a Reviewing and Rating Site for the Web of Data , 2007, Semantic Web Challenge.

[32]  Tim Berners-Lee,et al.  World-Wide Web: The Information Universe , 1992, Electron. Netw. Res. Appl. Policy.

[33]  Wolfgang Nejdl,et al.  Extracting Semantics Relationships between Wikipedia Categories , 2006, SemWiki.

[34]  John G. Breslin,et al.  Using Semantics to Enhance the Blogging Experience , 2006, ESWC.

[35]  Analía Amandi,et al.  User profiling in personal information agents: a survey , 2005, The Knowledge Engineering Review.

[36]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Stefano Levialdi,et al.  Semantic Halo for Collaboration Tagging Systems , 2006 .

[38]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[39]  Vldb Endowment,et al.  The VLDB journal : the international journal on very large data bases. , 1992 .

[40]  Robin D. Burke,et al.  Hybrid Recommender Systems: Survey and Experiments , 2002, User Modeling and User-Adapted Interaction.

[41]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[42]  Mary Shapcott,et al.  Generating semantically enriched user profiles for Web personalization , 2007, TOIT.

[43]  David Buttler,et al.  Tracking multiple topics for finding interesting articles , 2007, KDD '07.

[44]  David R. Karger,et al.  Potluck: Data mash-up tool for casual users , 2008, J. Web Semant..

[45]  Wolfgang Nejdl,et al.  Search strategies for scientific collaboration networks , 2005, P2PIR '05.

[46]  Martin Hepp,et al.  myOntology : The Marriage of Ontology Engineering and Collective Intelligence , 2007 .

[47]  Céline Van Damme,et al.  FolksOntology : An Integrated Approach for Turning Folksonomies into Ontologies , 2007 .

[48]  Gerald Reif,et al.  Semantic Clipboard - Semantically Enriched Data Exchange Between Desktop Applications , 2006, SemDesk.

[49]  Marko Grobelnik,et al.  User Profiling for Interest-focused Browsing History , 2005 .

[50]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Hugh C. Davis,et al.  MICROCOSM: An Open Model for Hypermedia with Dynamic Linking , 1990, ECHT.

[52]  Hugh C. Davis,et al.  Creating Structure from Disorder - Using Folksonomies to Create Semantic Metadata , 2007, WEBIST.

[53]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[54]  Masatoshi Yoshikawa,et al.  Adaptive web search based on user profile constructed without any effort from users , 2004, WWW '04.

[55]  David R. Karger,et al.  What would it mean to blog on the semantic web? , 2005, J. Web Semant..

[56]  Mayer D. Schwartz,et al.  The Dexter Hypertext Reference Model , 1994, CACM.

[57]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[58]  Anupriya Ankolekar,et al.  The two cultures: mashing up web 2.0 and the semantic web , 2007, WWW '07.

[59]  Razvan C. Bunescu,et al.  Using Encyclopedic Knowledge for Named entity Disambiguation , 2006, EACL.

[60]  Hugh C. Davis,et al.  Towards an integrated information environment with open hypermedia systems , 1992, ECHT '92.

[61]  Stefano Mizzaro,et al.  Quality control in scholarly publishing: A new proposal , 2003, J. Assoc. Inf. Sci. Technol..

[62]  Enrico Motta,et al.  Integrating Folksonomies with the Semantic Web , 2007, ESWC.

[63]  David R. Karger,et al.  Exhibit: lightweight structured data publishing , 2007, WWW '07.

[64]  Sebastian Schaffert,et al.  IkeWiki: A Semantic Wiki for Collaborative Knowledge Management , 2006, 15th IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE'06).

[65]  Hideaki Takeda,et al.  Agent Organization and Communication with Multiple Ontologies , 1995, Int. J. Cooperative Inf. Syst..

[66]  Humphrey Sorensen,et al.  PSUN: A Profiling System for Usenet News , 1995, CIKM Information Agents Workshop.

[67]  Sebastian Schaffert Semantic Social Software: Semantically Enabled Social Software or Socially Enabled Semantic Web? , 2008 .

[68]  Stuart E. Middleton,et al.  Ontological user profiling in recommender systems , 2004, TOIS.