A survey on socio-semantic information retrieval

Abstract: The rise of the Social Web and advances in the Semantic Web provide unprecedented possibilities for developing novel methods that enhance the information retrieval (IR) process by including varying degrees of semantics. We shed light on the corresponding notion of semantically enhanced information retrieval by presenting state-of-the-art techniques in related research areas. We describe techniques based on the main processes of a typical IR workflow and map them onto three main types of semantics, which range from formal semantic knowledge representations and content-based semantics to social semantics emerging through usage and user interactions.
