Exploratory Professional Search through Semantic Post-Analysis of Search Results

Professional Search is usually a recall-oriented problem. For helping the user to get efficiently a concise overview, to quickly restrict the search space and to make sense of the results, in this article we present an exploratory strategy for professional search that is based on semantic post-analysis of the classical search results (of keyword based queries). The described strategy can exploit the metadata that are already available, as well as the results of textual clustering and entity mining that can be performed at query time. The outcome of this process (i.e. metadata, clusters and entities grouped in categories) complement the ranked list of results produced from the core search engine with useful information for the user. This extra information is useful not only for providing a concise overview of the search results, but also for supporting a faceted and session-based interaction scheme that allows the users to restrict their focus gradually and to explore other related information. To tackle the corresponding configuration requirements of this process, we show how one can exploit the (constantly evolving) Linked Data for specifying the entities of interest and for providing further information about the identified entities. In this article, apart from detailing the steps of this process, we present applications of this approach in the marine domain and in the domain of patent search.

[1]  Barry Bishop,et al.  FactForge: A fast track to the Web of data , 2011, Semantic Web.

[2]  Yannis Tzitzikas,et al.  Scalable, flexible and generic instant overview search , 2012, WWW.

[3]  Wim Vanderbauwhede,et al.  A survey of patent users: an analysis of tasks, behavior, search functionality and system requirements , 2010, IIiX.

[4]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[5]  Philip S. Yu,et al.  Dynamic Load Balancing on Web-Server Systems , 1999, IEEE Internet Comput..

[6]  Yannis Tzitzikas,et al.  Scalable entity-based summarization of web search results using MapReduce , 2014, Distributed and Parallel Databases.

[7]  Oszkar Ambrus Konduit VQB : a Visual Query Builder for SPARQL on the Social Semantic Desktop , 2010 .

[8]  Yannis Tzitzikas,et al.  Interactive Exploration of Fuzzy RDF Knowledge Bases , 2011, ESWC.

[9]  Michelle Q. Wang Baldonado,et al.  SONIA: a service for organizing networked information autonomously , 1998, DL '98.

[10]  Carlo Meghini,et al.  Ostensive Automatic Schema Mapping for Taxonomy-Based Peer-to-Peer Systems , 2003, CIA.

[11]  Felix A. Fischer,et al.  Cooperative Information Agents XI , 2008 .

[12]  Nicolas Spyratos,et al.  Mediators over taxonomy-based information sources , 2005, The VLDB Journal.

[13]  Martin Doerr,et al.  Integrating Heterogeneous and Distributed Information about Marine Species through a Top Level Ontology , 2013, MTSR.

[14]  Sriram Subramanian,et al.  Talking about tactile experiences , 2013, CHI.

[15]  Athman Bouguettaya,et al.  Web Information System Engineering - WISE 2011 - 12th International Conference, Sydney, Australia, October 13-14, 2011. Proceedings , 2011, WISE.

[16]  Mika Käki,et al.  Findex: search result categories help users when document ranking fails , 2005, CHI.

[17]  Ryen W. White,et al.  Supporting exploratory search , 2006 .

[18]  Siegfried Handschuh Konduit VQB: a Visual Query Builder for SPARQL on the Social Semantic Desktop , 2010 .

[19]  Norbert Fuhr,et al.  ezDL: An Interactive Search and Evaluation System , 2012, OSIR@SIGIR.

[20]  Yannis Tzitzikas,et al.  Exploiting Available Memory and Disk for Scalable Instant Overview Search , 2011, WISE.

[21]  Kevin Chen-Chuan Chang,et al.  Beyond pages: supporting efficient, scalable entity search with dual-inversion index , 2010, EDBT '10.

[22]  Yannis Tzitzikas,et al.  On exploiting static and dynamically mined metadata for exploratory web searching , 2011, Knowledge and Information Systems.

[23]  Claudio Carpineto,et al.  Evaluating subtopic retrieval methods: Clustering versus diversification of search results , 2012, Inf. Process. Manag..

[24]  Yannis Tzitzikas,et al.  Exploratory Patent Search with Faceted Search and Configurable Entity Mining , 2013 .

[25]  Allan Hanbury,et al.  Multidisciplinary Information Retrieval , 2011, Lecture Notes in Computer Science.

[26]  W. Pratt,et al.  The usefulness of dynamically categorizing search results. , 2000, Journal of the American Medical Informatics Association : JAMIA.

[27]  François Bry,et al.  Professional Search: Requirements, Prototype and Preliminary Experience Report , 2008 .

[28]  Fabian M. Suchanek,et al.  ESTER: efficient search on text, entities, and relations , 2007, SIGIR.

[29]  Vldb Endowment,et al.  The VLDB journal : the international journal on very large data bases. , 1992 .

[30]  Oren Etzioni,et al.  Web document clustering: a feasibility demonstration , 1998, SIGIR '98.

[31]  Pasquale Pagano,et al.  gCube: A Service-Oriented Application Framework on the Grid , 2008, ERCIM News.

[32]  Fulvio Corno,et al.  Review of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics , 2010 .

[33]  Allan Hanbury,et al.  CLEF-IP 2011: Retrieval in the Intellectual Property Domain , 2011, CLEF.

[34]  Yannis Tzitzikas,et al.  Exploratory Web Searching with Dynamic Taxonomies and Results Clustering , 2009, ECDL.

[35]  Jürgen Umbrich,et al.  Hybrid SPARQL Queries: Fresh vs. Fast Results , 2012, SEMWEB.

[36]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[37]  Yannis Tzitzikas,et al.  Web Searching with Entity Mining at Query Time , 2012, IRFC.

[38]  Nigel Shadbolt,et al.  NITELIGHT: A Graphical Tool for Semantic Query Construction , 2008 .

[39]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[40]  Mika Käki,et al.  Findex: improving search result use through automatic filtering categories , 2005, Interact. Comput..

[41]  Michel Gagnon,et al.  Automatic Semantic Web Annotation of Named Entities , 2011, Canadian Conference on AI.

[42]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[43]  Monica M. C. Schraefel,et al.  A longitudinal study of exploratory and keyword search , 2008, JCDL '08.

[44]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[45]  Ian Horrocks,et al.  Ontology Integration Using Mappings: Towards Getting the Right Logical Consequences , 2009, ESWC.

[46]  Yannis Tzitzikas,et al.  Post-analysis of Keyword-Based Search Results Using Entity Mining, Linked Data, and Link Analysis at Query Time , 2014, 2014 IEEE International Conference on Semantic Computing.

[47]  Giovanni Maria Sacco,et al.  Dynamic Taxonomies and Faceted Search: Theory, Practice, and Experience , 2009, The Information Retrieval Series.

[48]  Kalina Bontcheva,et al.  Evolving GATE to meet new challenges in language engineering , 2004, Natural Language Engineering.

[49]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[50]  Yizhou Sun,et al.  WINACS: construction and analysis of web-based computer science information networks , 2011, SIGMOD '11.

[51]  Matthew Banta,et al.  What do exploratory searchers look at in a faceted search interface? , 2009, JCDL '09.

[52]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[53]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[54]  Jeff Heflin,et al.  The Semantic Web – ISWC 2012 , 2012, Lecture Notes in Computer Science.

[55]  Gottfried Vossen,et al.  Web Information Systems Engineering - WISE 2009, 10th International Conference, Poznan, Poland, October 5-7, 2009. Proceedings , 2009, WISE.

[56]  Allan Hanbury,et al.  A Generalized Framework for Integrated Professional Search Systems , 2013, IRFC.

[57]  Yannis Tzitzikas,et al.  STC+ and NM-STC: Two Novel Online Results Clustering Methods for Web Searching , 2009, WISE.

[58]  Lora Aroyo,et al.  The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, October 23-27, 2011, Proceedings, Part I , 2011, SEMWEB.

[59]  Sébastien Ferré,et al.  Semantic Search: Reconciling Expressive Querying and Exploratory Search , 2011, SEMWEB.

[60]  Enrico Motta,et al.  Impact of Using Relationships between Ontologies to Enhance the Ontology Search Results , 2012, ESWC.

[61]  Enrico Franconi,et al.  Quelo : a NL-based intelligent query interface , 2010 .

[62]  Stuart Macdonald,et al.  User Engagement in Research Data Curation , 2009, ECDL.