A service-oriented search framework for full text, geospatial and semantic search

Over the last decade, a growing importance of search engines could be observed. An increasing amount of knowledge is exposed and connected within the Linked Open Data Cloud, which raises users' expectations to be able to search for any information that is directly or indirectly contained. However, diverse data types require tailored search functionalities---such as semantic, geospatial and full text search. Hence, using only one data management system will not provide the required functionality at the expected level. In this paper, we will describe search services that provide specific search functionality via a generalized interface inspired by RDF. In addition, we introduce an application layer on top of these services that enables to query them in a unified way. This allows for the implementation of a distributed search that leverages the identification of the optimal search service for each query and subquery. This is achieved by connecting powerful tools like Openlink Virtuoso, ElasticSearch and PostGIS within a single framework.

[1]  Gerhard Weikum,et al.  PATTY: A Taxonomy of Relational Patterns with Semantic Types , 2012, EMNLP.

[2]  Otis Gospodnetic,et al.  Lucene in Action , 2004 .

[3]  Rudolf Schmid,et al.  Organization for the advancement of structured information standards , 2002 .

[4]  Regina O. Obe,et al.  PostGIS in Action , 2011 .

[5]  Claudio Carpineto,et al.  Optimal meta search results clustering , 2010, SIGIR.

[6]  Latifur Khan,et al.  Implementation and performance evaluation of a scheduling algorithm for divisible load parallel applications in a cloud computing environment , 2015, Softw. Pract. Exp..

[7]  Antonio Gulli,et al.  Building an open source meta-search engine , 2005, WWW '05.

[8]  John Domingue,et al.  A Linked Data-driven & Service-oriented Architecture for Sharing Educational Resources , 2011, Linked Learning@ESWC.

[9]  Andriy Nikolov,et al.  FedSearch: Efficiently Combining Structured Queries and Full-Text Search in a SPARQL Federation , 2013, International Semantic Web Conference.

[10]  Seán O'Riain,et al.  Querying Linked Data Using Semantic Relatedness: A Vocabulary Independent Approach , 2011, NLDB.

[11]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[12]  Jordi Vilaplana,et al.  SLA-Aware Load Balancing in a Web-Based Cloud System over OpenStack , 2013, ICSOC Workshops.

[13]  Bruce Momjian,et al.  PostgreSQL: Introduction and Concepts , 2000 .

[14]  Michael Bell,et al.  SOA Modeling Patterns for Service-Oriented Discovery and Analysis , 2009 .

[15]  Philipp Cimiano,et al.  Pythia: Compositional Meaning Construction for Ontology-Based Question Answering on the Semantic Web , 2011, NLDB.

[16]  Sören Auer,et al.  AGDISTIS - Agnostic Disambiguation of Named Entities Using Linked Open Data , 2014, ECAI.

[17]  Sam Ruby,et al.  RESTful Web Services , 2007 .

[18]  Roberto De Virgilio,et al.  A similarity measure for approximate querying over RDF data , 2013, EDBT '13.

[19]  D. Gerber,et al.  Bootstrapping the Linked Data Web , 2011 .

[20]  Donald Ervin Knuth,et al.  The Art of Computer Programming, Volume II: Seminumerical Algorithms , 1970 .

[21]  Elena Cabrio,et al.  Multilingual Question Answering over Linked Data (QALD-3): Lab Overview , 2013, CLEF.

[22]  Ricardo Usbeck,et al.  Combining Linked Data and Statistical Information Retrieval - Next Generation Information Systems , 2014, ESWC.

[23]  Francis G. McCabe,et al.  Reference Model for Service Oriented Architecture 1.0 , 2006 .

[24]  Aaron E. Walsh UDDI, SOAP, and WSDL: The Web Services Specification Reference Book , 2002 .

[25]  Urban Liebel,et al.  ??Harvester??: a fast meta search engine of human protein resources , 2004, Bioinform..

[26]  Enrico Motta,et al.  PowerAqua: Fishing the Semantic Web , 2006, ESWC.

[27]  Sören Auer,et al.  Question answering on interlinked data , 2013, WWW.

[28]  Jun Zhao,et al.  CASIA@QALD-3: A Question Answering System over Linked Data , 2013, CLEF.

[29]  Haofen Wang,et al.  Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-Shaped (RDF) Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[30]  Apinan Aurasopon,et al.  Effective Scheduling Algorithm and Scheduler Implementation for use with Time-Triggered Co-operative Architecture , 2014 .

[31]  Enrico Motta,et al.  AquaLog: An ontology-driven question answering system for organizational semantic intranets , 2007, J. Web Semant..

[32]  Thomas Erl,et al.  SOA Principles of Service Design , 2007 .

[33]  Orri Erling,et al.  RDF Support in the Virtuoso DBMS , 2007, CSSW.