Querying Heterogeneous Datasets on the Linked Data Web: Challenges, Approaches, and Trends

The growing number of datasets published on the Web as linked data brings both opportunities for high data availability and challenges inherent to querying data in a semantically heterogeneous and distributed environment. Approaches used for querying siloed databases fail at Web-scale because users don't have an a priori understanding of all the available datasets. This article investigates the main challenges in constructing a query and search solution for linked data and analyzes existing approaches and trends.

[1]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[2]  Jennifer Chu-Carroll,et al.  Building Watson: An Overview of the DeepQA Project , 2010, AI Mag..

[3]  Enrico Motta,et al.  PowerAqua: Fishing the Semantic Web , 2006, ESWC.

[4]  Seán O'Riain,et al.  Querying Linked Data Using Semantic Relatedness: A Vocabulary Independent Approach , 2011, NLDB.

[5]  Jürgen Umbrich,et al.  Searching and browsing Linked Data with SWSE: The Semantic Web Search Engine , 2011, J. Web Semant..

[6]  Giovanni Tummarello,et al.  Searching web data: An entity retrieval and high-performance indexing model , 2012, J. Web Semant..

[7]  Hamish Cunningham,et al.  FREyA: An Interactive Way of Querying Linked Data Using Natural Language , 2011, ESWC Workshops.

[8]  David Maier,et al.  From databases to dataspaces: a new abstraction for information management , 2005, SGMD.

[9]  Pat Helland If you have too much data, then 'good enough' is good enough , 2011, CACM.

[10]  Haofen Wang,et al.  Semplore: A scalable IR approach to search the Web of Data , 2009, J. Web Semant..

[11]  Alon Y. Halevy,et al.  Indexing dataspaces , 2007, SIGMOD '07.

[12]  Abraham Bernstein,et al.  Evaluating the usability of natural language query languages and interfaces to Semantic Web knowledge bases , 2010, J. Web Semant..

[13]  Edward Curry,et al.  A Multidimensional Semantic Space for Data Model Independent Queries over RDF Data , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.