Keyword search is a popular technique for retrieving information from the ever growing repositories of RDF graph data on the Web. However, keyword queries are inherently ambiguous, resulting in an overwhelming number of candidate results. These results correspond to different interpretations of the query. Most of the current keyword search approaches ignore the diversity of the result interpretations and might fail to provide a broad overview of the query aspects to the users who are interested in exploratory search. To address this issue, we introduce in this paper, a novel technique for diversifying keyword search results on RDF graph data. We generate pattern graphs which are structured queries corresponding to alternative interpretations of the given keyword query. We model the problem as an optimization problem aiming at selecting a set of k pattern graphs with maximum diversity. We devise a metric to estimate the diversity of a set of pattern graphs, and we design an algorithm that employs a greedy heuristic to generate a diverse list of k pattern graphs for a given keyword query.
[1]
Eamonn J. Keogh,et al.
Diversifying query results on semi-structured data
,
2012,
CIKM '12.
[2]
Haofen Wang,et al.
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-Shaped (RDF) Data
,
2009,
2009 IEEE 25th International Conference on Data Engineering.
[3]
Peter Fankhauser,et al.
DivQ: diversification for keyword search over structured databases
,
2010,
SIGIR.
[4]
Sreenivas Gollapudi,et al.
Diversifying search results
,
2009,
WSDM '09.
[5]
Dimitri Theodoratos,et al.
Exploiting Semantic Result Clustering to Support Keyword Search on Linked Data
,
2014,
WISE.
[6]
Dimitri Theodoratos,et al.
Keyword Pattern Graph Relaxation for Selective Result Space Expansion on Linked Data
,
2015,
ICWE.
[7]
Yehoshua Sagiv,et al.
Exploratory keyword search on data graphs
,
2010,
SIGMOD Conference.
[8]
Evaggelia Pitoura,et al.
Search result diversification
,
2010,
SGMD.