A Hidden Markov Model Approach to Keyword-Based Search over Relational Databases

We present a novel method for translating keyword queries over relational databases into SQL queries with the same intended semantic meaning. In contrast to the majority of the existing keyword-based techniques, our approach does not require any a-priori knowledge of the data instance. It follows a probabilistic approach based on a Hidden Markov Model for computing the top-K best mappings of the query keywords into the database terms, i.e., tables, attributes and values. The mappings are then used to generate the SQL queries that are executed to produce the answer to the keyword query. The method has been implemented into a system called KEYRY (from KEYword to queRY).

[1]  Jeffrey Xu Yu,et al.  Keyword Search in Databases , 2010, Keyword Search in Databases.

[2]  Carl-Erik W. Sundberg,et al.  List Viterbi decoding algorithms with applications , 1994, IEEE Trans. Commun..

[3]  Sandeep Tata,et al.  SQAK: doing more with keywords , 2008, SIGMOD Conference.

[4]  Sonia Bergamaschi,et al.  Keymantic: Semantic Keyword-based Searching in Data Integration Systems , 2010, Proc. VLDB Endow..

[5]  Ravi Kumar,et al.  A Characterization of Online Search Behavior , 2009, IEEE Data Eng. Bull..

[6]  Surajit Chaudhuri,et al.  DBXplorer: a system for keyword-based search over relational databases , 2002, Proceedings 18th International Conference on Data Engineering.

[7]  Vagelis Hristidis,et al.  DISCOVER: Keyword Search in Relational Databases , 2002, VLDB.

[8]  JUSTIN ZOBEL,et al.  Inverted files for text search engines , 2006, CSUR.

[9]  L. Li,et al.  Performance Evaluation of HITS-based Algorithms , 2002, Communications, Internet, and Information Technology.

[10]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[11]  S. Sudarshan,et al.  Enhancing Search with Structure , 2010, IEEE Data Eng. Bull..

[12]  Wolfgang Nejdl,et al.  From keywords to semantic queries - Incremental query construction on the semantic web , 2009, J. Web Semant..

[13]  Sonia Bergamaschi,et al.  Keyword search over relational databases: a metadata approach , 2011, SIGMOD '11.

[14]  Sunita Sarawagi,et al.  Biography and Position Statement. , 2010 .

[15]  S. Sudarshan,et al.  BANKS: Browsing and Keyword Searching in Relational Databases , 2002, VLDB.

[16]  William Webber,et al.  Evaluating the Effectiveness of Keyword Search , 2010, IEEE Data Eng. Bull..