Efficient Top-k Keyword Search Over Multidimensional Databases

Keyword search over databases has recently received significant attention. Many solutions and prototypes have been developed. However, due to large memory consumption requirements and unpredictable running time, most of them cannot be applied directly to the situations where memory is limited and quick response is required, such as when performing keyword search over multidimensional databases in mobile devices as part of the OLAP functionalities. In this paper, the authors attack the keyword search problem from a new perspective, and propose a cascading top-k keyword search algorithm, which generates supernodes by a branch and bound method in each step of search instead of computing the Steiner trees as done in many existing approaches. This new algorithm consumes less memory and significantly reduces the response time. Experiments show that the method can achieve high search efficiency compared with the state-of-the-art approaches.

[1]  S. Sudarshan,et al.  BANKS: Browsing and Keyword Searching in Relational Databases , 2002, VLDB.

[2]  Divesh Srivastava,et al.  Processing top-k join queries , 2010, Proc. VLDB Endow..

[3]  Mohamed A. Sharaf,et al.  Semantic-based delivery of OLAP summary tables in wireless environments , 2002, CIKM '02.

[4]  S. Sudarshan,et al.  Keyword search on external memory data graphs , 2008, Proc. VLDB Endow..

[5]  Christos Doulkeridis,et al.  Reverse top-k queries , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[6]  Bin Liu,et al.  Distributed caching of multi-dimensional data in mobile environments , 2005, MDM '05.

[7]  Sartaj Sahni,et al.  Data Structures, Algorithms, and Applications in C++ , 1997 .

[8]  Andreas S. Maniatis The Case for Mobile OLAP , 2004, EDBT Workshops.

[9]  Jeffrey F. Naughton,et al.  Toward scalable keyword search over relational data , 2010, Proc. VLDB Endow..

[10]  Yin Yang,et al.  Reachability Indexes for Relational Keyword Search , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[11]  Alfredo Cuzzocrea Accuracy Control in Compressed Multidimensional Data Cubes for Quality of Answer-based OLAP Tools , 2006, 18th International Conference on Scientific and Statistical Database Management (SSDBM'06).

[12]  Alfredo Cuzzocrea,et al.  Enabling OLAP in mobile environments via intelligent data cube compression techniques , 2008, Journal of Intelligent Information Systems.

[13]  Anthony K. H. Tung,et al.  A graph method for keyword-based selection of the top-K databases , 2008, SIGMOD Conference.

[14]  Xuemin Lin,et al.  SPARK2: Top-k Keyword Query in Relational Databases , 2007, IEEE Transactions on Knowledge and Data Engineering.

[15]  Surajit Chaudhuri,et al.  DBXplorer: a system for keyword-based search over relational databases , 2002, Proceedings 18th International Conference on Data Engineering.

[16]  Anthony K. H. Tung,et al.  Effective keyword-based selection of relational databases , 2007, SIGMOD '07.

[17]  Alfredo Cuzzocrea Providing probabilistically-bounded approximate answers to non-holistic aggregate range queries in OLAP , 2005, DOLAP '05.

[18]  Philip S. Yu,et al.  BLINKS: ranked keyword searches on graphs , 2007, SIGMOD '07.

[19]  Guoliang Li,et al.  Efficient type-ahead search on relational data: a TASTIER approach , 2009, SIGMOD Conference.

[20]  Jianyong Wang,et al.  Progressive Keyword Search in Relational Databases , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[21]  Jeffrey Xu Yu,et al.  Ten thousand SQLs , 2010, Proc. VLDB Endow..

[22]  Luis Gravano,et al.  Efficient Keyword Search Across Heterogeneous Relational Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[23]  S. Sudarshan,et al.  Keyword searching and browsing in databases using BANKS , 2002, Proceedings 18th International Conference on Data Engineering.

[24]  Yehoshua Sagiv,et al.  Keyword proximity search in complex data graphs , 2008, SIGMOD Conference.

[25]  Alfredo Cuzzocrea,et al.  Delivering Semantics-aware Compressed OLAP Views in Mobile Environments with Hand-OLAP , 2009, 2009 21st IEEE International Conference on Tools with Artificial Intelligence.

[26]  Haofen Wang,et al.  Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-Shaped (RDF) Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[27]  S. Sudarshan,et al.  Bidirectional Expansion For Keyword Search on Graph Databases , 2005, VLDB.

[28]  Alfredo Cuzzocrea,et al.  Hand-OLAP: a system for delivering OLAP services on handheld devices , 2003, The Sixth International Symposium on Autonomous Decentralized Systems, 2003. ISADS 2003..

[29]  Junghoo Cho,et al.  The Hybrid-Layer Index: A synergic approach to answering top-k queries in arbitrary subspaces , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[30]  Christos Doulkeridis,et al.  Identifying the most influential data objects with reverse top-k queries , 2010, Proc. VLDB Endow..

[31]  Vagelis Hristidis,et al.  DISCOVER: Keyword Search in Relational Databases , 2002, VLDB.