Efficient query construction for large scale data

In recent years, a number of open databases have emerged on the Web, providing Web users with platforms to collaboratively create structured information. As these databases are intended to accommodate heterogeneous information and knowledge, they usually comprise a very large schema and billions of instances. Browsing and searching data on such a scale is not an easy task for a Web user. In this context, interactive query construction offers an intuitive interface for novice users to retrieve information from databases neither requiring any knowledge of structured query languages, nor any prior knowledge of the database schema. However, the existing mechanisms do not scale well on large scale datasets. This paper presents a set of techniques to boost the scalability of interactive query construction, from the perspective of both, user interaction cost and performance. We connect an abstract ontology layer to the database schema to shorten the process of user-computer interaction. We also introduce a search mechanism to enable efficient exploration of query interpretation spaces over large scale data. Extensive experiments show that our approach scales well on Freebase - an open database containing more than 7,000 relational tables in more than 100 domains.

[1]  ChengXiang Zhai,et al.  Towards natural question guided search , 2010, WWW '10.

[2]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[3]  Surajit Chaudhuri,et al.  DBXplorer: a system for keyword-based search over relational databases , 2002, Proceedings 18th International Conference on Data Engineering.

[4]  Magesh Jayapandian,et al.  Expressive query specification through form customization , 2008, EDBT '08.

[5]  Vagelis Hristidis,et al.  DISCOVER: Keyword Search in Relational Databases , 2002, VLDB.

[6]  Adriane Chapman,et al.  Making database systems usable , 2007, SIGMOD '07.

[7]  Adam Blum,et al.  Microsoft English Query 7.5: Automatic Extraction of Semantics from Relational Databases and OLAP Cubes , 1999, VLDB.

[8]  Clement T. Yu,et al.  Effective keyword search in relational databases , 2006, SIGMOD Conference.

[9]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[10]  编程语言 Query by Example , 2010, Encyclopedia of Database Systems.

[11]  Wolfgang Nejdl,et al.  A Probabilistic Scheme for Keyword-Based Incremental Query Construction , 2012, IEEE Transactions on Knowledge and Data Engineering.

[12]  Peter Thanisch,et al.  Natural language interfaces to databases – an introduction , 1995, Natural Language Engineering.

[13]  Mukesh K. Mohania,et al.  Retrieval]: Query formulation, search process , 2022 .

[14]  Berthold Reinwald,et al.  Towards keyword-driven analytical processing , 2007, SIGMOD '07.

[15]  Wolfgang Nejdl,et al.  SUITS: Faceted User Interface for Constructing Structured Queries from Keywords , 2009, DASFAA.

[16]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[17]  Xuemin Lin,et al.  SPARK2: Top-k Keyword Query in Relational Databases , 2007, IEEE Transactions on Knowledge and Data Engineering.

[18]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[19]  Sandeep Tata,et al.  SQAK: doing more with keywords , 2008, SIGMOD Conference.

[20]  H. V. Jagadish,et al.  Assisted querying using instant-response interfaces , 2007, SIGMOD '07.

[21]  Luis Gravano,et al.  Efficient IR-Style Keyword Search over Relational Databases , 2003, VLDB.

[22]  Sebastian Rudolph,et al.  Ontology-Based Interpretation of Keywords for Semantic Search , 2007, ISWC/ASWC.

[23]  Gerhard Weikum,et al.  YAGO-QA: Answering Questions by Structured Knowledge Queries , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.