A General and Efficient Approach for Solving Nearest Neighbor Problem in the Vague Query System

This article presents a general and efficient approach for finding the best match for complex vague queries in the Vague Query System (VQS) [16]. The VQS is an extension to conventional database systems and can operate on top of them in order to facilitate vague retrieval capabilities. The VQS's key is Numeric-Coordinate-Representation-Tables (NCR-Tables), which store semantic background information of attributes. Concretely, attributes of arbitrary types in a query relation/view are mapped to the Euclidean space and kept by NCR-Tables. Answering a complex vague query requires parallel searching on some NCR-Tables, which usually contain multidimensional data. In [17] Kueng et al proposed an incremental hyper-cube approach for solving complex vague queries, however, this approach has weaknesses lead to degenerate the search performance of the VQS. Theoretical analyses and experimental results in this article will prove that our new approach defeats all these defects and makes the VQS a full-fledged flexible query answering system.

[1]  Josef Küng,et al.  Vague Queries in Tourism Information Systems , 2000 .

[2]  Thomas S. Huang,et al.  Supporting similarity queries in MARS , 1997, MULTIMEDIA '97.

[3]  Luis Gravano,et al.  Evaluating Top-k Selection Queries , 1999, VLDB.

[4]  Josef Küng,et al.  VQS-a vague query system prototype , 1997, Database and Expert Systems Applications. 8th International Conference, DEXA '97. Proceedings.

[5]  Hanan Samet,et al.  Ranking in Spatial Databases , 1995, SSD.

[6]  Ronald Fagin,et al.  Combining Fuzzy Information from Multiple Systems , 1999, J. Comput. Syst. Sci..

[7]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[8]  Hans-Jörg Schek,et al.  Fast Evaluation Techniques for Complex Similarity Queries , 2001, VLDB.

[9]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[10]  Josef Küng,et al.  An Incremental Hypercube Approach for Finding Best Matches for Vague Queries , 1999, DEXA.

[11]  Masahito Hirakawa,et al.  ARES: A relational database with the capability of performing flexible interpretation of queries , 1986, IEEE Transactions on Software Engineering.

[12]  Laura M. Haas,et al.  Using Fagin's algorithm for merging ranked results in multimedia middleware , 1999, Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No.PR00384).

[13]  Hans-Peter Kriegel,et al.  Efficient User-Adaptable Similarity Search in Large Multimedia Databases , 1997, VLDB.

[14]  Christian Böhm,et al.  A cost model for nearest neighbor search in high-dimensional data space , 1997, PODS.

[15]  Nick Roussopoulos,et al.  Nearest neighbor queries , 1995, SIGMOD '95.

[16]  Luis Gravano,et al.  Optimizing queries over multimedia repositories , 1996, SIGMOD 1996.

[17]  Tran Khanh Dang,et al.  The SH-tree: A Super Hybrid Index Structure for Multidimensional Data , 2001, DEXA.

[18]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[19]  Surya Nepal,et al.  Query processing issues in image (multimedia) databases , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[20]  Amihai Motro,et al.  VAGUE: a user interface to relational databases that permits vague queries , 1988, TOIS.

[21]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[22]  Ronald Fagin,et al.  Combining fuzzy information from multiple systems (extended abstract) , 1996, PODS.

[23]  Hans-Peter Kriegel,et al.  S3: similarity search in CAD database systems , 1997, SIGMOD '97.