Query Optimization Strategies in Similarity-Based Databases

We deal with algorithmic aspects and implementation issues of query execution in relational similarity-based databases. We are concerned with a generalized relational model of data in which queries can be matched to degrees taken from scales represented by complete residuated lattices. The main contribution of this paper are optimization techniques for efficient evaluation of queries involving similarity-based restrictions. In addition, we present experimental evaluation of the proposed techniques showing their efficiency compared to naive approaches.

[1]  Henri Prade,et al.  Generalizing Database Relational Algebra for the Treatment of Incomplete/Uncertain Information and Vague Queries , 1984, Inf. Sci..

[2]  Vilém Vychodil,et al.  Data Tables with Similarity Relations: Functional Dependencies, Complete Rules and Non-redundant Bases , 2006, DASFAA.

[3]  Radim Bělohlávek,et al.  Fuzzy Relational Systems: Foundations and Principles , 2002 .

[4]  Vilém Vychodil,et al.  Foundations of relational similarity-based query language RESIQL , 2013, 2013 IEEE Symposium on Foundations of Computational Intelligence (FOCI).

[5]  Ronald Fagin,et al.  Combining Fuzzy Information from Multiple Systems , 1999, J. Comput. Syst. Sci..

[6]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[7]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[8]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[9]  Rahul Gupta,et al.  Creating probabilistic databases from information extraction models , 2006, VLDB.

[10]  Petr Hájek,et al.  Metamathematics of Fuzzy Logic , 1998, Trends in Logic.

[11]  Vilém Vychodil,et al.  Relational Algebra for Ranked Tables with Similarities: Properties and Implementation , 2007, IDA.

[12]  Ihab F. Ilyas,et al.  A survey of top-k query processing techniques in relational database systems , 2008, CSUR.

[13]  Christopher Ré,et al.  Probabilistic databases: diamonds in the dirt , 2009, CACM.

[14]  Vilém Vychodil,et al.  Query systems in similarity-based databases: logical foundations, expressive power, and completeness , 2010, SAC '10.

[15]  Dan Suciu,et al.  Efficient query evaluation on probabilistic databases , 2004, The VLDB Journal.

[16]  Dan Suciu,et al.  Management of probabilistic data: foundations and challenges , 2007, PODS '07.

[17]  C. J. Date Database in depth - relational theory for practitioners , 2005 .

[18]  B. Buckles,et al.  A fuzzy representation of data for relational databases , 1982 .

[19]  Petr Hájek,et al.  Fss 5598 Modc+(sc) Triangular Norm Based Predicate Fuzzy Logics , 2022 .

[20]  Siegfried Gottwald,et al.  Mathematical Fuzzy Logics , 2008, Bulletin of Symbolic Logic.

[21]  J. A. Goguen,et al.  The logic of inexact concepts , 1969, Synthese.

[22]  C. J. Date,et al.  Databases, Types and the Relational Model (3rd Edition) , 2006 .

[23]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[24]  Kevin Chen-Chuan Chang,et al.  RankSQL: query algebra and optimization for relational top-k queries , 2005, SIGMOD '05.

[25]  Lluis Godo,et al.  Monoidal t-norm based logic: towards a logic for left-continuous t-norms , 2001, Fuzzy Sets Syst..

[26]  R. Belohlávek Fuzzy Relational Systems: Foundations and Principles , 2002 .

[27]  C. J. Date,et al.  Databases, types and the relational model : the third manifesto , 2007 .