A SAT approach to query optimization in mediator systems

Mediator systems integrate distributed, heterogeneous and autonomous data sources, but their effective use requires the solution of hard query optimization problems. This is usually done in two phases: the selection of a set of data sources is similar to a set covering problem, and their ordering into a feasible and efficient query is a capability restricted join order problem. However, a two-phase approach is unlikely to find optimum queries. We describe a new single-phase approach that, under a simple cost model, can be encoded and solved as a SAT problem. Results on artificial benchmarks indicate that this is an interesting problem from the encoding and search viewpoints, and we use them to address three of the ten SAT challenges posed by Selman, Kautz and McAllester in 1997.

[1]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[2]  Steven David Prestwich,et al.  SAT problems with chains of dependent variables , 2003, Discret. Appl. Math..

[3]  Bart Selman,et al.  Accelerating Random Walks , 2002, CP.

[4]  Steven Prestwich,et al.  Randomised Backtracking for Linear Pseudo-Boolean Constraint Problems , 2002 .

[5]  Edward A. Hirsch,et al.  UnitWalk: A new SAT solver that uses local search guided by unit clause elimination , 2005, Annals of Mathematics and Artificial Intelligence.

[6]  Hans van Maaren,et al.  Solving satisfiability problems using elliptic approximations - effective branching rules , 2000, Discret. Appl. Math..

[7]  Bart Selman,et al.  Ten Challenges in Propositional Reasoning and Search , 1997, IJCAI.

[8]  Matthew W. Moskewicz,et al.  Cha : Engineering an e cient SAT solver , 2001, DAC 2001.

[9]  Anand Rajaraman,et al.  Answering queries using templates with binding patterns (extended abstract) , 1995, PODS.

[10]  R. Ramakrishnan,et al.  An Optimizer for Heterogeneous Systems with NonStandard Data and Search Capabilities , 1996 .

[11]  Michel Minoux,et al.  Graphs and Algorithms , 1984 .

[12]  Ian P. Gent,et al.  Symmetry breaking during search in constraint programming , 1999 .

[13]  Toby Walsh,et al.  Local Search and the Number of Solutions , 1996, CP.

[14]  Benjamin W. Wah,et al.  A Discrete Lagrangian-Based Global-Search Method for Solving Satisfiability Problems , 1996, J. Glob. Optim..

[15]  Steven David Prestwich A Hybrid Search Architecture Applied to Hard Random 3-SAT and Low-Autocorrelation Binary Sequences , 2000, CP.

[16]  Jean-François Puget,et al.  On the Satisfiability of Symmetrical Constrained Satisfaction Problems , 1993, ISMIS.

[17]  S. Prestwich,et al.  A SAT Approach to Query Optimization in Mediator Systems , 2005 .

[18]  P. Barth A Davis-Putnam based enumeration algorithm for linear pseudo-Boolean optimization , 1995 .

[19]  Eugene Goldberg,et al.  BerkMin: A Fast and Robust Sat-Solver , 2002, Discret. Appl. Math..

[20]  Sharad Malik,et al.  Chaff: engineering an efficient SAT solver , 2001, Proceedings of the 38th Design Automation Conference (IEEE Cat. No.01CH37232).

[21]  Bart Selman,et al.  Noise Strategies for Improving Local Search , 1994, AAAI.

[22]  Matthew L. Ginsberg,et al.  Dynamic Backtracking , 1993, J. Artif. Intell. Res..

[23]  Jennifer Widom,et al.  The TSIMMIS Approach to Mediation: Data Models and Languages , 1997, Journal of Intelligent Information Systems.

[24]  Jeffrey D. Ullman,et al.  Optimizing Large Join Queries in Mediation Systems , 1999, ICDT.

[25]  Henry Kautz,et al.  Pushing the envelope: planning , 1996 .

[26]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[27]  Henry Kautz,et al.  Exploiting Variable Dependency in Local Search , 1997, IJCAI 1997.

[28]  Avrim Blum,et al.  Fast Planning Through Planning Graph Analysis , 1995, IJCAI.

[29]  Philippe Bonnet,et al.  The distributed information search component (Disco) and the World Wide Web , 1997, SIGMOD '97.

[30]  Laura M. Haas,et al.  Capabilities-Based Query Rewriting in Mediator Systems , 2004, Distributed and Parallel Databases.

[31]  Jun Yang,et al.  An Optimizer for Heterogeneous Systems with NonStandard Data and Search Capabilities. , 1996 .

[32]  Rolf Backofen,et al.  Excluding Symmetries in Constraint-Based Search , 1999, CP.

[33]  Stéphane Bressan,et al.  Context Knowledge Representation and Reasoning in the Context Interchange System , 2015, Applied Intelligence.

[34]  Bart Selman,et al.  Pushing the Envelope: Planning, Propositional Logic and Stochastic Search , 1996, AAAI/IAAI, Vol. 2.

[35]  James M. Crawford,et al.  Symmetry-Breaking Predicates for Search Problems , 1996, KR.

[36]  Steven David Prestwich Negative Effects of Modeling Techniques on Search Performance , 2003, Ann. Oper. Res..

[37]  Matthias F. Stallmann,et al.  The Role of a Skeptic Agent in Testing and Benchmarking of SAT Algorithms , 2002 .

[38]  Makoto Yokoo,et al.  Why Adding More Constraints Makes a Problem Easier for Hill-climbing Algorithms: Analyzing Landscapes of CSPs , 1997, CP.

[39]  Bart Selman,et al.  Planning as Satisfiability , 1992, ECAI.

[40]  Steven David Prestwich First-Solution Search with Symmetry Breaking and Implied Constraints , 2001 .

[41]  Matteo Fischetti,et al.  A Heuristic Method for the Set Covering Problem , 1999, Oper. Res..