SolveDB: Integrating Optimization Problem Solvers Into SQL Databases

Many real-world decision problems involve solving optimization problems based on data in an SQL database. Traditionally, solving such problems requires combining a DBMS with optimization software packages for each required class of problems (e.g. linear and constraint programming) -- leading to workflows that are cumbersome, complex, inefficient, and error-prone. In this paper, we present SolveDB - a DBMS for optimization applications. SolveDB supports solvers for different problem classes and offers seamless data management and optimization problem solving in a pure SQL-based setting. This allows for much simpler and more effective solutions of database-based optimization problems. SolveDB is based on the 3-level ANSI/SPARC architecture and allows formulating, solving, and analysing solutions of optimization problems using a single so-called solve query. SolveDB provides (1) an SQL-based syntax for optimization problems, (2) an extensible infrastructure for integrating different solvers, and (3) query optimization techniques to achieve the best execution performance and/or result quality. Extensive experiments with the PostgreSQL-based implementation show that SolveDB is a versatile tool offering much higher developer productivity and order of magnitude better performance for specification-complex and data-intensive problems.

[1]  Carlos Ordonez,et al.  Statistical Model Computation with UDFs , 2010, IEEE Transactions on Knowledge and Data Engineering.

[2]  KumarArun,et al.  The MADlib analytics library , 2012, VLDB 2012.

[3]  Paul P. Maglio,et al.  Data is dead... without what-if models , 2011, Proc. VLDB Endow..

[4]  Stanley B. Zdonik,et al.  Searchlight: Enabling Integrated Search and Exploration over Large Multidimensional Data , 2015, Proc. VLDB Endow..

[5]  Jean-Christophe Filliâtre,et al.  A persistent union-find data structure , 2007, ML '07.

[6]  Harvey J. Greenberg,et al.  Views of mathematical programming models and their instances , 1995, Decis. Support Syst..

[7]  Dan Suciu,et al.  Tiresias: the database oracle for how-to queries , 2012, SIGMOD Conference.

[8]  Joobin Choobineh SQLMP: A Data Sublanguage for Representation and Formulation of Linear Mathematical Models , 1991, INFORMS J. Comput..

[9]  Andrew J. Chipperfield,et al.  Simplifying Particle Swarm Optimization , 2010, Appl. Soft Comput..

[10]  Xiaoyang Sean Wang,et al.  Supporting Agile Organizations with a Decision Guidance Query Language , 2012, J. Manag. Inf. Syst..

[11]  A. E. Eiben,et al.  Comparing parameter tuning methods for evolutionary algorithms , 2009, 2009 IEEE Congress on Evolutionary Computation.

[12]  Christopher Ré,et al.  Towards a unified architecture for in-RDBMS analytics , 2012, SIGMOD Conference.

[13]  Roger Alan Pick,et al.  Meta-modeling concepts and tools for model management: a systems approach , 1994 .

[14]  Kun Li,et al.  The MADlib Analytics Library or MAD Skills, the SQL , 2012, Proc. VLDB Endow..

[15]  Clive W. J. Granger,et al.  Short-run forecasts of electricity loads and peaks , 2001 .

[16]  Alexandra Meliou,et al.  Scalable Package Queries in Relational Database Systems , 2015, Proc. VLDB Endow..

[17]  Panagiotis Manolios,et al.  ILP Modulo Data , 2014, 2014 Formal Methods in Computer-Aided Design (FMCAD).

[18]  Akira Kawaguchi,et al.  Linear Programming in Database , 2008 .

[19]  Nesa L'abbe Wu,et al.  Linear programming and extensions , 1981 .

[20]  Todd J. Green,et al.  LogicBlox, Platform and Language: A Tutorial , 2012, Datalog.

[21]  Mohamed F. Mokbel,et al.  CareDB: A context and preference-aware location-based database system , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[22]  Martin W. P. Savelsbergh,et al.  A Relational Modeling System for Linear and Integer Programming , 2000, Oper. Res..

[23]  Tea Tusar,et al.  Evolutionary scheduling of flexible offers for balancing electricity supply and demand , 2012, 2012 IEEE Congress on Evolutionary Computation.

[24]  Jonathan Goldstein,et al.  Optimizing queries using materialized views: a practical, scalable solution , 2001, SIGMOD '01.

[25]  Wolfgang Lehner,et al.  Towards Integrated Data Analytics: Time Series Forecasting in DBMS , 2012, Datenbank-Spektrum.