Fast processing of join queries with instant response

This research presents an innovative way to process queries without having to perform expensive join and set operations. We propose to store the equi-join relationships of tuples on mass storage devices, such as disks, to facilitate query processing. The equi-join relationships are captured, grouped, and stored as various tables on disks, which are collectively called the Join Core. Queries involving arbitrary legitimate sequences of equi-joins, semi-joins, outer-joins, anti-joins, unions, differences, and intersections can all be answered quickly by merely merging these tables. Without having to perform joins, memory consumptions are dramatically reduced. The Join Core can also be updated dynamically. Preliminary experimental results showed that all test queries began to generate results instantly, and many completed instantly too. The proposed methodology can be very useful for queries with complex joins of large relations, and can be even more advantageous to distributed query processing, as there are fewer or even no relations or intermediate results needed to be retrieved, generated or transferred over the networks.

[1]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[2]  Jonathan Goldstein,et al.  Optimizing queries using materialized views: a practical, scalable solution , 2001, SIGMOD '01.

[3]  David J. DeWitt,et al.  Multiprocessor Hash-Based Join Algorithms , 1985, VLDB.

[4]  Jian Yang,et al.  Algorithms for Materialized View Design in Data Warehousing Environment , 1997, VLDB.

[5]  M. W. Blasgen,et al.  Storage and Access in Relational Data Bases , 1977, IBM Syst. J..

[6]  Bela Stantic,et al.  Simulated Annealing for Materialized View Selection in Data Warehousing Environment , 2006, Databases and Applications.

[7]  Marcin Zukowski,et al.  Super-Scalar RAM-CPU Cache Compression , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[8]  Howard J. Karloff,et al.  On the complexity of the view-selection problem , 1999, PODS '99.

[9]  Jalel Akaichi,et al.  A Scalable Algorithm for Answering Top-K Queries Using Cached Views , 2015, FQAS.

[10]  Surajit Chaudhuri,et al.  Automated Selection of Materialized Views and Indexes in SQL Databases , 2000, VLDB.

[11]  Dan Suciu,et al.  From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System , 2015, SIGMOD Conference.

[12]  Daniel J. Abadi,et al.  Integrating compression and execution in column-oriented database systems , 2006, SIGMOD Conference.

[13]  Hidehiko Tanaka,et al.  Application of hash to data base machine and its architecture , 1983, New Generation Computing.

[14]  Bingsheng He,et al.  Relational joins on graphics processors , 2008, SIGMOD Conference.