Query processing over data warehouse using relational databases and NoSQL

Data warehouse (DW) is an important component of Business Intelligence used to support strategic decision making. DW is a subject-oriented, nonvolatile, historical and massive database, which the processing of analytical queries, results in high response times. There known techniques for improving the performance processing of queries on DW. Among them is the use of data fragmentation, materialized views and indices. In addition, the NoSQL is an emerging technology whose main characteristics are improved query processing and data storage, and an alternative to relational databases. In this paper we investigate and compare the implementation of DW using relational databases and NoSQL, considering the Star Schema Benchmark. The results showed that the column-oriented model of the software FastBit showed a better performance, with gains of 25.4% to 99.4% if compared to other models NoSQL and the relational model, in the processing of queries on DW.

[1]  Zheng Guansheng,et al.  Application of NoSQL Database in Web Crawling , 2011 .

[2]  Qing He,et al.  Research on Business Intelligence in enterprise computing environment , 2007, 2007 IEEE International Conference on Systems, Man and Cybernetics.

[3]  Prabhat,et al.  FastBit: interactively searching massive data , 2009 .

[4]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[5]  Ladjel Bellatreche,et al.  Yet Another Algorithms for Selecting Bitmap Join Indexes , 2010, DaWak.

[6]  Matteo Golfarelli,et al.  Applying Vertical Fragmentation Techniques in Logical Design of Multidimensional Databases , 2000, DaWaK.

[7]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[8]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[9]  Goetz Graefe,et al.  Multi-table joins through bitmapped join indices , 1995, SGMD.

[10]  Anne Laurent,et al.  Reduce, You Say: What NoSQL Can Do for Data Aggregation and BI in Large Repositories , 2011, 2011 22nd International Workshop on Database and Expert Systems Applications.

[11]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[12]  Panos Vassiliadis,et al.  View usability and safety for the answering of top-k queries via materialized views , 2009, DOLAP.

[13]  Xuedong Chen,et al.  The Star Schema Benchmark and Augmented Fact Table Indexing , 2009, TPCTC.

[14]  A. Carniel,et al.  An OLAP Tool Based on the Bitmap Join Index , 2011 .

[15]  Yi Chen,et al.  Efficient and scalable data evolution with column oriented databases , 2011, EDBT/ICDT '11.

[16]  Thiago Luís Lopes Siqueira,et al.  The SB-index and the HSB-index: efficient indices for spatial data warehouses , 2011, GeoInformatica.

[17]  Kesheng Wu,et al.  Bitmap Indices for Data Warehouses , 2006 .