Big Data and New Data Warehousing Approaches

Big data are a data trend present around us mainly through Internet -- social networks and smart devices and meters -- mostly without us being aware of them. Also they are a fact that both industry and scientific research needs to deal with. They are interesting from analytical point of view, for they contain knowledge that cannot be ignored and left unused. Traditional system that supports the advanced analytics and knowledge extraction -- data warehouse -- is not able to cope with large amounts of fast incoming various and unstructured data, and may be facing a paradigm shift in terms of utilized concepts, technologies and methodologies, which have become a very active research area in the last few years. This paper provides an overview of research trends important for the big data warehousing, concepts and technologies used for data storage and (ETL) processing, and research approaches done in attempts to empower traditional data warehouses for handling big data.

[1]  Hong Min,et al.  Octopus: Hybrid Big Data Integration Engine , 2015, 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom).

[2]  Jerzy Duda Business intelligence and NoSQL databases , 2012 .

[3]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[4]  Abhishek Sharma,et al.  Augmenting Data Warehouses with Big Data , 2015, Inf. Syst. Manag..

[5]  Aleksey Bondarev,et al.  Data warehouse on Hadoop platform for decision support systems in education , 2015, 2015 Twelve International Conference on Electronics Computer and Computation (ICECCO).

[6]  Omar Boussaïd,et al.  Columnar NoSQL CUBE: Agregation operator for columnar NoSQL data warehouse , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[7]  Li-Yan Yuan,et al.  Rubato DB: A Highly Scalable Staged Grid Database System for OLTP and Big Data Applications , 2014, CIKM.

[8]  Liang Dong,et al.  Starfish: A Self-tuning System for Big Data Analytics , 2011, CIDR.

[9]  Max Chevalier,et al.  Document-oriented data warehouses: Models and extended cuboids, extended cuboids in oriented document , 2016, 2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS).

[10]  Zheng Shao,et al.  Hive - a petabyte scale data warehouse using Hadoop , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[11]  Khaled Dehdouh Building OLAP Cubes from Columnar NoSQL Data Warehouses , 2016, MEDI.

[12]  Abraham Silberschatz,et al.  HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads , 2009, Proc. VLDB Endow..

[13]  Alberto Abelló,et al.  Building cubes with MapReduce , 2011, DOLAP '11.