Effective data warehouse for information delivery: a literature survey and classification

Data warehouses play an important role in strategic decision making for complex business solutions. To gain competitive advantage, business executives increasingly rely on data warehouse concepts, which play a vital role in analysing past and current scenarios and predicting future trends. We survey the techniques used to build data warehouses and the methods used to implement them. Based on an in-depth survey of the literature published in established international journals, we propose a framework that will help researchers focus on specific and emerging areas of data warehouse development, as well as on applications of data warehouses in various business domains.
