Requirements-driven data warehouse design based on enhanced pivot tables

The design of data warehouses (DWs) is based on both their data sources and users’ requirements. The more closely the DW multidimensional schema reflects the stakeholders’ needs, the more effectively they will make use of the DW content for their OLAP analyses. Thus, considerable attention has been given in the literature to DW requirements analysis, including requirements elicitation, specification and validation. Unfortunately, traditional approaches are based on complex formalisms that cannot be used with decision makers who have no previous experience with DWs and OLAP. This forces a sharp separation between elicitation and specification. To cope with this problem, we propose a new requirements analysis process where pivot tables, a well-known representation for multidimensional data often used by decision makers, are enhanced to be used both for elicitation and as a specification formalism. A pivot table is a two-dimensional spreadsheet that supports the analyses of multidimensional data by nesting several dimensions on the x - or y -axis and displaying data on multiple pages. The requirements analysis process we propose is iterative and relies on both unstructured and structured interviews; particular attention is given to enable the design of irregular multidimensional schemata, which are often present in real-world DWs but can hardly be understood by unskilled users. Finally, we validate our proposal using a real case study in the biodiversity domain.

[1]  Olivier Teste,et al.  Algebraic and Graphic Languages for OLAP Manipulations , 2008, Int. J. Data Warehous. Min..

[2]  Klaus Pohl,et al.  Requirements Engineering - Fundamentals, Principles, and Techniques , 2010 .

[3]  Esteban Zimányi,et al.  Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications , 2010 .

[4]  Nenad Jukic,et al.  A Framework for Collecting and Defining Requirements for Data Warehousing Projects , 2010, J. Comput. Inf. Technol..

[5]  Beni Suranto Software prototypes: Enhancing the quality of requirements engineering process , 2015, 2015 International Symposium on Technology Management and Emerging Technologies (ISTMET).

[6]  Manoj Kumar,et al.  Stakeholders Driven Requirements Engineering Approach for Data Warehouse Development , 2010, J. Inf. Process. Syst..

[7]  Lei-da Chen,et al.  Measuring user satisfaction with data warehouses: an exploratory study , 2000, Inf. Manag..

[8]  Matteo Golfarelli,et al.  Data Warehouse Testing , 2011, Int. J. Data Warehous. Min..

[9]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[10]  Francesco Di Tria,et al.  Hybrid methodology for data warehouse conceptual design by UML schemas , 2012, Inf. Softw. Technol..

[11]  John C. Grundy,et al.  Generating essential user interface prototypes to validate requirements , 2011, 2011 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011).

[12]  Fausto Giunchiglia,et al.  Tropos: An Agent-Oriented Software Development Methodology , 2004, Autonomous Agents and Multi-Agent Systems.

[13]  Guy Camilleri,et al.  A Volunteer Design Methodology of Data Warehouses , 2018, ER.

[14]  Peter Thanisch,et al.  Constructing OLAP cubes based on queries , 2001, DOLAP '01.

[15]  Paolo Giorgini,et al.  GRAnD: A goal-oriented approach to requirement analysis in data warehouses , 2008, Decis. Support Syst..

[16]  Camille Salinesi,et al.  A Requirement-driven Approach for Designing Data Warehouses , 2006 .

[17]  Mark I. Hwang,et al.  The Effect of Implementation Factors on Data Warehousing Success: An Exploratory Study , 2007 .

[18]  Sandro Bimonte,et al.  A New Methodology for Elicitation of DataWarehouse Requirements based on the Pivot Table Formalism , 2018, EDA.

[19]  Samira Si-Said Cherfi,et al.  Multidimensional Schemas Quality Assessment , 2003, CAiSE Workshops.

[20]  Gottfried Vossen,et al.  Multidimensional normal forms for data warehouse design , 2003, Inf. Syst..

[21]  Jamel Feki,et al.  SSReq: A method for designing Star Schemas from decisional requirements , 2017, 2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE).

[22]  Mario Piattini,et al.  Metrics for data warehouse conceptual models understandability , 2007, Inf. Softw. Technol..

[23]  Jose-Norberto Mazón,et al.  A hybrid model driven development framework for the multidimensional modeling of data warehouses! , 2009, SGMD.

[24]  Shiwei Tang,et al.  Triple-driven data modeling methodology in data warehousing: a case study , 2006, DOLAP '06.

[25]  Jaelson Brelaz de Castro,et al.  Enhancing Data Warehouse Design with the NFR Framework , 2002, WER.

[26]  Oscar Pastor,et al.  From Early to Late Requirements: A Goal-Based Approach , 2006, AOIS.

[27]  Stefano Rizzi,et al.  Volunteered Multidimensional Design to the Test: The Farmland Biodiversity VGI4Bio Project's Experiment , 2019, DOLAP.

[28]  Stefano Rizzi,et al.  ProtOLAP: rapid OLAP prototyping with on-demand data supply , 2013, DOLAP '13.

[29]  Stefano Paraboschi,et al.  Designing data marts for data warehouses , 2001, TSEM.

[30]  Alejandro A. Vaisman Requirements Elicitation for Decision Support Systems: A Data Quality Approach , 2006, ICEIS.

[31]  Matteo Golfarelli,et al.  The Dimensional Fact Model: A Conceptual Model for Data Warehouses , 1998, Int. J. Cooperative Inf. Syst..

[32]  Anjana Gosain,et al.  Literature Review of Data Model Quality Metrics of Data Warehouse , 2015 .

[33]  Faïez Gargouri,et al.  Automatic construction of multidimensional schema from OLAP requirements , 2005, The 3rd ACS/IEEE International Conference onComputer Systems and Applications, 2005..

[34]  John Mylopoulos,et al.  Goal-oriented requirements engineering: an extended systematic mapping study , 2017, Requirements Engineering.

[35]  Alberto Abelló,et al.  Interactive multidimensional modeling of linked data for exploratory OLAP , 2018, Inf. Syst..

[36]  Matteo Golfarelli,et al.  Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD , 2011, DaWaK.

[37]  Robert Winter,et al.  A method for demand-driven information requirements analysis in data warehousing projects , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[38]  Alberto Abelló,et al.  A Survey of Multidimensional Modeling Methodologies , 2009, Int. J. Data Warehous. Min..

[39]  Campbell Wilson,et al.  A conceptual query-driven design framework for data warehousing , 2007 .

[40]  Jameleddine Hassine,et al.  A questionnaire-based survey methodology for systematically validating goal-oriented models , 2015, Requirements Engineering.

[41]  Kenji Ikeda,et al.  Visual pivot-table components for web application development , 2007 .

[42]  Jorge Oliveira e Sá,et al.  Data Warehouse Methodology: A Process Driven Approach , 2004, CAiSE.

[43]  PinetFrancois,et al.  Conceptual model for spatial data cubes , 2015 .

[44]  Alberto Abelló,et al.  Multidimensional Design by Examples , 2006, DaWaK.

[45]  Boris Vrdoljak,et al.  Designing Web Warehouses from XML Schemas , 2003, DaWaK.

[46]  William Yeoh,et al.  Critical Success Factors for Business Intelligence Systems , 2010, J. Comput. Inf. Syst..

[47]  David C. Yen,et al.  Critical factors influencing the adoption of data warehouse technology: a study of the banking industry in Taiwan , 2004, Decis. Support Syst..

[48]  Thomas Benker,et al.  A Case Study on Model-Driven Data Warehouse Development , 2012, DaWaK.

[49]  Sandro Bimonte,et al.  Conceptual model for spatial data cubes: A UML profile and its automatic implementation , 2015, Comput. Stand. Interfaces.

[50]  Beate List,et al.  Developing Requirements for Data Warehouse Systems with Use Cases , 2001 .

[51]  Rachid Chalal,et al.  EXODuS: Exploratory OLAP over Document Stores , 2017, Inf. Syst..

[52]  Claudia Diamantini,et al.  Collaborative Building of an Ontology of Key Performance Indicators , 2014, OTM Conferences.

[53]  Jean-Pierre Corriveau,et al.  Modeling and Validating Requirements Using Executable Cotnracts and Scenarios , 2010, 2010 Eighth ACIS International Conference on Software Engineering Research, Management and Applications.

[54]  Jose-Norberto Mazón,et al.  Reconciling requirement-driven data warehouses with data sources via multidimensional normal forms , 2007, Data Knowl. Eng..

[55]  Naveen Prakash,et al.  A multifactor approach for elicitation of Information requirements of data warehouses , 2017, Requirements Engineering.