Metadata and Data Warehouse Quality

In the traditional view, data warehouses provide large-scale caches of historic data. They sit between information sources gained externally or through online transaction processing systems (OLTP), and decision support or data mining queries following the vision of online analytic processing (OLAP). Three main arguments have been put forward in favor of this caching approach: 1. Performance and safety considerations: The concurrency control methods of most DBMS do not react well to a mix of short update transactions (as in OLTP) and OLAP queries that typically search a large portion of the database. Moreover, the OLTP systems are often critical for the operation of the organization and must not be in danger of corruption by other applications. 2. Logical interpretability problems: Inspired by the success of spreadsheet techniques, OLAP users tend to think in terms of highly structured multidimensional data models, whereas information sources offer at best relational, often just semi-structured data models. 3. Temporal and granularity mismatch: OLTP systems focus on current operational support in great detail, whereas OLAP often considers historical developments at a somewhat less detailed granularity.

[1]  Niv Ahituv,et al.  Assessing Data Reliability in an Information System , 1987, J. Manag. Inf. Syst..

[2]  Satya Sachdeva Metadata for Data Warehouse , 2000 .

[3]  Charles H. Kriebel,et al.  Evaluating the Quality of Information Systems , 1979 .

[4]  Matthias Jarke,et al.  Design and Analysis of Quality Information for Data Warehouses , 1998, ER.

[5]  F. R. Keller,et al.  Data Quality , 1990, Taxation and Labour Supply.

[6]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[7]  Matthias Jarke,et al.  Managing knowledge about information system evolution , 1988, SIGMOD '88.

[8]  Maurizio Lenzerini,et al.  Conceptual Language for Statistical Data Modeling , 1995, Data Knowl. Eng..

[9]  Christof Peltason The BACK system—an overview , 1991, SGAR.

[10]  Matthias Jarke,et al.  A toolkit for negotiation support interfaces to multi-dimensional data , 1997, SIGMOD '97.

[11]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[12]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[13]  Matthias Jarke,et al.  Requirements Engineering Repositories: Formal Support for Informal Teamwork Methods , 1997, Requirements Targeting Software and Systems Engineering.

[14]  Miyoung Jeong,et al.  Quality function deployment: An extended framework for service quality and customer satisfaction in the hospitality industry , 1998 .

[15]  Philip A. Bernstein,et al.  Microsoft Repository Version 2 and the Open Information Model , 1999, Inf. Syst..

[16]  Kalle Lyytinen,et al.  The Impact of Computer Supported Technologies in Information Systems Development, Proceedings of the IFIP WG8.2 Working Conference on The Impact of Computer Supported Technologies in Information Systems Development, Minneapolis, Minnesota, USA, 14-17 June, 1992 , 1992, The Impact of Computer Supported Technologies in Information Systems Development.

[17]  Ma Janson,et al.  Data quality: The Achilles heel of end-user computing , 1988 .

[18]  John R. Smith,et al.  Dynamic assembly of views in data cubes , 1998, PODS '98.

[19]  John Moriarty,et al.  System Development Quality Control , 1978, MIS Q..

[20]  Ulrich Reimer,et al.  Mining Insurance Data at Swiss Life , 1997, VLDB.

[21]  John Mylopoulos,et al.  Representing and Using Nonfunctional Requirements: A Process-Oriented Approach , 1992, IEEE Trans. Software Eng..

[22]  Donald P. Ballou,et al.  Cost/quality tradeoffs for control procedures in information systems , 1987 .

[23]  Giri Kumar Tayi,et al.  Methodology for allocating resources for data quality enhancement , 1989, Commun. ACM.