Incorporating Recovery from Failures into a Data Integration Benchmark
暂无分享,去创建一个
The proposed TPC-DI benchmark measures the performance of Data Integration systems (a.k.a. ETL systems) given the task of integrating data from an OLTP system and other data sources to create a data warehouse.This paper describes the scenario, structure and timing principles used in TPC-DI. Although failure recovery is very important in real deployments of Data Integration systems, certain complexities made it difficult to specify in the benchmark. Hence failure recovery aspects have been scoped out of the current version of TPC-DI. The issues around failure recovery are discussed in detail and some options are described. Finally the audience is invited to offer additional suggestions.
[1] Jean-Claude Laprie,et al. Dependable computing: concepts, limits, challenges , 1995 .
[2] Lieven Eeckhout,et al. Performance Evaluation and Benchmarking , 2005 .
[3] Ralph Kimball,et al. The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .
[4] Daniel Pol,et al. Principles for an ETL Benchmark , 2009, TPCTC.