The ETLMR MapReduce-Based ETL Framework
暂无分享,去创建一个
This paper presents ETLMR, a parallel Extract-Transform-Load (ETL) programming framework based on MapReduce. It has builtin support for high-level ETL-specific constructs including star schemas, snowflake schemas, and slowly changing dimensions (SCDs). ETLMR gives both high programming productivity and high ETL scalability.
[1] Sanjay Ghemawat,et al. MapReduce: a flexible data processing tool , 2010, CACM.
[2] Torben Bach Pedersen,et al. pygrametl: a powerful programming framework for extract-transform-load programmers , 2009, DOLAP.