Easy and effective parallel programmable ETL
[1] Wilson Ifill, et al. PyCSP - Communicating Sequential Processes for Python, 2007.
[2] Sanjay Ghemawat, et al. MapReduce: Simplified Data Processing on Large Clusters, 2004, OSDI.
[3] Ehtisham Zaidi, et al. Magic Quadrant for Data Integration Tools, 2010.
[4] Torben Bach Pedersen, et al. pygrametl: a powerful programming framework for extract-transform-load programmers, 2009, DOLAP.
[5] Ravi Kumar, et al. Pig latin: a not-so-foreign language for data processing, 2008, SIGMOD Conference.
[6] Zheng Shao, et al. Hive - a petabyte scale data warehouse using Hadoop, 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).
[7] Brian Vinter, et al. PyCSP - Communicating Sequential Processes for Python, 2007.
[8] Panos Vassiliadis, et al. Deciding the physical implementation of ETL workflows, 2007, DOLAP '07.
[9] Torben Bach Pedersen, et al. ETLMR: A Highly Scalable Dimensional ETL Framework Based on MapReduce, 2013, Trans. Large Scale Data Knowl. Centered Syst.
[10] Sanjay Ghemawat, et al. MapReduce: simplified data processing on large clusters, 2008, CACM.
[11] Michael Stonebraker, et al. MapReduce and parallel DBMSs: friends or foes?, 2010, CACM.
[12] Panos Vassiliadis, et al. A Survey of Extract-Transform-Load Technology, 2009, Int. J. Data Warehous. Min.