Partition-Based Online Aggregation with Shared Sampling in the Cloud
[1] Adam Jacobs,et al. The pathologies of big data , 2009, Commun. ACM.
[2] Joos-Hendrik Böse,et al. Beyond online aggregation: parallel and incremental data mining with online Map-Reduce , 2010, MDAC '10.
[3] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[4] Michael J. Carey,et al. Extending Map-Reduce for Efficient Predicate-Based Sampling , 2012, 2012 IEEE 28th International Conference on Data Engineering.
[5] Chris Jermaine,et al. Online aggregation for large MapReduce jobs , 2011, Proc. VLDB Endow.
[6] George Kollios,et al. MRShare , 2010, Proc. VLDB Endow.
[7] Peter J. Haas,et al. Large-sample and deterministic confidence intervals for online aggregation , 1997, Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150).
[8] Fang Dong,et al. Improving Online Aggregation Performance for Skewed Data Distribution , 2012, DASFAA.
[9] Jeffrey F. Naughton,et al. A scalable hash ripple join algorithm , 2002, SIGMOD '02.
[10] Joseph M. Hellerstein,et al. Online aggregation and continuous query support in MapReduce , 2010, SIGMOD Conference.
[11] Beng Chin Ooi,et al. Continuous sampling for online aggregation over multiple queries , 2010, SIGMOD Conference.
[12] Surajit Chaudhuri,et al. Effective use of block-level sampling in statistics estimation , 2004, SIGMOD '04.
[13] Yuanyuan Tian,et al. CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop , 2011, Proc. VLDB Endow.
[14] Liang Dong,et al. Starfish: A Self-tuning System for Big Data Analytics , 2011, CIDR.
[15] Scott Shenker,et al. Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling , 2010, EuroSys '10.
[16] Carlo Zaniolo,et al. Early Accurate Results for Advanced Analytics on MapReduce , 2012, Proc. VLDB Endow.
[17] Helen J. Wang,et al. Online aggregation , 1997, SIGMOD '97.
[18] Magdalena Balazinska,et al. ArrayStore: a storage manager for complex parallel array processing , 2011, SIGMOD '11.
[19] Beng Chin Ooi,et al. Distributed Online Aggregation , 2009, Proc. VLDB Endow.
[20] Rares Vernica,et al. Hyracks: A flexible and extensible foundation for data-intensive computing , 2011, 2011 IEEE 27th International Conference on Data Engineering.
[21] Anwar M. Ghuloum,et al. Viewpoint: Face the inevitable, embrace parallelism , 2009, Commun. ACM.
[22] Peter J. Haas,et al. Ripple joins for online aggregation , 1999, SIGMOD '99.
[23] Rajeev Motwani,et al. Overcoming limitations of sampling for aggregation queries , 2001, Proceedings 17th International Conference on Data Engineering.
[24] Xiaofeng Meng,et al. You can stop early with COLA: online processing of aggregate queries in the cloud , 2012, CIKM.