DiNoDB: Efficient Large-Scale Raw Data Analytics
暂无分享,去创建一个
Marko Vukolic | Anastasia Ailamaki | Pietro Michiardi | Ioannis Alagiannis | Erietta Liarou | Yongchao Tian
[1] Abraham Silberschatz,et al. Invisible loading: access-driven data transfer from raw files into database systems , 2013, EDBT '13.
[2] Hairong Kuang,et al. The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).
[3] Scott Shenker,et al. Spark: Cluster Computing with Working Sets , 2010, HotCloud.
[4] Vinay Setty,et al. Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) , 2010, Proc. VLDB Endow..
[5] Yuanyuan Tian,et al. CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop , 2011, Proc. VLDB Endow..
[6] Anastasia Ailamaki,et al. NoDB: efficient query execution on raw data files , 2012, Commun. ACM.
[7] Scott Shenker,et al. Discretized streams: fault-tolerant streaming computation at scale , 2013, SOSP.
[8] Abraham Silberschatz,et al. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads , 2009, Proc. VLDB Endow..
[9] Scott Shenker,et al. Shark: SQL and rich analytics at scale , 2012, SIGMOD '13.
[10] Hans-Peter Kriegel,et al. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.
[11] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[12] J. MacQueen. Some methods for classification and analysis of multivariate observations , 1967 .