MapReduce is Good Enough? If All You Have is a Hammer, Throw Away Everything That's Not a Nail!
[1] Beng Chin Ooi,et al. Llama: leveraging columnar storage for scalable join processing in the MapReduce framework , 2011, SIGMOD '11.
[2] Jorge Nocedal,et al. On the limited memory BFGS method for large scale optimization , 1989, Math. Program..
[3] Yann LeCun,et al. Large Scale Online Learning , 2003, NIPS.
[4] Leslie G. Valiant,et al. A bridging model for parallel computation , 1990, CACM.
[5] Jorge-Arnulfo Quiané-Ruiz,et al. Trojan data layouts: right shoes for a running elephant , 2011, SoCC.
[6] Sergei Vassilvitskii,et al. A model of computation for MapReduce , 2010, SODA '10.
[7] Jeffrey D. Ullman,et al. Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation , 2012, ArXiv.
[8] Benjamin Moseley,et al. Fast Clustering using MapReduce (Extended Abstract) , 2011 .
[9] Sreenivas Gollapudi,et al. Estimating PageRank on graph streams , 2008, PODS.
[10] Mirek Riedewald,et al. Processing theta-joins using MapReduce , 2011, SIGMOD '11.
[11] Jeffrey Davis,et al. Continuous analytics over discontinuous streams , 2010, SIGMOD Conference.
[12] Jignesh M. Patel,et al. A comparison of join algorithms for log processing in MapReduce , 2010, SIGMOD Conference.
[13] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[14] Magdalena Balazinska,et al. SkewTune: mitigating skew in mapreduce applications , 2012, SIGMOD Conference.
[15] Benjamin Moseley,et al. Fast clustering using MapReduce , 2011, KDD.
[16] Dominic Battré,et al. Nephele/PACTs: a programming model and execution framework for web-scale analytical processing , 2010, SoCC '10.
[17] Abraham Silberschatz,et al. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads , 2009, Proc. VLDB Endow..
[18] Jimmy J. Lin,et al. Book Reviews: Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer , 2010, CL.
[19] Betty Salzberg,et al. Bulletin of the Technical Committee on Data Engineering , 1995 .
[20] Zhiwei Xu,et al. RCFile: A fast and space-efficient data placement structure in MapReduce-based warehouse systems , 2011, 2011 IEEE 27th International Conference on Data Engineering.
[21] Joseph M. Hellerstein,et al. MapReduce Online , 2010, NSDI.
[22] Jimmy J. Lin,et al. Large-scale machine learning at twitter , 2012, SIGMOD Conference.
[23] Rajeev Motwani,et al. The PageRank Citation Ranking: Bringing Order to the Web , 1999, WWW 1999.
[24] Badrish Chandramouli,et al. Temporal Analytics on Big Data for Web Advertising , 2012, 2012 IEEE 28th International Conference on Data Engineering.
[25] Thomas G. Dietterich. Machine-Learning Research: Four Current Directions , 1997 .
[26] Michael Stonebraker,et al. Monitoring Streams - A New Class of Data Management Applications , 2002, VLDB.
[27] Jimmy J. Lin,et al. Design patterns for efficient graph algorithms in MapReduce , 2010, MLG '10.
[28] Cong Yu,et al. Distributed cube materialization on holistic measures , 2011, 2011 IEEE 27th International Conference on Data Engineering.
[29] Jörg Tiedemann,et al. Bitext Alignment , 2011, Synthesis Lectures on Human Language Technologies.
[30] Matei Zaharia,et al. Job Scheduling for Multi-User MapReduce Clusters , 2009 .
[31] Randy H. Katz,et al. Improving MapReduce Performance in Heterogeneous Environments , 2008, OSDI.
[32] Yanfeng Zhang,et al. PrIter: A Distributed Framework for Prioritizing Iterative Computations , 2011, IEEE Transactions on Parallel and Distributed Systems.
[33] Christos Faloutsos,et al. Clustering very large multi-dimensional datasets with MapReduce , 2011, KDD.
[34] Chen Li,et al. Inside "Big Data management": ogres, onions, or parfaits? , 2012, EDBT '12.
[35] Jorge-Arnulfo Quiané-Ruiz,et al. Only Aggressive Elephants are Fast Elephants , 2012, Proc. VLDB Endow..
[36] John Langford,et al. Sparse Online Learning via Truncated Gradient , 2008, NIPS.
[37] Jimmy J. Lin,et al. Fast, Easy, and Cheap: Construction of Statistical Machine Translation Models with MapReduce , 2008, WMT@ACL.
[38] Léon Bottou,et al. Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.
[39] Vinay Setty,et al. Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) , 2010, Proc. VLDB Endow..
[40] Johannes Gehrke. Letter from the Special Issue Editor , 2003, IEEE Data Eng. Bull..
[41] Aart J. C. Bik,et al. Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.
[42] Leonardo Neumeyer,et al. S4: Distributed Stream Computing Platform , 2010, 2010 IEEE International Conference on Data Mining Workshops.
[43] Geoffrey C. Fox,et al. Twister: a runtime for iterative MapReduce , 2010, HPDC '10.
[44] Ludmila I. Kuncheva,et al. Combining Pattern Classifiers: Methods and Algorithms , 2004 .
[45] Fusheng Wang,et al. YSmart: Yet Another SQL-to-MapReduce Translator , 2011, 2011 31st International Conference on Distributed Computing Systems.
[46] D. Rubin,et al. Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper , 1977 .
[47] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..
[48] Michael D. Ernst,et al. HaLoop , 2010, Proc. VLDB Endow..
[49] Younghoon Kim,et al. Parallel Top-K Similarity Join Algorithms Using MapReduce , 2012, 2012 IEEE 28th International Conference on Data Engineering.
[50] Gideon S. Mann,et al. Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models , 2009, NIPS.
[51] Rares Vernica,et al. Hyracks: A flexible and extensible foundation for data-intensive computing , 2011, 2011 IEEE 27th International Conference on Data Engineering.
[52] Abraham Silberschatz,et al. Efficient processing of data warehousing queries in a split execution environment , 2011, SIGMOD '11.
[53] Michael J. Franklin,et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.
[54] Jignesh M. Patel,et al. Column-Oriented Storage Techniques for MapReduce , 2011, Proc. VLDB Endow..
[55] Gideon S. Mann,et al. Distributed Training Strategies for the Structured Perceptron , 2010, NAACL.
[56] Neoklis Polyzotis,et al. Scaling Datalog for Machine Learning on Big Data , 2012, ArXiv.
[57] Sandeep Tata,et al. Clydesdale: structured data processing on MapReduce , 2012, EDBT '12.
[58] Subhash C. Bagui,et al. Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.
[59] Pramod Bhatotia,et al. Large-scale Incremental Data Processing with Change Propagation , 2011, HotCloud.