A Parallel Distributed Weka Framework for Big Data Mining Using Spark
暂无分享,去创建一个
Goran Nenadic | John A. Keane | Firat Tekiner | Paraskevas Yiapanis | Aris-Kyriakos Koliopoulos | G. Nenadic | J. Keane | F. Tekiner | Aris-Kyriakos Koliopoulos | Paraskevas Yiapanis
[1] Michael J. Franklin,et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.
[2] Tim Kraska,et al. MLI: An API for Distributed Machine Learning , 2013, 2013 IEEE 13th International Conference on Data Mining.
[3] María S. Pérez-Hernández,et al. Adapting the Weka Data Mining Toolkit to a Grid Based Environment , 2005, AWIC.
[4] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[5] Padhraic Smyth,et al. From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.
[6] Stefan Wrobel,et al. Toolkit-Based High-Performance Data Mining of Large Data on MapReduce Clusters , 2009, 2009 IEEE International Conference on Data Mining Workshops.
[7] Domenico Talia,et al. Weka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids , 2005, PKDD.
[8] Shirish Tatikonda,et al. SystemML: Declarative machine learning on MapReduce , 2011, 2011 IEEE 27th International Conference on Data Engineering.
[9] Ralf Klinkenberg,et al. Data Classification: Algorithms and Applications , 2014 .
[10] Robert A. Muenchen,et al. The Popularity of Data Analysis Software , 2013 .
[11] Zoltán Prekopcsák,et al. Radoop: Analyzing Big Data with RapidMiner and Hadoop , 2011 .
[12] Scott Shenker,et al. Disk-Locality in Datacenter Computing Considered Irrelevant , 2011, HotOS.
[13] Peter J. Haas,et al. Ricardo: integrating R and Hadoop , 2010, SIGMOD Conference.
[14] Antony I. T. Rowstron,et al. Scale-up vs scale-out for Hadoop: time to rethink? , 2013, SoCC.
[15] Sherif Sakr,et al. The family of mapreduce and large-scale data processing systems , 2013, CSUR.
[16] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[17] Rakesh Agrawal,et al. Parallel Mining of Association Rules , 1996, IEEE Trans. Knowl. Data Eng..
[18] Ian Witten,et al. Data Mining , 2000 .
[19] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .
[20] Ian H. Witten,et al. The WEKA data mining software: an update , 2009, SKDD.
[21] Samuel P. Midkiff,et al. RABID: A Distributed Parallel R for Large Datasets , 2014, 2014 IEEE International Congress on Big Data.