K-Means Parallel Acceleration for Sparse Data Dimensions on Flink
暂无分享,去创建一个
Kenli Li | Mingxing Duan | Chubo Liu | Xiangke Liao | Zihao Zeng | Kenli Li | Xiangke Liao | Mingxing Duan | Chubo Liu | Zihao Zeng
[1] Danna Zhou,et al. d. , 1934, Microbial pathogenesis.
[2] Anil K. Jain,et al. Algorithms for Clustering Data , 1988 .
[3] Juby Mathew,et al. Scalable parallel clustering approach for large data using parallel K means and firefly algorithms , 2014, 2014 International Conference on High Performance Computing and Applications (ICHPCA).
[4] Daniel T. Larose,et al. Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .
[5] François Fleuret,et al. Nested Mini-Batch K-Means , 2016, NIPS.
[6] Jiming Liu,et al. Speeding up K-Means Algorithm by GPUs , 2010, 2010 10th IEEE International Conference on Computer and Information Technology.
[7] S. P. Lloyd,et al. Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.
[8] Guangwen Yang,et al. Large-Scale Hierarchical k-means for Heterogeneous Many-Core Supercomputers , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.
[9] Kenli Li,et al. GFlink: An In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data , 2016, IEEE Transactions on Parallel and Distributed Systems.
[10] Tsuyoshi Murata,et al. {m , 1934, ACML.
[11] Sergei Vassilvitskii,et al. k-means++: the advantages of careful seeding , 2007, SODA '07.
[12] Sanjay Agrawal,et al. A Performance Analysis of MapReduce Task with Large Number of Files Dataset in Big Data Using Hadoop , 2014, 2014 Fourth International Conference on Communication Systems and Network Technologies.
[13] Christopher Potts,et al. Learning Word Vectors for Sentiment Analysis , 2011, ACL.
[14] Sean Owen,et al. Mahout in Action , 2011 .
[15] Chitresh Verma,et al. Big Data representation for grade analysis through Hadoop framework , 2016, 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence).
[16] E. Sivaraman,et al. High Performance and Fault Tolerant Distributed File System for Big Data Storage and Processing Using Hadoop , 2014, 2014 International Conference on Intelligent Computing Applications.
[17] Sergei Vassilvitskii,et al. Scalable K-Means++ , 2012, Proc. VLDB Endow..
[18] V. Santhi,et al. Performance Analysis of Parallel K-Means with Optimization Algorithms for Clustering on Spark , 2018, ICDCIT.