Distributed Column Subset Selection on MapReduce
暂无分享,去创建一个
Mohamed S. Kamel | Ali Ghodsi | Ahmed Elgohary | Ahmed K. Farahat | Ahmed Elgohary | A. Ghodsi | M. Kamel
[1] Venkatesan Guruswami,et al. Optimal column-based low-rank matrix reconstruction , 2011, SODA.
[2] Robert H. Halstead,et al. Matrix Computations , 2011, Encyclopedia of Parallel Computing.
[3] Mohamed S. Kamel,et al. Efficient greedy feature selection for unsupervised learning , 2012, Knowledge and Information Systems.
[4] Gene H. Golub,et al. Matrix computations (3rd ed.) , 1996 .
[5] Alan M. Frieze,et al. Clustering Large Graphs via the Singular Value Decomposition , 2004, Machine Learning.
[6] Anil K. Jain,et al. Algorithms for Clustering Data , 1988 .
[7] Richard A. Harshman,et al. Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..
[8] Edward Y. Chang,et al. Parallel Spectral Clustering in Distributed Systems , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[9] Sanjoy Dasgupta,et al. An elementary proof of a theorem of Johnson and Lindenstrauss , 2003, Random Struct. Algorithms.
[10] Antonio Torralba,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .
[11] Petros Drineas,et al. FAST MONTE CARLO ALGORITHMS FOR MATRICES II: COMPUTING A LOW-RANK APPROXIMATION TO A MATRIX∗ , 2004 .
[12] Christos Boutsidis,et al. Near Optimal Column-Based Matrix Reconstruction , 2011, 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science.
[13] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[14] Nathan Halko,et al. An Algorithm for the Principal Component Analysis of Large Data Sets , 2010, SIAM J. Sci. Comput..
[15] Dimitris Achlioptas,et al. Database-friendly random projections: Johnson-Lindenstrauss with binary coins , 2003, J. Comput. Syst. Sci..
[16] Alan M. Frieze,et al. Fast monte-carlo algorithms for finding low-rank approximations , 2004, JACM.
[17] S. Muthukrishnan,et al. Subspace Sampling and Relative-Error Matrix Approximation: Column-Based Methods , 2006, APPROX-RANDOM.
[18] Kenneth Ward Church,et al. Very sparse random projections , 2006, KDD '06.
[19] Peter J. Rousseeuw,et al. Clustering by means of medoids , 1987 .
[20] Mohamed S. Kamel,et al. An Efficient Greedy Method for Unsupervised Feature Selection , 2011, 2011 IEEE 11th International Conference on Data Mining.
[21] Jimmy J. Lin,et al. Pairwise Document Similarity in Large Collections with MapReduce , 2008, ACL.
[22] Tom White,et al. Hadoop: The Definitive Guide , 2009 .
[23] Malik Magdon-Ismail,et al. Column subset selection via sparse approximation of SVD , 2012, Theor. Comput. Sci..
[24] Yiming Yang,et al. RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..
[25] Christos Boutsidis,et al. Clustered subset selection and its applications on it service metrics , 2008, CIKM '08.
[26] Santosh S. Vempala,et al. Matrix approximation and projective clustering via volume sampling , 2006, SODA '06.
[27] Sergei Vassilvitskii,et al. A model of computation for MapReduce , 2010, SODA '10.
[28] Benjamin Moseley,et al. Fast clustering using MapReduce , 2011, KDD.
[29] D. Wan. AN ELEMENTARY PROOF OF A THEOREM OF KATZ , 1989 .
[30] Luis Rademacher,et al. Efficient Volume Sampling for Row/Column Subset Selection , 2010, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science.
[31] Christos Boutsidis,et al. An improved approximation algorithm for the column subset selection problem , 2008, SODA.