MR-DIS: democratic instance selection for big data by MapReduce
暂无分享,去创建一个
Álvar Arnaiz-González | José-Francisco Díez-Pastor | Carlos López Nozal | Alejandro González-Rogel | J. Díez-Pastor | C. L. Nozal | Álvar Arnaiz-González | Alejandro González-Rogel
[1] Nicolás García-Pedrajas,et al. A divide-and-conquer recursive approach for scaling up instance selection algorithms , 2009, Data Mining and Knowledge Discovery.
[2] Fabrizio Angiulli,et al. Distributed Nearest Neighbor-Based Condensation of Very Large Data Sets , 2007, IEEE Transactions on Knowledge and Data Engineering.
[3] Juan José Rodríguez Diez,et al. Instance selection of linear complexity for big data , 2016, Knowl. Based Syst..
[4] Vipin Kumar,et al. Isoefficiency: measuring the scalability of parallel algorithms and architectures , 1993, IEEE Parallel & Distributed Technology: Systems & Applications.
[5] G. Amdhal,et al. Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).
[6] Antonio González Muñoz,et al. Three new instance selection methods based on local sets: A comparative study with several approaches from a bi-objective perspective , 2015, Pattern Recognit..
[7] Nicolás García-Pedrajas,et al. Democratic instance selection: A linear complexity instance selection algorithm based on classifier ensemble concepts , 2010, Artif. Intell..
[8] Tony R. Martinez,et al. Instance Pruning Techniques , 1997, ICML.
[9] Francisco Herrera,et al. Stratification for scaling up evolutionary prototype selection , 2005, Pattern Recognit. Lett..
[10] Scott Shenker,et al. Spark: Cluster Computing with Working Sets , 2010, HotCloud.
[11] Francisco Herrera,et al. MRPR: A MapReduce solution for prototype reduction in big data classification , 2015, Neurocomputing.
[12] Francisco Herrera,et al. Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[13] Piotr Indyk,et al. Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.
[14] Michael Minelli,et al. Big Data, Big Analytics: Emerging Business Intelligence and Analytic Trends for Today's Businesses , 2012 .
[15] Chris Mellish,et al. Advances in Instance Selection for Instance-Based Learning Algorithms , 2002, Data Mining and Knowledge Discovery.
[16] Daniel Asimov,et al. The grand tour: a tool for viewing multidimensional data , 1985 .
[17] Francisco Herrera,et al. kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data , 2017, Knowl. Based Syst..
[18] M. Anusha,et al. Big Data-Survey , 2016 .
[19] Francisco Herrera,et al. Data Preprocessing in Data Mining , 2014, Intelligent Systems Reference Library.
[20] Chih-Fong Tsai,et al. Big data mining with parallel computing: A comparison of distributed and MapReduce methodologies , 2016, J. Syst. Softw..
[21] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[22] S. R,et al. Data Mining with Big Data , 2017, 2017 11th International Conference on Intelligent Systems and Control (ISCO).
[23] Peter E. Hart,et al. The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.
[24] Verónica Bolón-Canedo,et al. Data discretization: taxonomy and big data challenge , 2016, WIREs Data Mining Knowl. Discov..