Naïve Bayes Classifier: A MapReduce Approach

[1]  Douglas A. Dodge,et al.  Large-scale seismic signal analysis with Hadoop , 2014, Comput. Geosci..

[2]  Sabu M. Thampi,et al.  Improving Hadoop Performance in Handling Small Files , 2011, ACC.

[3]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[4]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[5]  Hae-Chang Rim,et al.  Some Effective Techniques for Naive Bayes Text Classification , 2006, IEEE Transactions on Knowledge and Data Engineering.

[6]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[7]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Joshua Goodman,et al.  Finding advertising keywords on web pages , 2006, WWW '06.

[9]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[10]  Walmir M. Caminhas,et al.  A review of machine learning approaches to Spam filtering , 2009, Expert Syst. Appl..

[11]  J. Shigemitsu,et al.  B-meson decay constant from unquenched lattice QCD. , 2005, Physical review letters.

[12]  Geoffrey C. Fox,et al.  Investigation of Data Locality in MapReduce , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[13]  William H. Dutton,et al.  Clouds, big data, and smart assets: Ten tech-enabled business trends to watch , 2010 .

[14]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[15]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[16]  S. Larson The shrinkage of the coefficient of multiple correlation. , 1931 .

[17]  Filippo Menczer,et al.  Visualizing Communication on Social Media: Making Big Data Accessible , 2012, ArXiv.

[18]  Craig MacDonald,et al.  MapReduce indexing strategies: Studying scalability and efficiency , 2012, Inf. Process. Manag..

[19]  Sean Owen,et al.  Mahout in Action , 2011 .

[20]  J. Alberto Espinosa,et al.  Big Data: Issues and Challenges Moving Forward , 2013, 2013 46th Hawaii International Conference on System Sciences.

[21]  Kyle Chard,et al.  Social Cloud: Cloud Computing in Social Networks , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[22]  Daniel Keren,et al.  Painter identification using local features and naive Bayes , 2002, Object recognition supported by user interaction for service robots.

[24]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[25]  Xubin He,et al.  Implementing WebGIS on Hadoop: A case study of improving small file I/O performance on HDFS , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[26]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[27]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[28]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[29]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[31]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[32]  Zhou Ai-wu An Improved Algorithm of DBSCAN , 2011 .

[33]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[34]  Arkady B. Zaslavsky,et al.  Sensing as a Service and Big Data , 2013, ArXiv.

[35]  Kunle Olukotun,et al.  Map-Reduce for Machine Learning on Multicore , 2006, NIPS.

[36]  Rajesh Parekh,et al.  Lessons and Challenges from Mining Retail E-Commerce Data , 2004, Machine Learning.