Cloud computing is a business model. It distributes computing tasks in a large number of computer resource pool configuration. It can provide on-demand for the user computing power, storage capacity and application services capabilities. Cloud computing offers a cheap and efficient solution for storing and analyzing massive amounts of data. Data mining is going to extract useful information and knowledge from a lot of, incomplete, noisy, fuzzy, random data to hidden practice in which people do not know in advance, but is potentially. It has played a guiding role in many fields of scientific research and business decisions ,with far-reaching social and economic significance. Data mining policy for cloud computing environments has important theoretical significance and application value. In this paper, after a series of studies in the improvement of parallel data mining algorithms can greatly improve the efficiency of data mining algorithms.
[1]
Rajkumar Buyya,et al.
Market-Oriented Cloud Computing: Vision, Hype, and Reality of Delivering Computing as the 5th Utility
,
2009,
2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.
[2]
Doug Johnson,et al.
Computing in the Clouds.
,
2010
.
[3]
Arlo Faria,et al.
MapReduce : Distributed Computing for Machine Learning
,
2006
.
[4]
Ting Liu,et al.
Clustering Billions of Images with Large Scale Nearest Neighbor Search
,
2007,
2007 IEEE Workshop on Applications of Computer Vision (WACV '07).
[5]
Nitesh V. Chawla,et al.
Scaling up Classifiers to Cloud Computers
,
2008,
2008 Eighth IEEE International Conference on Data Mining.