Improved Parallel Data Mining Policy for Cloud Computing Environments

Cloud computing is a business model. It distributes computing tasks in a large number of computer resource pool configuration. It can provide on-demand for the user computing power, storage capacity and application services capabilities. Cloud computing offers a cheap and efficient solution for storing and analyzing massive amounts of data. Data mining is going to extract useful information and knowledge from a lot of, incomplete, noisy, fuzzy, random data to hidden practice in which people do not know in advance, but is potentially. It has played a guiding role in many fields of scientific research and business decisions ,with far-reaching social and economic significance. Data mining policy for cloud computing environments has important theoretical significance and application value. In this paper, after a series of studies in the improvement of parallel data mining algorithms can greatly improve the efficiency of data mining algorithms.

[1]  Rajkumar Buyya,et al.  Market-Oriented Cloud Computing: Vision, Hype, and Reality of Delivering Computing as the 5th Utility , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[2]  Doug Johnson,et al.  Computing in the Clouds. , 2010 .

[3]  Arlo Faria,et al.  MapReduce : Distributed Computing for Machine Learning , 2006 .

[4]  Ting Liu,et al.  Clustering Billions of Images with Large Scale Nearest Neighbor Search , 2007, 2007 IEEE Workshop on Applications of Computer Vision (WACV '07).

[5]  Nitesh V. Chawla,et al.  Scaling up Classifiers to Cloud Computers , 2008, 2008 Eighth IEEE International Conference on Data Mining.