Research on Parallel Data Mining Strategies in Cloud Computing Environments

With the rapid development of computer science and technology, more and more data is stored on computer storage media. Data mining (DM) has emerged as an interdisciplinary subject that builds on methods and algorithms from earlier research fields. Cloud computing is an emerging shared-infrastructure approach that is based on open standards and services and is Internet-centric, providing safe, fast, and convenient data storage and network computing services. Applying cloud computing to data mining offers a way to handle the growing volume of mass data. This paper presents a method for partitioning and allocating data sets that is suited to a cloud computing environment. It then introduces an improved Apriori algorithm based on this method, describes its two parallel computation processes running on the cloud platform, and reports the results of the simulation.
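The general idea of partitioning a data set and counting Apriori candidates in parallel can be illustrated with a minimal sketch. This is not the improved algorithm proposed in the paper, whose details the abstract does not give; it is a plain count-distribution variant, and the function names (`partition`, `local_counts`, `next_candidates`, `parallel_apriori`) as well as the round-robin partitioning are assumptions made for illustration only. The partitions are processed in sequence here; on a cloud platform each call to `local_counts` would presumably run on a separate worker.

```python
from collections import Counter
from itertools import combinations


def partition(transactions, n_parts):
    """Split the transaction list into n_parts slices (round-robin, an assumption)."""
    return [transactions[i::n_parts] for i in range(n_parts)]


def local_counts(part, candidates):
    """Map step: count each candidate itemset within one partition."""
    counts = Counter()
    for t in part:
        items = set(t)
        for c in candidates:
            if c <= items:
                counts[c] += 1
    return counts


def next_candidates(frequent, k):
    """Build k-itemsets whose (k-1)-subsets are all frequent (Apriori property)."""
    items = sorted({i for s in frequent for i in s})
    return [
        frozenset(c)
        for c in combinations(items, k)
        if all(frozenset(sub) in frequent for sub in combinations(c, k - 1))
    ]


def parallel_apriori(transactions, min_support, n_parts=2):
    """Return {itemset: global count} for itemsets with count >= min_support."""
    parts = partition(transactions, n_parts)
    candidates = [frozenset([i]) for i in sorted({i for t in transactions for i in t})]
    result = {}
    k = 1
    while candidates:
        # Reduce step: sum the per-partition counts into global counts.
        total = Counter()
        for part in parts:
            total.update(local_counts(part, candidates))
        frequent = {c: n for c, n in total.items() if n >= min_support}
        result.update(frequent)
        k += 1
        candidates = next_candidates(set(frequent), k)
    return result
```

For example, `parallel_apriori([["a", "b"], ["a", "b", "c"], ["a", "c"]], min_support=2)` returns the itemsets that occur in at least two transactions together with their counts. Because the global counts are sums of the local counts, the result does not depend on how many partitions are used.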
