Research of Cloud Computing Based on the Hadoop Platform

The application of cloud computing will cause mass of data accumulation so that how to manage data and distribute data storage space effectively become a hot topic recently. Hadoop was developed by the Apache Foundation to deal with massive data through parallel processing. In addition, Hadoop is applied widely as the most popular distributed platform. This paper mainly presents a Hadoop platform computing model and the Map/Reduce algorithm. We combined the K-means with data mining technology to implement the effectiveness analysis and application of the cloud computing platform.