Research on Hadoop Cloud Computing Model and its Applications
暂无分享,去创建一个
Hadoop is an open-source software platform for distributed computing dealing with a parallel processing of large data sets. It has been widely used in the field of cloud computing. This paper describes the three most crucial parts of Hadoop, including HDFS, the distributed file system, MapReduce, the data processing model, and HBase, the distributed structured data table. The application status, main research directions and existing problems of Hadoop data processing platform are analyzed, and some performance optimization suggestions are given.
[1] Chuck Lam,et al. Hadoop in Action , 2010 .
[2] Tom White,et al. Hadoop: The Definitive Guide , 2009 .
[3] Alan L. Cox,et al. The Hadoop distributed filesystem: Balancing portability and performance , 2010, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS).