Design of the mass multimedia files storage architecture based on Hadoop

With the development of social network and video sites, the amount of images and videos being uploaded to the internet is rapidly increasing, and it is more and more difficult to storage and process the mass network images and videos. How to storage, manage and classify mass network images and videos effectively, and provide an excellent experience for users has been an urgent problem. Hadoop distributed file system (HDFS) becomes a representative cloud storage platform, benefiting from its reliable, scalable and low-cost storage capability. However, HDFS is not designed for storing mass small files. This paper presents a Hadoop-based mass multimedia files storage architecture. The core idea of this architecture is that merge the small images and video files to a Bundle, and provide a unified interface to handle mass images and videos. It's a good solution to the problem of small files. The experimental results show that the approach can achieve a better performance.

[1]  Qinghua Zheng,et al.  A Novel Approach to Improving the Efficiency of Storing and Accessing Small Files on Hadoop: A Case Study by PowerPoint Files , 2010, 2010 IEEE International Conference on Services Computing.

[2]  Jun Wang,et al.  Improving metadata management for small files in HDFS , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[3]  Jason Lawrence,et al.  HIPI : A Hadoop Image Processing Interface for Image-based MapReduce Tasks , 2011 .

[4]  Yang Yang,et al.  Hadoop-based storage architecture for mass MP3 files: Hadoop-based storage architecture for mass MP3 files , 2013 .

[5]  Xubin He,et al.  Implementing WebGIS on Hadoop: A case study of improving small file I/O performance on HDFS , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[6]  Chen Yu Hadoop-based storage architecture for mass MP3 files , 2012 .

[7]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).