Color and Texture Feature Extraction Using Apache Hadoop Framework

Multimedia data is expanding exponentially. The rapid growth of technology combined with affordable storage and capabilities has lead to explosion in the availability and applications of multimedia. Most of the data is available in the form of images and videos. Today large amount of image data is produced through digital cameras, mobile phones and other sources. Processing of this large collection of images involve highly complex and repetitive operations on a large database leading to challenges of optimizing the query time and data storage capacity. Many image processing and computer vision algorithms are applicable to large-scale data tasks. It is often desirable to run the image processing algorithms on large data sets (e.g. larger than 1 TB) that are currently limited by the computational power of a single computer system. In order to handle such a huge data, we propose execution of time and space intensive computer vision algorithms on a distributed computing platform by using Apache Hadoop framework. Basically, Hadoop framework works based on divide and conquer strategy. The task of extracting color and texture features will be divided and assigned to multiple nodes of the Hadoop cluster. A significant speedup in computation time and efficient utilizations of memory can be achieved by exploiting the parallelism nature of Apache Hadoop framework. The Most important advantage of using Hadoop is, it is highly economical as whole framework can be implemented on existing commodity machines. Moreover, the system is highly fault tolerant and less vulnerable to node failures.

[1]  Avita Katal,et al.  Big data: Issues, challenges, tools and Good practices , 2013, 2013 Sixth International Conference on Contemporary Computing (IC3).

[2]  Mansour Jamzad,et al.  Fast Content Based Color Image Retrieval System Based on Texture Analysis of Edge Map , 2011 .

[3]  Mingjing Li,et al.  Color texture moments for content-based image retrieval , 2002, Proceedings. International Conference on Image Processing.

[4]  Zibin Zheng,et al.  Service-Generated Big Data and Big Data-as-a-Service: An Overview , 2013, 2013 IEEE International Congress on Big Data.