Acceleration of a content-based image-retrieval application on the RDISK cluster

Because of the growing use of multimedia content over Internet, content-based image retrieval (CBIR) has recently received a lot of interest. While accurate search techniques based on local image descriptors exist, they suffer from very long execution time. We propose to accelerate CBIR on the RDISK machine, a cluster of FPGA-enhanced hard-drives, that follows the philosophy of smart-disks. Our platform combines coarse and fine grain parallelism thanks to the concurrent use of the cluster nodes and of a programmable logic device. The implementation of the CBIR application on this mixed hardware/software platform follows a strict methodology, that was validated on realistic data-set (image database of more than 30,000 images). This methodology allows us to adapt the original algorithm to suit a hardware implementation, and to select the values of some key design parameters to maximize global performance. Our preliminary results indicate that speed-ups between 120 and 200 could be obtained for a cluster of 32 nodes compared with a software implementation running on a standard desktop PC

[1]  Christos Faloutsos,et al.  Searching Multimedia Databases by Content , 1996, Advances in Database Systems.

[2]  Daniel Ménard,et al.  Automatic evaluation of the accuracy of fixed-point algorithms , 2002, Proceedings 2002 Design, Automation and Test in Europe Conference and Exhibition.

[3]  James Ze Wang,et al.  Content-based image retrieval: approaches and trends of the new age , 2005, MIR '05.

[4]  Keith R. Dimond,et al.  A Hardware Implementation of a Content Based Image Retrieval Algorithm , 2004, FPL.

[5]  Christos Faloutsos,et al.  Active Disks for Large-Scale Data Processing , 2001, Computer.

[6]  Ramesh Radhakrishnan,et al.  Evaluating MMX technology using DSP and multimedia applications , 1998, Proceedings. 31st Annual ACM/IEEE International Symposium on Microarchitecture.

[7]  Patrice Quinton Automatic synthesis of systolic arrays from uniform recurrent equations , 1984, ISCA '84.

[8]  Mahmut T. Kandemir,et al.  Design and Evaluation of a Smart Disk Cluster for DSS Commercial Workloads , 2001, J. Parallel Distributed Comput..

[9]  Dominique Lavenier,et al.  Cluster of re-configurable nodes for scanning large genomic banks , 2005, Parallel Comput..

[10]  Ioannis Andreadis,et al.  Parallel Local Histogram Comparison Hardware Architecture for Content-Based Image Retrieval , 2004, J. Intell. Robotic Syst..

[11]  Koji Nakano,et al.  An image retrieval system using FPGAs , 2003, ASP-DAC '03.

[12]  Markus G. Kuhn,et al.  Attacks on Copyright Marking Systems , 1998, Information Hiding.

[13]  David A. Patterson,et al.  A case for intelligent disks (IDISKs) , 1998, SGMD.

[14]  Luis Pastor,et al.  Performance analysis of a CBIR system on shared-memory systems and heterogeneous clusters , 2005, Seventh International Workshop on Computer Architecture for Machine Perception (CAMP'05).

[15]  Patrick Gros,et al.  Content-based Retrieval Using Local Descriptors: Problems and Issues from a Database Perspective , 2001, Pattern Analysis & Applications.

[16]  Viktor K. Prasanna,et al.  Analysis of high-performance floating-point arithmetic on FPGAs , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[17]  Susmita Sur-Kolay,et al.  Combined instruction and loop parallelism in array synthesis for FPGAs , 2001, International Symposium on System Synthesis (IEEE Cat. No.01EX526).

[18]  Hai Jin,et al.  Active Disks: Programming Model, Algorithms and Evaluation , 2002 .

[19]  Edward Babb,et al.  Implementing a relational database by means of specialzed hardware , 1979, TODS.