Paper information - GPU acceleration of an image characterization algorithm for document similarity analysis

This paper aims to provide decision support for selecting a software and hardware architecture for content-based document comparison. We evaluate Java, C, CUDA C, and OpenCL implementations of an image characterization algorithm used for content-based document comparison, running on a CPU and on NVIDIA and AMD graphics processing units (GPUs). Based on our experimental results, we conclude that the original Java implementation, running on a CPU-based architecture, can be accelerated by a factor of 6 if it is re-implemented in C, or by a factor of almost 16 if it is re-implemented in CUDA C and run on an NVIDIA GTX 480 GPU. We also provide a power efficiency analysis.
