Self-Learning of Feature Regions for Image Recognition

Mobile systems are used in various environments. Thus, it is practical for image recognition systems to autonomously learn template images that are adaptive to objects in their various environments. However, learning the features of such objects requires large-scale computation and complex control. Hence, we propose an image recognition system that selects and learns regions that have a given object's features. This system is designed as a hardware/software (hw/sw) complex system with the multi-dimensional field programmable gate array (FPGA) “Vocalise.” This study discusses the possibility of dynamically building image databases and of real-time learning using the proposed image recognition system. Results indicate that the learning speed of the proposed method is estimated to be 1.4 × 103 faster than that obtained with a conventional software method. This suggests the possibility of real-time learning.

[1]  Kenji Kudo,et al.  Hardware Object Model and Its Application to the Image Processing(Video/Image Coding)( Applications and Implementations of Digital Signal Processing) , 2004 .

[2]  Harry Wechsler,et al.  The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[3]  Jesse Patrick Harvey,et al.  GPU acceleration of object classification algorithms using NVIDIA CUDA , 2009 .

[4]  Li Jiang,et al.  Voice Recognition System using hw/sw Complex , 2014 .

[5]  Sekine Masatoshi,et al.  Face detection with the union of hardware and software , 2007 .

[6]  Scott B. Baden,et al.  Accelerating Viola-Jones Face Detection to FPGA-Level Using GPUs , 2010, 2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines.

[7]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[8]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[9]  Amit K. Roy-Chowdhury,et al.  Evaluation and Acceleration of High-Throughput Fixed-Point Object Detection on FPGAs , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  H. Tamukoh,et al.  Internet booster: A networked HW/SW complex system and its application to HI-performance web application , 2010, 2010 World Automation Congress.

[11]  Hyeonjoon Moon,et al.  The FERET evaluation methodology for face-recognition algorithms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Kotaki Hiroshi,et al.  The voice recognition circuit by using hardware and software complex , 2008 .

[14]  Tamukoh Hakaru,et al.  Implementation of an Image Recognition System with Hierarchical Feature Learning Function , 2012 .