Bionic vision system and its application in license plate recognition

Conventional computer vision systems detect object after super-resolution (SR) or image reconstruction of the whole image, which is not an economical manner. By imitating the visual system of human beings, we proposed the bionic vision system (BVS), which is mainly composed by three parts: object detection by visual attention model, object-oriented SR reconstruction and object recognition by convolutional neural networks. The visual attention model contains both bottom-up and top-down cues. The bottom-up cues integrate low-level features by the feature integration theory. An Adaboost detector imitates the top-down cues. Sparse coding and compressed sensing reconstruction realize the object-oriented SR reconstruction. The BVS was validated on license plate recognition task. Both detection performance and SR reconstruction performance are tested. Besides of these, we also test the final recognition rate, all the experimental results are quite encouraging.

[1]  John K. Tsotsos,et al.  Saliency, attention, and visual search: an information theoretic approach. , 2009, Journal of vision.

[2]  A. Çapar,et al.  License Plate Recognition From Still Images and Video Sequences: A Survey , 2008, IEEE Transactions on Intelligent Transportation Systems.

[3]  Radu Berinde,et al.  Advances in sparse signal recovery methods , 2009 .

[4]  Zhang Ke Application of Human Vision Bionics in Detection,Estimation and Tracking for Photoelectric Target , 2006 .

[5]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[6]  Weidong Yi,et al.  License plate location based on improved visual attention model , 2012, International Conference on Machine Vision.

[7]  Andreas Koschan,et al.  Digital Color Image Processing , 2008 .

[8]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[9]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[10]  Peter Földiák,et al.  SPARSE CODING IN THE PRIMATE CORTEX , 2002 .

[11]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[12]  Hua Han,et al.  Wavelet-domain HMT-based image super-resolution , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[13]  Simone Frintrop,et al.  VOCUS: A Visual Attention System for Object Detection and Goal-Directed Search , 2006, Lecture Notes in Computer Science.

[14]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[15]  Marcus A. Butavicius,et al.  Super-resolution of Infrared Images: Does it Improve Operator Object Detection Performance? , 2010, J. Comput. Inf. Technol..

[16]  Thierry Pun,et al.  Integration of bottom-up and top-down cues for visual attention using non-linear relaxation , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[17]  A. Mizuno,et al.  A change of the leading player in flow Visualization technique , 2006, J. Vis..

[18]  S. Edelman Receptive Fields for Vision: from Hyperacuity to Object Recognition , 1995 .

[19]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[21]  Roxanne L. Canosa,et al.  Real-world vision: Selective perception and task , 2009, TAP.

[22]  Thomas B. Moeslund,et al.  Super-resolution: a comprehensive survey , 2014, Machine Vision and Applications.

[23]  Weidong Yi,et al.  License plate detection based on multistage information fusion , 2014, Inf. Fusion.

[24]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[25]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[26]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Kechen Zhang,et al.  A Sparse Object Coding Scheme in Area V4 , 2011, Current Biology.

[28]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[29]  Liming Zhang,et al.  A new method of images super-resolution restoration by neural networks , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[30]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Kwang In Kim,et al.  Single-Image Super-Resolution Using Sparse Regression and Natural Image Prior , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[33]  Pierre Baldi,et al.  Bayesian surprise attracts human attention , 2005, Vision Research.

[34]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[35]  S. Frick,et al.  Compressed Sensing , 2014, Computer Vision, A Reference Guide.

[36]  Michal Irani,et al.  Super-resolution from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[37]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  William T. Freeman,et al.  Learning Low-Level Vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[39]  Misha Denil,et al.  Learning Where to Attend with Deep Architectures for Image Tracking , 2011, Neural Computation.

[40]  J. Wolfe,et al.  What attributes guide the deployment of visual attention and how do they do it? , 2004, Nature Reviews Neuroscience.