Paper Information - Enhanced Object Recognition in Cortex-Like Machine Vision

This paper reports an extension of two earlier cortex-like machine vision models from MIT and Caltech, Graph-Based Visual Saliency (GBVS) and Feature Hierarchy Library (FHLIB), that remedies some of their drawbacks and thereby improves object recognition efficiency. The main contributions are enhancements in three areas: (a) features are extracted from the most salient region of interest (ROI) and arranged in ranked order, rather than extracted at random over the whole image as in the previous models; (b) larger patches are used in the C1 and S2 layers to improve spatial resolution; (c) a more versatile template matching mechanism removes the need to 'pre-store' the physical locations of features, as the previous models required. The improved model is validated on three different types of datasets and shows, on average, ~7% better recognition accuracy than the original FHLIB model.
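As a rough illustration of enhancement (a), the sketch below contrasts with random sampling by scoring candidate patches against a saliency map and returning them in ranked order. It is not the paper's implementation: the function name `ranked_salient_patches`, the non-overlapping grid of candidates, and the mean-saliency score are all assumptions made for this example, and the saliency map is taken as given (for instance, the output of a GBVS-style model).

```python
def ranked_salient_patches(image, saliency, patch_size, num_patches):
    """Return (score, (row, col), patch) tuples, most salient first.

    image and saliency are equally sized 2D lists of numbers.
    Assumption: a patch's score is the mean saliency it covers, and
    candidates lie on a non-overlapping grid. Both choices are
    illustrative, not taken from the paper.
    """
    rows, cols = len(image), len(image[0])
    candidates = []
    for r in range(0, rows - patch_size + 1, patch_size):
        for c in range(0, cols - patch_size + 1, patch_size):
            window = [saliency[i][c:c + patch_size]
                      for i in range(r, r + patch_size)]
            score = sum(map(sum, window)) / (patch_size * patch_size)
            patch = [image[i][c:c + patch_size]
                     for i in range(r, r + patch_size)]
            candidates.append((score, (r, c), patch))
    # Rank by saliency instead of sampling locations at random.
    candidates.sort(key=lambda item: item[0], reverse=True)
    return candidates[:num_patches]


# Toy 4x4 image whose bottom-right quadrant is the most salient.
image = [[r * 4 + c for c in range(4)] for r in range(4)]
saliency = [
    [0.1, 0.1, 0.2, 0.2],
    [0.1, 0.1, 0.2, 0.2],
    [0.5, 0.5, 0.9, 0.9],
    [0.5, 0.5, 0.9, 0.9],
]
top = ranked_salient_patches(image, saliency, patch_size=2, num_patches=2)
```

Here `top` holds the two highest-scoring patches, starting with the one at the bottom-right corner. A fuller model would extract such patches from C1-layer responses rather than from raw pixels.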
