Real-Time Visual Recognition of Objects and Scenes Using P-Channel Matching

In this paper we propose a new approach to real-time view-based object recognition and scene registration. Object recognition is an important sub-task in many applications, as e.g., robotics, retrieval, and surveillance. Scene registration is particularly useful for identifying camera views in databases or video sequences. All of these applications require a fast recognition process and the possibility to extend the database with new material, i.e., to update the recognition system online. The method that we propose is based on P-channels, a special kind of information representation which combines advantages of histograms and local linear models. Our approach is motivated by its similarity to information representation in biological systems but its main advantage is its robustness against common distortions as clutter and occlusion. The recognition algorithm extracts a number of basic, intensity invariant image features, encodes them into P-channels, and compares the query P-channels to a set of prototype P-channels in a database. The algorithm is applied in a cross-validation experiment on the COIL database, resulting in nearly ideal ROC curves. Furthermore, results from scene registration with a fish-eye camera are presented.

[1]  Matthew Brand,et al.  Incremental Singular Value Decomposition of Uncertain Data with Missing Values , 2002, ECCV.

[2]  Hans Knutsson,et al.  Signal processing for computer vision , 1994 .

[3]  Michael Felsberg,et al.  Channel smoothing: efficient robust smoothing of low-level signal features , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Björn Johansson,et al.  The application of an oblique-projected Landweber method to a model of supervised learning , 2006, Math. Comput. Model..

[5]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[6]  Jan-Olof Eklundh,et al.  A pure learning approach to background-invariant object recognition using pedagogical support vector learning , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[7]  Jochen Triesch,et al.  Analysis of a Biologically-Inspired System for Real-time Object Recognition , 2005 .

[8]  Sameer A. Nene,et al.  Columbia Object Image Library (COIL100) , 1996 .

[9]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[10]  Michael Felsberg,et al.  P-Channels: Robust Multivariate M-Estimation of Large Datasets , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[11]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[12]  Per-Erik Forssén,et al.  Low and Medium Level Vision Using Channel Representations , 2004 .

[13]  Massimiliano Pontil,et al.  Support Vector Machines for 3D Object Recognition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[15]  Michael Felsberg,et al.  Reconstruction of Probability Density Functions from Channel Representations , 2005, SCIA.

[16]  Michael Unser,et al.  Splines: a perfect fit for signal and image processing , 1999, IEEE Signal Process. Mag..

[17]  Jan J. Koenderink,et al.  Algebraic Frames for the Perception-Action Cycle , 1997, Lecture Notes in Computer Science.

[18]  Jan J. Koenderink,et al.  Discrimination thresholds for channel-coded systems , 1992, Biological Cybernetics.

[19]  Leszek Wojnar,et al.  Image Analysis , 1998 .

[20]  Gösta H. Granlund,et al.  An Associative Perception-Action Structure Using a Localized Space Variant Information Representation , 2000, AFPAC.

[21]  Stepán Obdrzálek,et al.  Sub-linear Indexing for Large Scale Object Recognition , 2005, BMVC.