Learning and Detection of Object Landmarks in Canonical Object Space

This work contributes to part-based object detection and recognition by introducing an enhanced method for local part detection. The method is based on complex-valued multiresolution Gabor features and their ranking using multiple hypothesis testing. In the present work, our main contribution is the introduction of a canonical object space, where objects are represented in their ``expected pose and visual appearance''. The canonical space circumvents the problem of geometric image normalisation prior to feature extraction. In addition, we define a compact set of Gabor filter parameters, from where the optimal values can be easily devised. These enhancements make our method an attractive landmark detector for part-based object detection and recognition methods.

[1]  Josef Kittler,et al.  Object Localisation Using Generative Probability Model for Spatial Constellation and Local Image Features , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[2]  Rajesh P. N. Rao,et al.  An Active Vision Architecture Based on Iconic Representations , 1995, Artif. Intell..

[3]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[4]  Daphna Weinshall,et al.  Efficient Learning of Relational Object Class Models , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  Cor J. Veenman,et al.  Kernel Codebooks for Scene Categorization , 2008, ECCV.

[6]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[7]  S. Umeyama,et al.  Least-Squares Estimation of Transformation Parameters Between Two Point Patterns , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Joni-Kristian Kämäräinen,et al.  EXPERIMENTAL STUDY ON FAST 2D HOMOGRAPHY ESTIMATION FROM A FEW POINT CORRESPONDENCES , 2009 .

[9]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[10]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[11]  Dan Roth,et al.  Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Samy Bengio,et al.  Measuring the performance of face localization systems , 2006, Image Vis. Comput..

[13]  Josef Kittler,et al.  Image Feature Localization by Multiple Hypothesis Testing of Gabor Features , 2008, IEEE Transactions on Image Processing.

[14]  David Cristinacce,et al.  Automatic feature localisation with constrained local models , 2008, Pattern Recognit..

[15]  Jiri Matas,et al.  Feature-based affine-invariant localization of faces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Cordelia Schmid,et al.  Constructing Category Hierarchies for Visual Recognition , 2008, ECCV.

[17]  Anil K. Jain,et al.  Unsupervised Learning of Finite Mixture Models , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[19]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[20]  Gustavo Carneiro,et al.  Flexible Spatial Configuration of Local Image Features , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.