Local Features for Image Retrieval

Global image features, although quite successful for a surprisingly large number of queries, are nevertheless restricted in their usefulness. They are less effective when e.g. looking for other images that contain the same object or simply the same scene but seen from a very different viewpoint. Relative amounts of colour and texture typically change dramatically in such cases. Hence, more local features are needed as well, that capture local image structure in a way that is robust against changes in viewpoint and illumination. We propose affinely invariant regions. These are extracted from single images in isolation. Their image shapes are viewpoint dependent in that they systematically cover the same physical part of surfaces. These ‘affinely invariant regions’ work in principle perfectly for planar shapes under pseudo-orthographic viewing conditions. Keeping them small helps to fulfill these conditions. Several ways of extracting such regions are discussed. Next, features are extracted from the colour patterns within the regions. These are invariant under the geometric and photometric changes that are considered. As a consequence, they can be directly compared between images. This is the basis for our simple retrieval algorithm.

[1]  Luc Van Gool,et al.  Recognizing color patterns irrespective of viewpoint and illumination , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[2]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Luc Van Gool,et al.  Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[4]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[5]  Amnon Shashua,et al.  The Rank 4 Constraint in Multiple (>=3) View Geometry , 1996, ECCV.

[6]  Stefan Carlsson Recognizing Walking People , 2000, ECCV.

[7]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[8]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[9]  Andrew Zisserman,et al.  Matching and Reconstruction from Widely Separated Views , 1998, SMILE.