Recognizing landmarks in sequences of images is a challenging problem for a number of reasons. First of all, the appearance of any given landmark varies substantially from one observation to the next. In addition to variations due to different aspects, an illumination change, external clutter, and changing geometry of the imaging devices are other factors affecting the variability of the observed landmarks. Finally, it is typically difficult to make use of accurate 3D information in landmark recognition applications. For those reasons, it is not possible to use many of the object recognition techniques based on strong geometric models. The alternative is to use image-based techniques in which landmarks are represented by collections of images which capture the “typical” appearance of the object. The information most relevant to recognition is extracted from the collection of raw images and used as the model for recognition. This process is often referred to as “visual learning.” Models of landmarks are acquired from image sequences and later recognized for vehicle localization in urban environments. In the acquisition phase, a vehicle drives and collects images of an unknown area. The algorithm organizes these images into groups with similar image features. The feature distribution for each group describes a landmark. In the recognition phase, while navigating through the same general area, the vehicle collects new images. The algorithm classifies these images into one of the learned groups, thus recognizing a landmark. Unlike computationally intensive model-based approaches that build models from known objects observed in isolation, our image-based approach automatically learns the most salient landmarks in complex environments. It delivers a robust performance under a wide range of lighting and imaging angle variations.
[1]
Radu Horaud,et al.
Finding Geometric and Relational Structures in an Image
,
1990,
ECCV.
[2]
Stefan Carlsson,et al.
Combinatorial Geometry for Shape Representation and Indexing
,
1996,
Object Representation in Computer Vision.
[3]
Patrick Gros,et al.
Rapid Object Indexing and Recognition Using Enhanced Geometric Hashing
,
1996,
ECCV.
[4]
Patrick Gros,et al.
Using Local Planar Geometric Invariants to Match and Model Images of Line Segments
,
1998,
Comput. Vis. Image Underst..
[5]
Amarnath Gupta,et al.
Virage image search engine: an open framework for image management
,
1996,
Electronic Imaging.
[6]
C. Tomasi.
The Earth Mover's Distance, Multi-Dimensional Scaling, and Color-Based Image Retrieval
,
1997
.
[7]
Dean A. Pomerleau.
Neural network-based vision processing for autonomous robot guidance
,
1991
.
[8]
Eric Krotkov,et al.
Position estimation from outdoor visual landmarks for teleoperation of lunar rovers
,
1996,
Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.
[9]
Cordelia Schmid,et al.
Combining greyvalue invariants with local constraints for object recognition
,
1996,
Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[10]
Rosalind W. Picard.
A Society of Models for Video and Image Libraries
,
1996,
IBM Syst. J..