Localization Based on Building Recognition

Navigational capabilities of people in urban areas are to a large extent determined by their knowledge of current location. In addition to location information available by means of global positioning sensors, images can provide additional and often complementary information about relative position and/or viewpoint of the person with respect to some known landmarks. In order to enable such functionality, landmarks (e.g. buildings) have to be reliably and efficiently recognized. In this paper we describe a hierarchical approach for recognition of buildings. At the first stage, we use a novel and efficient representation named localized color histograms. This representation enables efficient retrieval of a small number of candidate matches from the database. At the second stage, the recognition is refined by matching descriptors associated with local image regions. Once the correct building is identified, the relative pose with respect to the building is recovered. The proposed approach is validated by extensive experiments, with images taken in different weather conditions, seasons and with different cameras.

[1]  Wei Zhang,et al.  Video Compass , 2002, ECCV.

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Luc Van Gool,et al.  HPAT Indexing for Fast Object/Scene Recognition Based on Local Appearance , 2003, CIVR.

[5]  Alex Pentland,et al.  Recognizing Personal Location from Video , 1998 .

[6]  Jana Kosecka,et al.  Extraction, matching and pose recovery based on dominant rectangular structures , 2003, First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003. HLK 2003..

[7]  Luc Van Gool,et al.  Fast wide baseline matching for visual navigation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[8]  Wei Zhang,et al.  Extraction, matching and pose recovery based on dominant rectangular structures , 1989, First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003. HLK 2003..

[9]  Alan L. Yuille,et al.  Manhattan World: compass direction from a single image by Bayesian inference , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10]  Roberto Cipolla,et al.  An Image-Based System for Urban Navigation , 2004, BMVC.

[11]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[13]  Markus A. Stricker,et al.  Spectral covariance and fuzzy regions for image indexing , 1997, Machine Vision and Applications.

[14]  Cordelia Schmid,et al.  Affine-invariant local descriptors and neighborhood statistics for texture recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.