Building Detection and Recognition for an Automated Tour Guide

We present a building recognition module for an automated tour guide system which, instead of using user input or artificial landmarks, relies solely on real-time video recorded from a wearable camcorder to recognize faces of buildings and localize the user. Strong vertical edges are first detected on each frame to suggest the existence of building objects. Additional information such as color feature is collected to eliminate edges from trees, cars, and pedestrians. Vertical edges, in addition to horizontal edges, are further processed to locate window frames and thereafter faces of a building. An image section identified as a building face is then rectified for perspective distortion and recognized by a neural network. We train the neural network using whole and partial building face images before the recognition takes place.

[1]  Ah Chung Tsoi,et al.  Face Recognition: A Hybrid Neural Network Approach , 1998 .

[2]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[3]  V. Beranger,et al.  Neural network-based recognition of mine environments , 1996, Proceedings of 1996 Canadian Conference on Electrical and Computer Engineering.

[4]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[5]  Paul S. Heckbert,et al.  Fundamentals of Texture Mapping and Image Warping , 1989 .

[6]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  J. Canny A Computational Approach toEdgeDetection , 1986 .

[8]  N. Vincent,et al.  A local approach for fast line detection , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[9]  Katsushi Ikeuchi,et al.  Recognition of Urban Scene Using Silhouette of Buildings and City Map Database , 1998, ACCV.

[10]  Dean Pomerleau,et al.  Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.

[11]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[12]  William H. Press,et al.  Numerical recipes in C , 2002 .

[13]  Roger Y. Tsai,et al.  A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses , 1987, IEEE J. Robotics Autom..

[14]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Reg G. Willson Modeling and calibration of automated zoom lenses , 1994, Other Conferences.

[16]  Shamik Sural,et al.  Segmentation and histogram generation using the HSV color space for image retrieval , 2002, Proceedings. International Conference on Image Processing.

[17]  Ramakant Nevatia,et al.  Detection and Modeling of Buildings from Multiple Aerial Images , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[19]  Ramakant Nevatia,et al.  Automatic Integration of Facade Textures into 3D Building Models with a Projective Geometry Based Line Clustering , 2002, Comput. Graph. Forum.