A multi-scale learning approach for landmark recognition using mobile devices

The growing use of mobile camera phones has led to a proliferation of mobile applications. Landmark recognition is one such application that has gained increasing attention in recent years. The idea is that a user captures an image of a landmark or building with a camera phone, and the system then analyzes the image, identifies the landmark, and reports its name together with related information. This paper proposes a new mobile landmark recognition method. First, a set of multi-scale patches is extracted from the landmark images. Discriminative patches are then selected using a Gaussian mixture model (GMM). A combination of color, texture, and scale-invariant feature transform (SIFT) descriptors is extracted from the selected patches, and these descriptors are used to train a support vector machine (SVM) classifier for each landmark category. Experimental results on a database of 4000 landmark images illustrate the effectiveness of the proposed method.
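The first step, multi-scale patch extraction, can be illustrated with a minimal sketch. The abstract does not state the patch sizes, the sampling grid, or the overlap, so the function names `multiscale_patches` and `crop`, the default sizes (32, 64, 128 pixels), and the 50% overlap are all assumptions chosen for illustration, not the authors' settings. The GMM-based selection, descriptor extraction, and SVM training stages are not shown.

```python
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int]  # (left, top, side length) of a square patch


def multiscale_patches(width: int, height: int,
                       sizes: Sequence[int] = (32, 64, 128),
                       overlap: float = 0.5) -> List[Box]:
    """Enumerate square patches on a regular grid at several scales.

    The sizes and overlap are hypothetical defaults, not values from the paper.
    """
    if not 0.0 <= overlap < 1.0:
        raise ValueError("overlap must be in [0, 1)")
    boxes: List[Box] = []
    for size in sizes:
        if size > width or size > height:
            continue  # skip scales that do not fit inside the image
        # Step between neighbouring patches; at least one pixel.
        stride = max(1, int(size * (1.0 - overlap)))
        for top in range(0, height - size + 1, stride):
            for left in range(0, width - size + 1, stride):
                boxes.append((left, top, size))
    return boxes


def crop(image: Sequence[Sequence[int]], box: Box) -> List[List[int]]:
    """Cut one patch out of an image stored as a list of pixel rows."""
    left, top, size = box
    return [list(row[left:left + size]) for row in image[top:top + size]]
```

Each returned box could then be cropped and passed to a later selection stage; covering the same region at several scales is what allows a classifier to see both fine detail and larger structure of a landmark.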
