Image based Indian monument recognition using convoluted neural networks

Monument recognition is a challenging problem in the domain of image classification due to huge variations in the architecture of different monuments. Different orientations of the structure play an important role in the recognition of the monuments in their images. The paper proposes an approach for classification of various monuments based on the features of the monument images. The state-of-the-art Deep Convolutional Neural Networks (DCNN) is used for extracting representations. The model is trained on representations of different Indian monuments, obtained from cropped images, which exhibit geographic and cultural diversity. Experiments have been carried out on the manually acquired dataset that is composed of images of different monuments where each monument has images from different angular views. The experiments show the performance of the model when it is trained on representations of cropped images of the various monuments. The overall accuracy achieved is 92.7%, using DCNN, for a total of 100 different monuments that have been considered in the dataset for classification.

[1]  Georgios Triantafyllidis,et al.  Image based Monument Recognition using Graph based Visual Saliency , 2013 .

[2]  Li WangDong-Chen He,et al.  Texture classification using texture spectrum , 1990, Pattern Recognit..

[3]  Dong-Chen He,et al.  Texture Unit, Texture Spectrum, And Texture Analysis , 1990 .

[4]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[5]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[6]  Alexei A. Efros,et al.  Scene completion using millions of photographs , 2008, Commun. ACM.

[7]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[8]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[9]  Cordelia Schmid,et al.  Evaluation of GIST descriptors for web-scale image search , 2009, CIVR '09.

[10]  John Y. Cole,et al.  The Library of Congress : the art and architecture of the Thomas Jefferson Building , 1997 .

[11]  I. A. Shamov,et al.  Application of the convolutional neural network to design an algorithm for recognition of tower lighthouses , 2017, 2017 24th Saint Petersburg International Conference on Integrated Navigation Systems (ICINS).

[12]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[13]  Ali Maleki,et al.  Graph-based Visual Saliency Model using Background Color , 2018 .