Document Image Retrieval Based on Visual Saliency Maps

There has been a massive growth in the production of various unstructured, complex and multi-lingual digitised documents in recent years. Storing and manipulating such digitised documents towards a paperless society has been the objective of emerging technology. As the human visual system can easily distinguish the global summary of images, extracting features based on human attention from images is desirable to achieve more accurate document image retrieval results. Thus, in this research work, an appearance-based document image retrieval system using image saliency maps depending on human visual attention is proposed. The saliency map obtained from the input document image is used to generate a weighted document image. Features are then extracted from the weighted document images using the Gist operator. Then, locality-sensitive hashing is considered to compute similarity distances between a query and the document images in the knowledge-based database. To evaluate the performance of the proposed document image retrieval system MTDB, ITESOFT, and CLEF-IP datasets of document images were used for experimentation. The proposed document image retrieval system provided promising retrieval results compared to the results reported in the literature.

[1]  Laurent Itti,et al.  Saliency and Gist Features for Target Detection in Satellite Images , 2011, IEEE Transactions on Image Processing.

[2]  Alireza Alaei,et al.  Document image retrieval based on texture features and similarity fusion , 2016, 2016 International Conference on Image and Vision Computing New Zealand (IVCNZ).

[3]  Jilin Li,et al.  Document Image Retrieval with Local Feature Sequences , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[4]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[5]  Ling Huang,et al.  Saliency-based multi-feature modeling for semantic image retrieval , 2018, J. Vis. Commun. Image Represent..

[6]  Dacheng Tao,et al.  Database Saliency for Fast Image Retrieval , 2015, IEEE Transactions on Multimedia.

[7]  Alfred Haar On the Theory of Orthogonal Function Systems. * ) * * ) (first Communication.) , .

[8]  David S. Doermann,et al.  Logo Matching for Document Image Retrieval , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[9]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Martin D. Levine,et al.  Visual Saliency Based on Scale-Space Analysis in the Frequency Domain , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Allan Hanbury,et al.  CLEF-IP 2011: Retrieval in the Intellectual Property Domain , 2011, CLEF.

[12]  Meng Jian,et al.  Visual saliency estimation using constraints , 2018, Neurocomputing.

[13]  Jayant Kumar,et al.  Structural similarity for document image classification and retrieval , 2014, Pattern Recognit. Lett..

[14]  Jean-Yves Ramel,et al.  A New One-Class Classification Method Based on Symbolic Representation: Application to Document Classification , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[15]  Federica Battisti,et al.  A feature-based approach for saliency estimation of omni-directional images , 2018, Signal Process. Image Commun..

[16]  许超 Database Saliency for Fast Image Retrieval , 2015 .

[17]  Tasneem Mirza,et al.  Content based Image Retrieval using Color and Texture , 2016 .

[18]  Cordelia Schmid,et al.  Evaluation of GIST descriptors for web-scale image search , 2009, CIVR '09.

[19]  Josep Lladós,et al.  Key-Region Detection for Document Images -- Application to Administrative Document Retrieval , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[20]  Nan Li,et al.  A novel locality-sensitive hashing algorithm for similarity searches on large-scale hyperspectral data , 2016 .

[21]  Masakazu Iwamura,et al.  Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH , 2011, 2011 International Conference on Document Analysis and Recognition.

[22]  Alireza Alaei,et al.  Document Image Retrieval Based on Texture Features: A Recognition-Free Approach , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[23]  Alireza Alaei,et al.  Evaluation of Gist Operator for Document Image Retrieval , 2018, 2018 13th IAPR International Workshop on Document Analysis Systems (DAS).

[24]  Alireza Alaei,et al.  Logo and seal based administrative document image retrieval: A survey , 2016, Comput. Sci. Rev..

[25]  Alireza Alaei,et al.  A comparative study of different texture features for document image retrieval , 2019, Expert Syst. Appl..

[26]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[27]  Wei Liu,et al.  Robust scene classification by Gist with angular radial partitioning , 2012, 2012 5th International Symposium on Communications, Control and Signal Processing.

[28]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Zhihui Lai,et al.  Multi-scale gist feature manifold for building recognition , 2011, Neurocomputing.