Detected text-based image retrieval approach for textual images

This work addresses the problem of searching and retrieving similar textual images based on the detected text and opens the new directions for textual image retrieval. For image retrieval, several methods have been proposed to extract visual features and social tags; however, to extract embedded and scene text within images and use that text as automatic keywords/tags is still a young research field for text-based and content-based image retrieval applications. The automatic text detection retrieval is an emerging technology for robotics and artificial intelligence. In this study, the authors have proposed a novel approach to detect the text in an image and exploit it as keywords and tags for automatic text-based image retrieval. First, text regions are detected using maximally stable extremal region algorithm. Second, unwanted false positive text regions are eliminated based on geometric properties and stroke width transform. Next, the true text regions are proceeded into optical character recognition for recognition. Third, keywords are formed using a neural probabilistic language model. Finally, the textual images are indexed and retrieved based on the detected keywords. The experimental results on two benchmark datasets show the dominancy of text is efficient and valuable for image retrieval specifically for textual images.

[1]  Jiangye Yuan,et al.  Image feature based GPS trace filtering for road network generation and road segmentation , 2015, Machine Vision and Applications.

[2]  Yilong Yin,et al.  Hybrid textual-visual relevance learning for content-based image retrieval , 2017, J. Vis. Commun. Image Represent..

[3]  Jing-Yu Yang,et al.  Content-based image retrieval using computational visual attention model , 2015, Pattern Recognit..

[4]  Jie Liu,et al.  A cascaded method for text detection in natural scene images , 2017, Neurocomputing.

[5]  Lei Wu,et al.  Tag Completion for Image Retrieval , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Hai-Miao Hu,et al.  A hierarchal BoW for image retrieval by enhancing feature salience , 2016, Neurocomputing.

[7]  Jun Wu,et al.  Global Correlation Descriptor: A novel image representation for image retrieval , 2015, J. Vis. Commun. Image Represent..

[8]  Youbao Tang,et al.  Scene Text Detection and Segmentation Based on Cascaded Convolution Neural Networks , 2017, IEEE Transactions on Image Processing.

[9]  Xiangyang Wang,et al.  Content-based image retrieval using local visual attention feature , 2014, J. Vis. Commun. Image Represent..

[10]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[11]  Chunheng Wang,et al.  Scene text detection using graph model built upon maximally stable extremal regions , 2013, Pattern Recognit. Lett..

[12]  Rassoul Amirfattahi,et al.  A New Content-Based Image Retrieval System Based on Optimized Integration of DCD , Wavelet and Curvelet Features , 2016 .

[13]  Wei Shen,et al.  Text detection in scene images based on exhaustive segmentation , 2017, Signal Process. Image Commun..

[14]  Xingyuan Wang,et al.  A novel method for image retrieval based on structure elements' descriptor , 2013, J. Vis. Commun. Image Represent..

[15]  Xingyuan Wang,et al.  The method for image retrieval based on multi-factors correlation utilizing block truncation coding , 2014, Pattern Recognit..

[16]  Saso Dzeroski,et al.  Improving bag-of-visual-words image retrieval with predictive clustering trees , 2016, Inf. Sci..

[17]  Akhtar Hussain Jalbani,et al.  Artificial Urdu Text Detection and Localization from Individual Video Frames , 2018 .

[18]  Aman Pal,et al.  Fusion framework for effective color image retrieval , 2014, J. Vis. Commun. Image Represent..

[19]  Qifeng Liu,et al.  A stroke filter and its application to text localization , 2009, Pattern Recognit. Lett..

[20]  Hengyou Wang,et al.  Separable vocabulary and feature fusion for image retrieval based on sparse representation , 2017, Neurocomputing.

[21]  Abbes Amira,et al.  Semantic content-based image retrieval: A comprehensive study , 2015, J. Vis. Commun. Image Represent..

[22]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[23]  Kosin Chamnongthai,et al.  Fusion of color histogram and LBP-based features for texture image retrieval and classification , 2017, Inf. Sci..