Google image search refinement: Finding text in images using local features

In this paper, we implement a Google image search refinement that utilizes local features to find Chinese characters in search results, including following stages: (1) Chinese characters images and their SIFT features (SDB) are generated offline, (2) A text-based image search results are retrieved from the Google, (3) SIFT features of results are matched to query-text SDB using MPLSH, (4) A geometric verification algorithm is used to find the query-text and rerank results. Experiment results show that our approach is simple and effective in recognition of text in natural images, and is helpful to refine the web image search.

[1]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[2]  Xilin Chen,et al.  Automatic detection and recognition of signs from natural scenes , 2004, IEEE Transactions on Image Processing.

[3]  David S. Doermann,et al.  Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[4]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[5]  Masashi Koga,et al.  Camera-based Kanji OCR for mobile-phones: practical issues , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[6]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  T. Tuytelaars,et al.  A Survey on Local Invariant Features , 2006 .

[8]  Zhe Wang,et al.  Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search , 2007, VLDB.

[9]  Cheng-Lin Liu,et al.  Text Localization in Natural Scene Images Based on Conditional Random Field , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[10]  Jiri Matas,et al.  Geometric min-Hashing: Finding a (thick) needle in a haystack , 2009, CVPR.

[11]  Kai Chen,et al.  Text Localization and Recognition in Complex Scenes Using Local Features , 2010, ACCV.

[12]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .