论文信息 - Query by string word spotting based on character bi-gram indexing

Query by string word spotting based on character bi-gram indexing

In this paper we propose a segmentation-free query by string word spotting method. Both the documents and query strings are encoded using a recently proposed word representation that projects images and strings into a common attribute space based on a Pyramidal Histogram of Characters (PHOC). These attribute models are learned using linear SVMs over the Fisher Vector [8] representation of the images along with the PHOC labels of the corresponding strings. In order to search through the whole page, document regions are indexed per character bi-gram using a similar attribute representation. On top of that, we propose an integral image representation of the document using a simplified version of the attribute model for efficient computation. Finally we introduce a re-ranking step in order to boost retrieval performance. We show state-of-the-art results for segmentation-free query by string word spotting in single-writer and multi-writer standard datasets.

Ernest Valveny | Suman K. Ghosh | Ernest Valveny

[1] Andreas Keller,et al. Lexicon-free handwritten word spotting using character HMMs , 2012, Pattern Recognit. Lett..

[2] Sergios Theodoridis,et al. Keyword-guided word spotting in historical printed documents using synthetic data and user feedback , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[3] Michael Isard,et al. Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[4] Lior Wolf,et al. A Simple and Fast Word Spotting Method , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[5] Frank Lebourgeois,et al. Towards an omnilingual word retrieval system for ancient manuscripts , 2009, Pattern Recognit..

[6] Horst Bunke,et al. The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[7] C. V. Jawahar,et al. Character N-Gram Spotting on Handwritten Documents Using Weakly-Supervised Segmentation , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[8] Ernest Valveny,et al. Word Spotting and Recognition with Embedded Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[10] Ernest Valveny,et al. Segmentation-free word spotting with exemplar SVMs , 2014, Pattern Recognit..

[11] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12] Samy Bengio,et al. Offline recognition of unconstrained handwritten texts using HMMs and statistical language models , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Volkmar Frinken,et al. A Novel Word Spotting Method Based on Recurrent Neural Networks , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Michael N Jones,et al. Case-sensitive letter and bigram frequency counts from large-scale English corpora , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[15] F. Perronnin,et al. Local gradient histogram features for word spotting in unconstrained handwritten documents , 2008 .

[16] Michael C. Fairhurst,et al. A synthesised word approach to word retrieval in handwritten documents , 2012, Pattern Recognit..

[17] C. V. Jawahar,et al. Robust Recognition of Degraded Documents Using Character N-Grams , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[18] Giovanni Soda,et al. Font adaptive word indexing of modern printed documents , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Andrew Zisserman,et al. Three things everyone should know to improve object retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.