A survey of document image word spotting techniques

This work reviews the word spotting methods for document indexing.The nature of texts addressed by word spotting techniques is analyzed.The core steps that compose a word spotting system are thoroughly explored.Several boosting mechanisms which enhance the retrieved results are examined.Results achieved by the state of the art imply that there are still goals to be reached. Vast collections of documents available in image format need to be indexed for information retrieval purposes. In this framework, word spotting is an alternative solution to optical character recognition (OCR), which is rather inefficient for recognizing text of degraded quality and unknown fonts usually appearing in printed text, or writing style variations in handwritten documents. Over the past decade there has been a growing interest in addressing document indexing using word spotting which is reflected by the continuously increasing number of approaches. However, there exist very few comprehensive studies which analyze the various aspects of a word spotting system. This work aims to review the recent approaches as well as fill the gaps in several topics with respect to the related works. The nature of texts and inherent challenges addressed by word spotting methods are thoroughly examined. After presenting the core steps which compose a word spotting system, we investigate the use of retrieval enhancement techniques based on relevance feedback which improve the retrieved results. Finally, we present the datasets which are widely used for word spotting, we describe the evaluation standards and measures applied for performance assessment and discuss the results achieved by the state of the art.

[1]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[2]  Francesc Moreno-Noguer,et al.  Deformation and illumination invariant feature point descriptor , 2011, CVPR 2011.

[3]  João Miguel da Costa Sousa,et al.  Word Indexing of Ancient Documents Using Fuzzy Classification , 2007, IEEE Transactions on Fuzzy Systems.

[4]  C. V. Jawahar,et al.  Character N-Gram Spotting on Handwritten Documents Using Weakly-Supervised Segmentation , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[5]  Konstantinos Zagoris,et al.  Segmentation-Based Historical Handwritten Word Spotting Using Document-Specific Local Features , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[6]  Venu Govindaraju,et al.  Statistical script independent word spotting in offline handwritten documents , 2014, Pattern Recognit..

[7]  Christophoros Nikou,et al.  Word Spotting in Handwritten Text Using Contour-Based Models , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[8]  Christophe Garcia,et al.  A Comprehensive Representation Model for Handwriting Dedicated to Word Spotting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[9]  Shijian Lu,et al.  Retrieval of machine-printed Latin documents through Word Shape Coding , 2008, Pattern Recognit..

[10]  Jean-Yves Ramel,et al.  Word Retrieval in Historical Document Using Character-Primitives , 2011, 2011 International Conference on Document Analysis and Recognition.

[11]  C. V. Jawahar,et al.  Matching word images for content-based retrieval from printed document images , 2008, International Journal of Document Analysis and Recognition (IJDAR).

[12]  Josep Lladós,et al.  A study of Bag-of-Visual-Words representations for handwritten keyword spotting , 2015, International Journal on Document Analysis and Recognition (IJDAR).

[13]  Raid Saabni,et al.  Efficient Word Image Retrieval Using Earth Movers Distance Embedded to Wavelets Coefficients Domain , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[14]  R. Manmatha,et al.  Holistic word recognition for handwritten historical documents , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[15]  Pinar Duygulu Sahin,et al.  A line-based representation for matching words in historical manuscripts , 2011, Pattern Recognit. Lett..

[16]  Muriel Visani,et al.  Cursive On-line Handwriting Word Recognition Using a Bi-character Model for Large Lexicon Applications , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[17]  Imran Siddiqi,et al.  Towards Searchable Digital Urdu Libraries - A Word Spotting Based Retrieval Approach , 2011, 2011 International Conference on Document Analysis and Recognition.

[18]  Bing Zhang,et al.  Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[19]  Alicia Fornés,et al.  A Coarse-to-Fine Word Spotting Approach for Historical Handwritten Documents Based on Graph Embedding and Graph Edit Distance , 2014, 2014 22nd International Conference on Pattern Recognition.

[20]  Nicole Vincent,et al.  Fusion of Word Spotting and Spatial Information for Figure Caption Retrieval in Historical Document Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[21]  C. V. Jawahar,et al.  Bringing Semantics in Word Image Retrieval , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[22]  Raid Saabni,et al.  Fast Keyword Searching Using 'BoostMap' Based Embedding , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[23]  Ernest Valveny,et al.  Query by string word spotting based on character bi-gram indexing , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[24]  Geetha Srikantan,et al.  A multiple feature/resolution approach to handprinted digit and character recognition , 1996, Int. J. Imaging Syst. Technol..

[25]  Konstantinos Zagoris,et al.  ICFHR 2014 Competition on Handwritten Keyword Spotting (H-KWS 2014) , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[26]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[27]  Kuanquan Wang,et al.  Chinese Keyword Spotting Using Knowledge-Based Clustering , 2011, 2011 International Conference on Document Analysis and Recognition.

[28]  Volkmar Frinken,et al.  Keyword Spotting in Online Handwritten Documents Containing Text and Non-text Using BLSTM Neural Networks , 2011, 2011 International Conference on Document Analysis and Recognition.

[29]  Ernest Valveny,et al.  Word Spotting and Recognition with Embedded Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Christian Viard-Gaudin,et al.  Lexicon-Based Word Recognition Using Support Vector Machine and Hidden Markov Model , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[31]  Cheng-Lin Liu,et al.  Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Anders Brun,et al.  Semantic and Verbatim Word Spotting Using Deep Neural Networks , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[33]  Chew Lim Tan,et al.  Keyword Spotting in Document Images through Word Shape Coding , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[34]  Simone Marinai Text retrieval from early printed books , 2009, AND '09.

[35]  José A. Rodríguez-Serrano,et al.  Synthesizing queries for handwritten word image retrieval , 2012, Pattern Recognit..

[36]  Mohamed Cheriet,et al.  Application of Multi-Level Classifiers and Clustering for Automatic Word Spotting in Historical Document Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[37]  Gernot A. Fink,et al.  Bag-of-Features HMMs for Segmentation-Free Word Spotting in Handwritten Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[38]  Ching Y. Suen,et al.  Arabic Handwritten Text Line Extraction by Applying an Adaptive Mask to Morphological Dilation , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[39]  Gernot A. Fink,et al.  A Modified Isomap Approach to Manifold Learning in Word Spotting , 2015, GCPR.

[40]  Venu Govindaraju,et al.  Online Handwritten Cursive Word Recognition Using Segmentation-Free MRF in Combination with P2DBMN-MQDF , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[41]  George Kollios,et al.  BoostMap: A method for efficient approximate similarity rankings , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[42]  Imran Siddiqi,et al.  Word Spotting Based Retrieval of Urdu Handwritten Documents , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[43]  Jihad El-Sana,et al.  Word Spotting Using Radial Descriptor , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[44]  C. V. Jawahar,et al.  Towards more effective distance functions for word image matching , 2010, DAS '10.

[45]  Yi Yang,et al.  A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Liang Huang,et al.  Keyword spotting in unconstrained handwritten Chinese documents using contextual word model , 2013, Image Vis. Comput..

[47]  Guanglai Gao,et al.  A Method for Removing Inflectional Suffixes in Word Spotting of Mongolian Kanjur , 2011, 2011 International Conference on Document Analysis and Recognition.

[48]  Rodney M. Goodman,et al.  Keyword spotting for cursive document retrieval , 1997, Proceedings Workshop on Document Image Analysis (DIA'97).

[49]  Gernot A. Fink,et al.  PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[50]  Josep Lladós,et al.  Efficient segmentation-free keyword spotting in historical document collections , 2015, Pattern Recognit..

[51]  Yousri Kessentini,et al.  Word Spotting and Regular Expression Detection in Handwritten Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[52]  C. V. Jawahar,et al.  Efficient Search in Document Image Collections , 2007, ACCV.

[53]  Nikos A. Nikolaou,et al.  Segmentation of historical machine-printed documents using Adaptive Run Length Smoothing and skeleton segmentation paths , 2010, Image Vis. Comput..

[54]  Andreas Keller,et al.  HMM-based Word Spotting in Handwritten Documents Using Subword Models , 2010, 2010 20th International Conference on Pattern Recognition.

[55]  Santanu Chaudhury,et al.  Word image based latent semantic indexing for conceptual querying in document image databases , 2007 .

[56]  Sargur N. Srihari,et al.  Handwritten Word Recognition Using Conditional Random Fields , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[57]  Josep Lladós,et al.  Boosting the handwritten word spotting experience by including the user in the loop , 2014, Pattern Recognit..

[58]  Josep Lladós,et al.  Browsing Heterogeneous Document Collections by a Segmentation-Free Word Spotting Method , 2011, 2011 International Conference on Document Analysis and Recognition.

[59]  Ying Wen,et al.  HoG based two-directional Dynamic Time Warping for handwritten word spotting , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[60]  Venu Govindaraju,et al.  A probabilistic method for keyword retrieval in handwritten document images , 2009, Pattern Recognit..

[61]  Xi Zhang,et al.  Image Based Retrieval and Keyword Spotting in Documents , 2014, Handbook of Document Image Processing and Recognition.

[62]  Chew Lim Tan,et al.  A Fast Keyword-Spotting Technique , 2007 .

[63]  Venu Govindaraju,et al.  Template-free word spotting in low-quality manuscripts , 2006 .

[64]  Ernest Valveny,et al.  Deformable HOG-Based Shape Descriptor , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[65]  Volkmar Frinken,et al.  A Novel Word Spotting Method Based on Recurrent Neural Networks , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[67]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..

[68]  Lambert Schomaker,et al.  Separability versus Prototypicality in Handwritten Word Retrieval , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[69]  Nicholas R. Howe,et al.  A Laplacian Energy for Document Binarization , 2011, 2011 International Conference on Document Analysis and Recognition.

[70]  Guanglai Gao,et al.  A multiple instances approach to improving keyword spotting on historical Mongolian document images , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[71]  A. Papandreou,et al.  Adaptive Zoning Features for Character and Word Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[72]  Jean-Yves Ramel,et al.  Word Spotting in Bangla and English Graphical Documents , 2014, 2014 22nd International Conference on Pattern Recognition.

[73]  Kaspar Riesen,et al.  Approximate graph edit distance computation by means of bipartite graph matching , 2009, Image Vis. Comput..

[74]  Volkmar Frinken,et al.  Adapting BLSTM Neural Network Based Keyword Spotting Trained on Modern Data to Historical Documents , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[75]  L. Vincent Google Book Search: Document Understanding on a Massive Scale , 2007 .

[76]  Pinar Duygulu Sahin,et al.  Matching ottoman words: an image retrieval approach to historical document indexing , 2007, CIVR '07.

[77]  Xi Zhang,et al.  Segmentation-Free Keyword Spotting for Handwritten Documents Based on Heat Kernel Signature , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[78]  Alejandro Héctor Toselli Rossi,et al.  Fast HMM-Filler Approach for Key Word Spotting in Handwritten Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[79]  José A. Rodríguez-Serrano,et al.  Handwritten word-spotting using hidden Markov models and universal vocabularies , 2009, Pattern Recognit..

[80]  Keisuke Kameyama,et al.  An Application-Independent and Segmentation-Free Approach for Spotting Queries in Document Images , 2014, 2014 22nd International Conference on Pattern Recognition.

[81]  Volker Märgner,et al.  An Historical Handwritten Arabic Dataset for Segmentation-Free Word Spotting - HADARA80P , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[82]  Andrew Zisserman,et al.  Fisher Vector Faces in the Wild , 2013, BMVC.

[83]  C. V. Jawahar,et al.  Document Specific Sparse Coding for Word Retrieval , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[84]  Giovanni Soda,et al.  Digital Libraries and Document Image Retrieval Techniques: A Survey , 2011, Learning Structure and Schemas from Documents.

[85]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[86]  Venu Govindaraju,et al.  Unconstrained handwritten document retrieval , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[87]  Dhavachelvan Ponnurangam,et al.  A survey of keyword spotting techniques for printed document images , 2010, Artificial Intelligence Review.

[88]  Ching Y. Suen,et al.  Word Spotting in Gray Scale Handwritten Pashto Documents , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[89]  Jonathan J. Hull,et al.  Keyword Location in Noisy Document Images , 1993 .

[90]  Frank Lebourgeois,et al.  Towards an omnilingual word retrieval system for ancient manuscripts , 2009, Pattern Recognit..

[91]  Jean-Yves Ramel,et al.  Exemplary Sequence Cardinality: An effective application for word spotting , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[92]  Venu Govindaraju,et al.  Historical document image enhancement using background light intensity normalization , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[93]  Klaus D. Tönnies,et al.  Robust Line Detection in Historical Church Registers , 2001, DAGM-Symposium.

[94]  Horst Bunke,et al.  Tree structure for word extraction from handwritten text lines , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[95]  Vassilis Katsouros,et al.  Handwritten document image segmentation into text lines and words , 2010, Pattern Recognit..

[96]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[97]  R. Manmatha,et al.  Word spotting for historical documents , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[98]  Laurent Heutte,et al.  Spot It! Finding Words and Patterns in Historical Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[99]  Heng Zhang,et al.  Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents , 2014, Pattern Recognit..

[100]  Frank Lebourgeois,et al.  Text search for medieval manuscript images , 2007, Pattern Recognit..

[101]  R. Manmatha,et al.  Word image matching using dynamic time warping , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[102]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[103]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[104]  Marcus Liwicki,et al.  Combining On-Line and Off-Line Systems for Handwriting Recognition , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[105]  Gernot A. Fink,et al.  Retrieving Cuneiform Structures in a Segmentation-free Word Spotting Framework , 2015, HIP@ICDAR.

[106]  Harold Mouchère,et al.  SpottingNet: Learning the Similarity of Word Images with Convolutional Neural Network for Word Spotting in Handwritten Historical Documents , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[107]  F. Perronnin,et al.  Local gradient histogram features for word spotting in unconstrained handwritten documents , 2008 .

[108]  Ernest Valveny,et al.  Efficient Exemplar Word Spotting , 2012, BMVC.

[109]  Basilios Gatos,et al.  Efficient Word Recognition Using a Pixel-Based Dissimilarity Measure , 2011, 2011 International Conference on Document Analysis and Recognition.

[110]  Michael C. Fairhurst,et al.  A synthesised word approach to word retrieval in handwritten documents , 2012, Pattern Recognit..

[111]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[112]  Thomas Mensink,et al.  Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[113]  R. Manmatha,et al.  An Efficient Framework for Searching Text in Noisy Document Images , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[114]  Ioannis Pratikakis,et al.  Text line and word segmentation of handwritten documents , 2009, Pattern Recognit..

[115]  Venu Govindaraju,et al.  Handwritten Carbon Form Preprocessing Based on Markov Random Field , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[116]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[117]  Sergios Theodoridis,et al.  Keyword-guided word spotting in historical printed documents using synthetic data and user feedback , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[118]  Edward M. Riseman,et al.  Word spotting: a new approach to indexing handwriting , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[119]  Xi Zhang,et al.  Handwritten word image matching based on Heat Kernel Signature , 2015, Pattern Recognit..

[120]  Alicia Fornés,et al.  Handwritten Word Spotting in Old Manuscript Images Using a Pseudo-structural Descriptor Organized in a Hash Structure , 2011, IbPRIA.

[121]  Alicia Fornés,et al.  A keyword spotting approach using blurred shape model-based descriptors , 2011, HIP '11.

[122]  Ching Y. Suen,et al.  Learning-based word spotting system for Arabic handwritten documents , 2014, Pattern Recognit..

[123]  Basilios Gatos,et al.  Efficient Word Retrieval Using a Multiple Ranking Combination Scheme , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[124]  Nicholas R. Howe,et al.  Part-Structured Inkball Models for One-Shot Handwritten Word Spotting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[125]  Nicholas R. Howe Inkball models for character localization and out-of-vocabulary word spotting , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[126]  Ioannis Pratikakis,et al.  A word spotting framework for historical machine-printed documents , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[127]  Rabia Nuray-Turan,et al.  Automatic ranking of information retrieval systems using data fusion , 2006, Inf. Process. Manag..

[128]  Xi Zhang,et al.  Unconstrained Handwritten Word Recognition Based on Trigrams Using BLSTM , 2014, 2014 22nd International Conference on Pattern Recognition.

[129]  Jean-Yves Ramel,et al.  A Fast Word Retrieval Technique Based on Kernelized Locality Sensitive Hashing , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[130]  Santanu Chaudhury,et al.  Word shape descriptor-based document image indexing: a new DBH-based approach , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[131]  Ernest Valveny,et al.  Segmentation-free word spotting with exemplar SVMs , 2014, Pattern Recognit..

[132]  Dan S. Bloomberg,et al.  Word spotting in scanned images using hidden Markov models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[133]  Ernest Valveny,et al.  A Sliding Window Framework for Word Spotting Based on Word Attributes , 2015, IbPRIA.

[134]  Alejandro Héctor Toselli,et al.  ICDAR2015 Competition on Keyword Spotting for Handwritten Documents , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[135]  Fei Yin,et al.  Handwritten Chinese text line segmentation by clustering with distance metric learning , 2009, Pattern Recognit..

[136]  Volkmar Frinken,et al.  Keyword spotting for self-training of BLSTM NN based handwriting recognition systems , 2014, Pattern Recognit..

[137]  C. V. Jawahar,et al.  Word Image Retrieval Using Bag of Visual Words , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[138]  Alejandro Héctor Toselli Rossi,et al.  Word-Graph-Based Handwriting Keyword Spotting of Out-of-Vocabulary Queries , 2014, 2014 22nd International Conference on Pattern Recognition.

[139]  Chafic Mokbel,et al.  Dynamic and Contextual Information in HMM Modeling for Handwritten Word Recognition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[140]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[141]  Venu Govindaraju,et al.  A Bayesian Approach to Script Independent Multilingual Keyword Spotting , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[142]  Salvador España Boquera,et al.  Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[143]  Giovanni Seni,et al.  External word segmentation of off-line handwritten text lines , 1994, Pattern Recognit..

[144]  Basilios Gatos,et al.  Using attributes for word spotting and recognition in polytonic greek documents , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[145]  Lior Wolf,et al.  A Simple and Fast Word Spotting Method , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[146]  A. Papandreou,et al.  An adaptive zoning technique for efficient word retrieval using dynamic time warping , 2014, DATeCH '14.

[147]  Nikos Papamarkos,et al.  A Document Image Retrieval System , 2010, Eng. Appl. Artif. Intell..

[148]  Nicole Vincent,et al.  Word spotting in historical printed documents using shape and sequence comparisons , 2012, Pattern Recognit..

[149]  Venu Govindaraju,et al.  Script Independent Word Spotting in Multilingual Documents , 2008, IJCNLP.

[150]  Basilios Gatos,et al.  Zoning Aggregated Hypercolumns for Keyword Spotting , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[151]  Fabio Roli,et al.  Instance-Based Relevance Feedback in Image Retrieval Using Dissimilarity Spaces , 2008, Case-Based Reasoning on Images and Signals.

[152]  Ch. Choisy Dynamic Handwritten Keyword Spotting Based on the NSHP-HMM , 2007 .

[153]  Ching Y. Suen,et al.  A Novel Handwritten Urdu Word Spotting Based on Connected Components Analysis , 2010, 2010 20th International Conference on Pattern Recognition.

[154]  C. V. Jawahar,et al.  Deep Feature Embedding for Accurate Recognition and Retrieval of Handwritten Text , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[155]  José A. Rodríguez-Serrano,et al.  A Model-Based Sequence Similarity with Application to Handwritten Word Spotting , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[156]  Josep Lladós,et al.  Integrating Visual and Textual Cues for Query-by-String Word Spotting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[157]  C. V. Jawahar,et al.  Enhancing Word Image Retrieval in Presence of Font Variations , 2014, 2014 22nd International Conference on Pattern Recognition.

[158]  Jean-Yves Ramel,et al.  Performance evaluation of DTW and its variants for word spotting in degraded documents , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[159]  Venu Govindaraju,et al.  Keyword Spotting Framework Using Dynamic Background Model , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[160]  Georgios Louloudis,et al.  Keyword Spotting in Handwritten Documents Using Projections of Oriented Gradients , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[161]  Alicia Fornés,et al.  On the Influence of Word Representations for Handwritten Word Spotting in Historical Documents , 2012, Int. J. Pattern Recognit. Artif. Intell..

[162]  Anil K. Jain,et al.  Object detection using gabor filters , 1997, Pattern Recognit..

[163]  Lasko Laskov Adaptive Document Image Binarization with Application in Processing Astronomical Logbooks , 2012 .

[164]  Jean-Yves Ramel,et al.  Flexible Sequence Matching Technique: Application to Word Spotting in Degraded Documents , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[165]  Ioannis Pratikakis,et al.  Segmentation-free Word Spotting in Historical Printed Documents , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[166]  Josep Lladós,et al.  Query driven word retrieval in graphical documents , 2010, DAS '10.

[167]  Gernot A. Fink,et al.  Grouping Historical Postcards Using Query-by-Example Word Spotting , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[168]  Florent Perronnin,et al.  Score Normalization for HMM-based Word Spotting Using a Universal Background Model , 2008 .

[169]  Alicia Fornés,et al.  Handwritten word spotting by inexact matching of grapheme graphs , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[170]  W. Russell,et al.  Continuous hidden Markov modeling for speaker-independent word spotting , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[171]  Jean-Marc Ogier,et al.  Segmentation and Word Spotting Methods for Printed and Handwritten Arabic Texts: A Comparative Study , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[172]  Sargur N. Srihari,et al.  Language Independent Word Spotting in Scanned Documents , 2008, ICADL.

[173]  Gernot A. Fink,et al.  Segmentation-free query-by-string word spotting with Bag-of-Features HMMs , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[174]  Xi Zhang,et al.  Handwritten word image matching based on Heat Kernel Signature , 2013, Pattern Recognit..

[175]  Thomas Mensink,et al.  Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.

[176]  Yuzuru Tanaka,et al.  Slit Style HOG Feature for Document Image Word Spotting , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[177]  Lambert Schomaker,et al.  Handwritten-Word Spotting Using Biologically Inspired Features , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[178]  Alicia Fornés,et al.  A Novel Learning-Free Word Spotting Approach Based on Graph Representation , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[179]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[180]  Yousri Kessentini,et al.  Keyword spotting in handwritten documents based on a generic text line HMM and a SVM verification , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[181]  Stéphane Bres,et al.  Word spotting in Alice's adventures underground using multi scale integral orientation features , 2010, Document Analysis Systems.

[182]  Venu Govindaraju,et al.  2009 10th International Conference on Document Analysis and Recognition A Steerable Directional Local Profile Technique for Extraction of Handwritten Arabic Text Lines , 2022 .

[183]  Ernest Valveny,et al.  Handwritten Word Spotting with Corrected Attributes , 2013, 2013 IEEE International Conference on Computer Vision.

[184]  Alan F. Smeaton,et al.  Word matching using single closed contours for indexing handwritten historical documents , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[185]  Volkmar Frinken,et al.  Improving HMM-Based Keyword Spotting with Character Language Models , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[186]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.