Retrieval methods for English text with misrecognized OCR characters

This paper presents three probabilistic text retrieval methods for full-text search of English documents containing OCR errors. Because every query term is searched on the premise that the recognized text contains errors, the methods tolerate such errors, and costly manual post-editing after OCR is not required. The approach uses confusion matrices, which store, for each character, the characters it is likely to be misrecognized as, together with the probability of each such substitution. In addition, a 2-gram matrix stores character connection probabilities, i.e., how likely one letter is to follow another. For each input query term, multiple search terms are generated by referring to the confusion matrices, and a full-text search is run for each search term. The validity of each retrieved term is then judged from its error-occurrence and character connection probabilities. Retrieval effectiveness is evaluated experimentally by calculating recall and precision rates. The results indicate a marked improvement over exact matching.
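
The query-expansion idea can be illustrated with a minimal Python sketch. It is not the paper's implementation: the tables `CONFUSION` and `BIGRAM`, the floor `UNSEEN`, the pruning threshold `min_prob`, and all function names are hypothetical, and every probability value is made up for illustration. The abstract does not say how the error-occurrence and character connection probabilities are combined, so the sketch computes them separately and ranks hits by error probability only.

```python
from itertools import product

# Toy confusion matrix (values are made up): CONFUSION[c][r] approximates
# P(OCR outputs r | the printed character is c).
CONFUSION = {
    "l": {"l": 0.90, "1": 0.10},
    "o": {"o": 0.95, "0": 0.05},
}

# Toy 2-gram matrix (values are made up): BIGRAM[(a, b)] approximates
# P(next character is b | current character is a).
BIGRAM = {("c", "o"): 0.20, ("o", "l"): 0.10, ("l", "d"): 0.05}
UNSEEN = 0.001  # assumed floor for character pairs missing from BIGRAM


def expand_term(term, confusion, min_prob=0.01):
    """Return {search_term: error_probability} for one query term."""
    # Characters absent from the matrix are assumed to be recognized correctly.
    options = [confusion.get(ch, {ch: 1.0}).items() for ch in term]
    expanded = {}
    for combo in product(*options):
        prob = 1.0
        for _, p in combo:
            prob *= p
        # Drop unlikely variants to keep the number of searches small.
        if prob >= min_prob:
            expanded["".join(r for r, _ in combo)] = prob
    return expanded


def connection_probability(word, bigram, unseen=UNSEEN):
    """Multiply the 2-gram probabilities of adjacent character pairs."""
    prob = 1.0
    for a, b in zip(word, word[1:]):
        prob *= bigram.get((a, b), unseen)
    return prob


def search(text, term, confusion):
    """Run an exact substring search per generated term; rank hits by error probability."""
    hits = [(cand, p) for cand, p in expand_term(term, confusion).items()
            if cand in text]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)
```

With these toy tables, `search("a co1d day", "cold", CONFUSION)` finds the misrecognized form `co1d`, which exact matching on `cold` would miss. The value from `connection_probability` is lower for `co1d` than for `cold`, which hints at how a 2-gram matrix could help separate plausible words from implausible character sequences.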
