Document image retrieval with improvements in database quality

Modern technology has made it possible to produce, process, transmit and store digital images efficiently. Consequently, the amount of visual information is increasing at an accelerating rate in many diverse application areas. To fully exploit this new content-based image retrieval techniques are required. Document image retrieval systems can be utilized in many organizations which are using document image databases extensively. This thesis presents document image retrieval techniques and new approaches to improve database content. The goal of the thesis is to develop a functional retrieval system and to demonstrate that better retrieval results can be achieved with the proposed database generation methods. Retrieval system architecture, a document data model, and tools for querying document image databases are introduced. The retrieval framework presented allows users to interactively define, construct and combine queries using document or image properties: physical (structural), semantic, textual and visual image content. A technique for combining primitive features like color, shape and texture into composite features is presented. A novel search base reduction technique which uses structural and content properties of documents is proposed for speeding up the query process. A new model for database generation within the image retrieval system is presented. An approach for automated document image defect detection and management is presented to build high quality and retrievable database objects. In image database population, image feature profiles and their attributes are manipulated automatically to better match with query requirements determined by the available query methods, the application environment and the user. Experiments were performed with multiple image databases containing over one thousand images. They comprised a range of document and scene images from different categories, properties and condition. The results show that better recall and accuracy for retrieval is achieved with the proposed optimization techniques. The search base reduction technique results in a considerable speed-up in overall query processing. The constructed document image retrieval system performs well in different retrieval scenarios and provides a consistent basis for algorithm development. The proposed modular system structure and interfaces facilitate its usage in a wide variety of document image retrieval applications.

[1]  Yuan Yan Tang,et al.  Automatic document processing: A survey , 1996, Pattern Recognit..

[2]  K. Scarbrough,et al.  of Electrical Engineering , 1982 .

[3]  Shih-Fu Chang,et al.  SaFe: a general framework for integrated spatial and feature image search , 1997, Proceedings of First Signal Processing Society Workshop on Multimedia Signal Processing.

[4]  Gerd Maderlechner,et al.  Classification of documents by form and content , 1997, Pattern Recognit. Lett..

[5]  Robert M. Haralick,et al.  Power functions and their use in selecting distance functions for document degradation model validation , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[6]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[7]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[8]  Debra T. Burhans,et al.  Visual Semantics: Extracting Visual information from Text Accompanying Pictures , 1994, AAAI.

[9]  Kazem Taghva,et al.  MANICURE document processing system , 1998, Electronic Imaging.

[10]  Andreas Siebert,et al.  Segmentation-based image retrieval , 1997, Electronic Imaging.

[11]  Tom Minka,et al.  Interactive learning with a "society of models" , 1997, Pattern Recognit..

[12]  Dragutin Petkovic,et al.  Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review , 1996 .

[13]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[14]  B. S. Manjunath,et al.  Adaptive filtering and indexing for image databases , 1995, Electronic Imaging.

[15]  Neill W. Campbell,et al.  Interpreting image databases by region classification , 1997, Pattern Recognit..

[16]  Brian Scassellati,et al.  Retrieving images by 2D shape: a comparison of computation methods with human perceptual judgments , 1994, Electronic Imaging.

[17]  David B. H. Tay,et al.  On the multiresolution enhancement of document images using fuzzy logic approach , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[18]  Jonathan J. Hull,et al.  Document image database retrieval and browsing using texture analysis , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[19]  Wen-Hsiang Tsai,et al.  Moment-preserving thresolding: A new approach , 1985, Comput. Vis. Graph. Image Process..

[20]  S. Sitharama Iyengar,et al.  Automated system for numerically rating document image quality , 1997, Electronic Imaging.

[21]  Svetha Venkatesh,et al.  Media-independent knowledge representation via UMART: unified mental annotation and retrieval tool , 1996, Electronic Imaging.

[22]  Yuan Yan Tang,et al.  Document structures: A survey , 1993, ICDAR.

[23]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[24]  Matti Pietikäinen,et al.  Unsupervised texture segmentation using feature distributions , 1997, Pattern Recognit..

[25]  J. Ashley,et al.  Automatic and Semi-Automatic Methods for Image Annotation and Retrieval in QBIC , 1995 .

[26]  Brian V. Funt,et al.  Color Constant Color Indexing , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Theodosios Pavlidis,et al.  Document de-Blurring using Maximum likelihood Methods , 1996, International Workshop on Document Analysis Systems.

[28]  Bin Yu,et al.  Page segmentation using document model , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[29]  Robert M. Haralick,et al.  Zone classification using texture features , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[30]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[31]  Shih-Fu Chang,et al.  Transform features for texture classification and discrimination in large image databases , 1994, Proceedings of 1st International Conference on Image Processing.

[32]  David A. Forsyth,et al.  Finding Naked People , 1996, ECCV.

[33]  Kannan,et al.  ON IMAGE SEGMENTATION TECHNIQUES , 2022 .

[34]  Michael J. Taylor,et al.  Enhancement of document images from cameras , 1998, Electronic Imaging.

[35]  Wen-Hsiang Tsai,et al.  Moment-preserving thresholding: a new approach , 1995 .

[36]  Majdi Ben Hadj Ali Background noise detection and cleaning in document images , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[37]  Hubert Emptoz,et al.  Analysis and conversion of documents , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[38]  Luigi Cinque,et al.  Indexing pictorial documents by their content: a survey of current techniques , 1997, Image Vis. Comput..

[39]  Teuvo Kohonen,et al.  Exploration of very large databases by self-organizing maps , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[40]  Bidyut Baran Chaudhuri,et al.  Automatic detection of italic, bold and all-capital words in document images , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[41]  John R. Smith,et al.  Intelligent multimedia information retrieval , 1997 .

[42]  Robert M. Haralick,et al.  Global and local document degradation models , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[43]  Simone Santini,et al.  In search of information in visual media , 1997, CACM.

[44]  Victor Wu Document Image Clean-up and Binarization , 1998 .

[45]  P. Herrmann,et al.  Retrieval of document images using layout knowledge , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[46]  Rohini K. Srihari,et al.  Automatic Indexing and Content-Based Retrieval of Captioned Images , 1995, Computer.

[47]  Maylor K. H. Leung,et al.  Linear layout processing , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[48]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[49]  Anil K. Jain,et al.  Texture classification and segmentation using multiresolution simultaneous autoregressive models , 1992, Pattern Recognit..

[50]  Simone Santini,et al.  Image Databases Are Not Databases with Images , 1997, ICIAP.

[51]  Jonathan J. Hull,et al.  Proper noun detection in document images , 1994, Pattern Recognit..

[52]  Dan S. Bloomberg,et al.  Detecting and locating partially specified keywords in scanned images using hidden Markov models , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[53]  Y. Vardi,et al.  From image deblurring to optimal investments : maximum likelihood solutions for positive linear inverse problems , 1993 .

[54]  Thien Huu Nguyen,et al.  Docbrowse: a System for Textual and Graphical Querying on Degraded Document Image Data , 1996, International Workshop on Document Analysis Systems.

[55]  Edward R. Dougherty,et al.  Facilitation of optimal binary morphological filter design via structuring element libraries and design constraints , 1992 .

[56]  Timo Honkela,et al.  Self-Organizing Maps In Natural Language Processing , 1997 .

[57]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[58]  Francine Chen,et al.  Summarization of Imaged Documents without OCR , 1998, Comput. Vis. Image Underst..

[59]  Carole A. Goble,et al.  Describing and classifying multimedia using the description logic GRAIL , 1996, Electronic Imaging.

[60]  David S. Doermann,et al.  The Indexing and Retrieval of Document Images: A Survey , 1998, Comput. Vis. Image Underst..

[61]  Pinar Duygulu Sahin,et al.  A heuristic algorithm for hierarchical representation of form documents , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[62]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[63]  Linda G. Shapiro,et al.  A flexible image database system for content-based retrieval , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[64]  Seinosuke Narita,et al.  Logical structure analysis of book document images using contents information , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[65]  HongJiang Zhang,et al.  Scheme for visual feature-based image indexing , 1995, Electronic Imaging.

[66]  Jianchang Mao,et al.  A model-based form processing sub-system , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[67]  Atsuhiro Takasu,et al.  A document understanding method for database construction of an electronic library , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[68]  Mysore Y. Jaisimha,et al.  DocBrowse: a system for information retrieval from document image data , 1996, Electronic Imaging.

[69]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[70]  Mohamed S. Kamel,et al.  Extraction of Binary Character/Graphics Images from Grayscale Document Images , 1993, CVGIP Graph. Model. Image Process..

[71]  Juyang Weng,et al.  Efficient content-based image retrieval using automatic feature selection , 1995, Proceedings of International Symposium on Computer Vision - ISCV.

[72]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[73]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[74]  Azriel Rosenfeld,et al.  The function of documents , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[75]  Thomas S. Huang,et al.  Relevance feedback techniques in interactive content-based image retrieval , 1997, Electronic Imaging.

[76]  Ramesh C. Jain Visual information management , 1997, CACM.

[77]  Matti Pietikäinen,et al.  Accurate color discrimination with classification based on feature distributions , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[78]  Aya Soffer Image categorization using texture features , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[79]  Markus A. Stricker,et al.  Spectral covariance and fuzzy regions for image indexing , 1997, Machine Vision and Applications.

[80]  Matti Pietikäinen,et al.  APPROACHES TO TEXTURE-BASED CLASSIFICATION, SEGMENTATION AND SURFACE INSPECTION , 1999 .

[81]  李幼升,et al.  Ph , 1989 .

[82]  Shih-Fu Chang,et al.  Tools and techniques for color image retrieval , 1996, Electronic Imaging.

[83]  Henry S. Baird,et al.  DATA STRUCTURES FOR PAGE READERS , 1995 .

[84]  Harry Wechsler,et al.  Face recognition using hybrid classifiers , 1997, Pattern Recognit..

[85]  Rosalind W. Picard,et al.  Finding similar patterns in large image databases , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[86]  Thomas P. Minka,et al.  An image database browser that learns from user interaction , 1996 .

[87]  Peter Stanchev,et al.  Content-Based Image Retrieval Systems , 2001 .

[88]  Bindu Rama Rao Object-oriented databases - technology, applications, and products , 1994 .

[89]  Giovanni Ramponi,et al.  Enhancing document images with a quadratic filter , 1993, Signal Process..

[90]  Henry S. Baird,et al.  Document image defect models , 1995 .

[91]  Shih-Fu Chang,et al.  Automated binary texture feature sets for image retrieval , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[92]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[93]  Carlo Meghini Toward a logical reconstruction of image retrieval , 1996, Electronic Imaging.

[94]  Anil K. Jain,et al.  Image-based form document retrieval , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[95]  Ramesh C. Jain,et al.  Content-Centric Computing in Visual Systems , 1997, ICIAP.

[96]  R. Manmatha,et al.  Document image cleanup and binarization , 1998, Electronic Imaging.

[97]  Shih-Fu Chang,et al.  Visually Searching the Web for Content , 1997, IEEE Multim..

[98]  Volker Märgner,et al.  Data structures and tools for document database generation: an experimental system , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[99]  Chuan-Heng Ang,et al.  Retrieving similar pictures from a pictorial database by an improved hashing table , 1997, Pattern Recognit. Lett..

[100]  Fang Liu,et al.  Periodicity, Directionality, and Randomness: Wold Features for Image Modeling and Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..