On virtual partitioning of large dictionaries for contextual post-processing to improve character recognition

A new approach to the partitioning of large dictionaries by virtual views is presented. The basic idea is that additional knowledge sources of text recognition and text analysis are employed for fast dictionary look-up in order to prune the search space through static or dynamic views. The heart of the system is a redundant hashing technique which involves a set of hash functions dealing with noisy input efficiently. Currently, the system is composed of two main system components: the dictionary generator and the dictionary controller. While the dictionary generator initially builds the system by using profiles and source dictionaries, the controller allows the flexible integration of different search heuristics. Results prove that the system achieves a respectable speed-up of dictionary access time. >

[1]  Rainer Hoch Hybrid Structured Dictionary for Improving Text Recognition , 1992, MVA.

[2]  Lindsay J. Evett,et al.  Fast dictionary look-up for contextual word recognition , 1990, Pattern Recognit..

[3]  Schurmann A Multifont Word Recognition System for Postal Address Reading , 1978, IEEE Transactions on Computers.

[4]  Nancy Ide,et al.  Outline of a database model for electronic dictionaries , 1991, RIAO.

[5]  Rainer Hoch,et al.  From paper to office document standard representation , 1992, Computer.

[6]  Nobuyasu Itoh,et al.  A spelling correction method and its application to an OCR system , 1990, Pattern Recognit..

[7]  James L. Peterson,et al.  Computer programs for detecting and correcting spelling errors , 1980, CACM.

[8]  Udi Manber,et al.  Fast text searching: allowing errors , 1992, CACM.

[9]  Rainer Hoch,et al.  Fragmentary string matching by selective access to hybrid tries , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[10]  Rainer Hoch,et al.  Designing a structured lexicon for document image analysis , 1992 .

[11]  Sargur N. Srihari,et al.  Integrating diverse knowledge sources in text recognition , 1982, TOIS.

[12]  R. Mahesh K. Sinha,et al.  On partitioning a dictionary for visual text recognition , 1990, Pattern Recognit..

[13]  Dave Elliman,et al.  A review of segmentation and contextual analysis techniques for text recognition , 1990, Pattern Recognit..