Language-Level Syntactic and Semantic Constraints Applied to Visual Word Recognition

Various aspects of using language-level syntactic and semantic constraints to improve the performance of word recognition algorithms are discussed. Following a brief presentation of a hypothesis generation model for handwritten word recognition, various types of language-level constraints are reviewed. Methods that exploit these characteristics are discussed including intra-document word correlation, common vocabularies, part-of-speech tag cooccurrence, structural parsing with a chart data structure, and semantic biasing with a thesaurus.

[1]  Sargur N. Srihari,et al.  An Integrated Algorithm for Text Recognition: Comparison with a Cascaded Algorithm , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Gerald Salton,et al.  Automatic text processing , 1988 .

[3]  Jonathan J. Hull,et al.  Semantic Information Extraction with a Thesaurus for Visual Word Recognition , 1993 .

[4]  Stephanie Seneff Probabilistic Parsing for Spoken Language Applications , 1989, IWPT.

[5]  George Nagy,et al.  Decoding Substitution Ciphers by Means of Word Matching with Application to OCR , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  M. Tomita,et al.  An efficient word lattice parsing algorithm for continuous speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[8]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[9]  Lin-Shan Lee,et al.  An augmented chart data structure with efficient word lattice parsing scheme in speech recognition applications , 1990, COLING 1990.

[10]  George M. White,et al.  Natural language understanding and speech recognition , 1990, CACM.

[11]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  L. Rabiner,et al.  The acoustics, speech, and signal processing society - A historical perspective , 1984, IEEE ASSP Magazine.

[13]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[14]  Ken Thompson,et al.  Reading Chess , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Jonathan J. Hull,et al.  Keyword Location in Noisy Document Images , 1993 .

[16]  Roland Kuhn,et al.  Speech Recognition and the Frequency of Recently Used Words: A Modified Markov Model for Natural Language , 1988, COLING.

[17]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[18]  W. Nelson Francis,et al.  FREQUENCY ANALYSIS OF ENGLISH USAGE: LEXICON AND GRAMMAR , 1983 .

[19]  Kenneth Ward Church,et al.  Parsing, Word Associations and Typical Predicate-Argument Relations , 1989, HLT.