A methodology for understanding handwritten addresses is proposed. Address understanding is the process of using multiple information sources to assign a fiveor nine-digit ZIP Code to an address block consisting of several text lines. This method uses many diverse pattern recognition and image processing algorithms. To fully process the address image, we must perform thresholding, remove underlining, separate lines of text, segment text lines into words, determine the syntax of the image, and recognize digits, characters, and words. Our approach emphasizes using contextual information that is available in the address to determine a ZIP Code for a mail-piece. While other research efforts concentrate on simply locating arid recognizing the ZIP Code, our system allows the use of recognition information from the city name and state name to develop a better understanding of the address. This recognition information combined with USPS directory information assists in determining the ZIP Code. Tests of the present system on 508 address images result in a 75.2% accept rate with a 1.6% error rate. This paper describes the algorithms used and suggests improvements for future systems.
[1]
Schurmann.
A Multifont Word Recognition System for Postal Address Reading
,
1978,
IEEE Transactions on Computers.
[2]
G. Winkler,et al.
A combination of statistical and syntactical pattern recognition applied to classification of unconstrained handwritten numerals
,
1980,
Pattern Recognit..
[3]
Ching Y. Suen,et al.
Computer Recognition of Totally unconstrained Handwritten ZIP Codes
,
1987,
Int. J. Pattern Recognit. Artif. Intell..
[4]
Sargur N. Srihari,et al.
A blackboard-based approach to handwritten ZIP code recognition
,
1988,
[1988 Proceedings] 9th International Conference on Pattern Recognition.
[5]
Sargur N. Srihari,et al.
A System to Locate and Recognize ZIP Codes in Handwritten Addresses
,
1989
.
[6]
Jonathan J. Hull,et al.
Multiple Algorithms for Handwritten Character Recognition
,
2000
.