A goal-oriented verification-based approach for target text line extraction from a document image captured by a pen scanner

We present a goal-oriented, verification-based approach for extracting the target text line from a document image captured by a pen scanner. Given a binary image, a series of processing steps is invoked adaptively, with each step guided by the text line verification result of the preceding step. Each step adopts the strategy that is most effective for the problem it addresses. Consequently, the target text line can be extracted more efficiently and reliably, depending on the nature of the captured image. The effectiveness of the approach is confirmed by a benchmark test.
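
The verification-guided control flow described above can be illustrated with a minimal sketch. It assumes one plausible reading of the abstract: cheap steps run first, and a costlier step is invoked only when verification rejects the preceding result. All names here (`ink_runs`, `whole_image`, `central_run`, `verify`, `extract_target_line`, `STEPS`) and the toy verification rule are hypothetical illustrations, not the authors' actual steps or criteria.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Image = List[List[int]]   # binary image: 1 = ink, 0 = background
Band = Tuple[int, int]    # (top_row, bottom_row), inclusive


def ink_runs(image: Image) -> List[Band]:
    """Return maximal runs of consecutive rows that contain ink."""
    runs: List[Band] = []
    start: Optional[int] = None
    for y, row in enumerate(image):
        if sum(row) > 0:
            if start is None:
                start = y
        elif start is not None:
            runs.append((start, y - 1))
            start = None
    if start is not None:
        runs.append((start, len(image) - 1))
    return runs


def whole_image(image: Image) -> Optional[Band]:
    """Cheapest step (assumed): treat all ink as a single text line."""
    runs = ink_runs(image)
    return (runs[0][0], runs[-1][1]) if runs else None


def central_run(image: Image) -> Optional[Band]:
    """Costlier step (assumed): keep the ink run nearest the vertical centre."""
    runs = ink_runs(image)
    if not runs:
        return None
    centre = (len(image) - 1) / 2
    return min(runs, key=lambda r: abs((r[0] + r[1]) / 2 - centre))


def verify(image: Image, band: Band) -> bool:
    """Toy verification (assumed): accept a band only if it has no blank rows."""
    top, bottom = band
    return all(sum(row) > 0 for row in image[top:bottom + 1])


Step = Tuple[str, Callable[[Image], Optional[Band]]]
STEPS: Sequence[Step] = (("whole_image", whole_image), ("central_run", central_run))


def extract_target_line(image: Image,
                        steps: Sequence[Step] = STEPS) -> Optional[Tuple[str, Band]]:
    """Run steps in order; stop at the first candidate that passes verification."""
    for name, step in steps:
        band = step(image)
        if band is not None and verify(image, band):
            return name, band
    return None  # no step produced a verified text line


# A clean single-line scan is settled by the cheapest step.
single = [[0, 0], [1, 1], [1, 0], [0, 0]]
# Two lines in view: the first candidate fails verification, so the next step runs.
double = [[0, 0], [1, 1], [0, 0], [1, 0], [1, 1], [0, 1], [0, 0]]
print(extract_target_line(single))
print(extract_target_line(double))
```

In this sketch, an image that already contains a single clean line never pays for the later steps, which is one way the adaptive invocation could yield the efficiency gain the abstract claims.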
