Efficient text segmentation and adaptive color error diffusion for text enhancement

This paper proposes an adaptive error diffusion algorithm for text enhancement followed by an efficient text segmentation that uses the maximum gradient difference (MGD). The gradients are calculated along with scan lines, then the MGD values are filled within a local window to merge text segments. If the value is above a threshold, the pixel is considered as potential text. Isolated segments are then eliminated in a non-text region filtering process. After the text segmentation, a conventional error diffusion method is applied to the background, while edge enhancement error diffusion is used for the text. Since it is inevitable that visually objectionable artifacts are generated when using two different halftoning algorithms, gradual dilation is proposed to minimize the boundary artifacts in the segmented text blocks before halftoning. Sharpening based on the gradually dilated text region (GDTR) then prevents the printing of successive dots around the text region boundaries. The method is extended to halftone color images to sharpen the text regions. The proposed adaptive error diffusion algorithm involves color halftoning that controls the amount of edge enhancement using a general error filter. However, edge enhancement unfortunately produces color distortion, as edge enhancement and color difference are trade-offs. The multiplicative edge enhancement parameters are selected based on the amount of edge sharpening and color difference. Plus, an additional error factor is introduced to reduce the dot elimination artifact generated by the edge enhancement error diffusion. In experiments, the text of a scanned image was sharper when using the proposed algorithm than with conventional error diffusion without changing the background.

[1]  Kevin J. Parker,et al.  Digital halftoning technique using a blue-noise mask , 1992 .

[2]  Edward K. Wong,et al.  A new robust algorithm for video text extraction , 2003, Pattern Recognit..

[3]  Jan P. Allebach,et al.  A dual interpretation for direct binary search and its implications for tone reproduction and texture quality , 2000, IEEE Trans. Image Process..

[4]  Shigeru Akamatsu,et al.  Recognizing Characters in Scene Images , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Lina J. Karam,et al.  Morphological text extraction from images , 2000, IEEE Trans. Image Process..

[6]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[7]  Alan C. Bovik,et al.  Modeling and quality assessment of halftoning by error diffusion , 2000, IEEE Trans. Image Process..

[8]  Antonios Atsalakis,et al.  Gray-Level Reduction Using Local Spatial Features , 2000, Comput. Vis. Image Underst..

[9]  Keith T. Knox,et al.  Evolution of error diffusion , 1998, Electronic Imaging.

[10]  Charalambos Strouthopoulos,et al.  Text extraction in complex color documents , 2002, Pattern Recognit..

[11]  Rae-Hong Park,et al.  Document image binarization based on topographic analysis using a water flow model , 2002, Pattern Recognit..

[12]  Reiner Eschbach,et al.  Error-diffusion algorithm with edge enhancement , 1991 .

[13]  Anil K. Jain,et al.  Locating text in complex color images , 1995, Pattern Recognit..

[14]  Apostolos Antonacopoulos,et al.  Two approaches for text segmentation in Web images , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..