A fuzzy Approach to Text Segmentation in Web Images based on Human Colour perception

This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods.

[1]  R. Carter,et al.  CIE L*u*v* Color‐Difference Equations for Self‐Luminous Displays , 1983 .

[2]  Apostolos Antonacopoulos,et al.  An Anthropocentric Approach to Text Extraction from WWW Images , 2000 .

[3]  Apostolos Antonacopoulos,et al.  Page Segmentation Using the Description of the Background , 1998, Comput. Vis. Image Underst..

[4]  Daniel P. Lopresti,et al.  Locating and Recognizing Text in WWW Images , 2000, Information Retrieval.

[5]  Apostolos Antonacopoulos,et al.  Accessing textual information embedded in Internet images , 2000, IS&T/SPIE Electronic Imaging.

[6]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[7]  Daniel P. Lopresti,et al.  Extracting text from WWW images , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[8]  Daniel P. Lopresti,et al.  Document Analysis and the World Wide Web , 1996, DAS.

[9]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[10]  G. Wyszecki,et al.  Color Science Concepts and Methods , 1982 .

[11]  Apostolos Antonacopoulos,et al.  Automated Interpretation of Visual Representations: Extracting Textual Information from WWW Images , 1999, Visual Representations and Interpretations.

[12]  Michael K. Brown,et al.  Web Page Analysis for Voice Browsing , 2001 .

[13]  Jianying Hu,et al.  Flexible Web document analysis for delivery to narrow-bandwidth devices , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.