Logos extraction on picture documents using shape and color density

Logos detection on textual images is a decisive stage in documents recognition and classification system. The over or the sub-detection of logos strongly penalizes the system capacities and corrupts the subsequent stages result. We developed here an effective and robust logo extraction algorithm while considering the two principals proprieties of logos: spatial compactness and colorimetric uniformity. First, the image content is reduced and transformed using mathematical morphology operators to decrease the distance between the identical logo parts. Afterwards the logo regions of height spatial and chromatic densities are detected. The results demonstrate the robustness of the proposed method over a range of representative text images.

[1]  Matti Pietikäinen,et al.  Page segmentation and classification using fast feature extraction and connectivity analysis , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[2]  Tuan D. Pham Unconstrained logo detection in document images , 2003, Pattern Recognit..

[3]  R. Manmatha,et al.  Document image cleanup and binarization , 1998, Electronic Imaging.

[4]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[5]  David S. Doermann,et al.  The Indexing and Retrieval of Document Images: A Survey , 1998, Comput. Vis. Image Underst..

[6]  Ehud Rivlin,et al.  Logo recognition using geometric invariants , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[7]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[8]  S. Bergler,et al.  Skew detection, page segmentation, and script classification of printed document images , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[9]  Victor Wu Document Image Clean-up and Binarization , 1998 .

[10]  R. Yager,et al.  Approximate Clustering Via the Mountain Method , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[11]  Whoi-Yul Kim,et al.  Content-based trademark retrieval system using a visually salient feature , 1998, Image Vis. Comput..

[12]  Sandy Irani,et al.  LOGO DETECTION IN DOCUMENT IMAGES , 1997 .