Fuzzy C-Mean: A Statistical Feature Classification of Text and Image Segmentation Method

Classification of text and image using statistical features (mean and standard deviation of pixel color values) is found to be a simple yet powerful method for text and image segmentation. The features constitute a systematic structure that segregates one from another. We identified this segregation in the form of class clustering by means of Fuzzy C-Mean method, which determined each cluster location using maximum membership defuzzification and neighborhood smoothing techniques. The method can then be applied to classify text, image, and background areas in optical character recognition (OCR) application for elaborated open document systems.

[1]  Tzung-Pei Hong,et al.  Induction of fuzzy rules and membership functions from training examples , 1996, Fuzzy Sets Syst..

[2]  Thomas A. Runkler,et al.  Image segmentation using fuzzy clustering with fractal features , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[3]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[4]  Medina Ruben,et al.  Segmentation of ventricular angiographic images using fuzzy clustering , 1995, Proceedings of 17th International Conference of the Engineering in Medicine and Biology Society.

[5]  Robert M. Gray,et al.  Text and picture segmentation by the distribution analysis of wavelet coefficients , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).