Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation

This paper presents a novel method for texture-based text-graphic segmentation in a text embedded image. In the method, features are computed applying Multi-scale Geometric Analysis(MGA). The MGA of the image is done by Nonsubsampled contourlet transform(NSCT). The NSCT sub-bands help to generate the features which represent textures of the text portions and graphics portions of the image. In a segmentation process, the uncertainties arise mainly for two reasons: one is the ambiguity in gray level and other is the spatial ambiguity. Here the uncertainties are managed by interval type2 fuzzy set (IT2FS). The human vision model called human psychovisual phenomenon (HVS) is incorporated in the process for generating the interval type-2 fuzzy membership functions (IT2FMF). The efficiency of the proposed scheme is measured on the benchmark dataset. The robustness and performance bound of the proposed technique under noise corruption are measured statistically using modified Cramer-Rao bound. We found that effectiveness of the features by NSCT in combination with the IT2FS are quite promising in comparison to the state-of-the-arts methods.

[1]  Ziyang Li,et al.  A segmentation algorithm based on image projection for complex text layout , 2017 .

[2]  Sankar K. Pal,et al.  Thresholding for edge detection using human psychovisual phenomena , 1986, Pattern Recognit. Lett..

[3]  Stéphane Mallat,et al.  Image compression with geometrical wavelets , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[4]  N. N. Karnik,et al.  Introduction to type-2 fuzzy logic systems , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[5]  Malay Kumar Kundu,et al.  Uncertainty Relations and Time-Frequency Distributions for Unsharp Observables , 1996, Inf. Sci..

[6]  Hojin Cho,et al.  Canny Text Detector: Fast and Robust Scene Text Localization Algorithm , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Azriel Rosenfeld,et al.  Fuzzy Geometry: An updated Overview , 1998, Inf. Sci..

[8]  Po-Yueh Chen,et al.  DWT Based Text Localization , 2004 .

[9]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[10]  Harald Sack,et al.  A framework for improved video text detection and recognition , 2014, Multimedia Tools and Applications.

[11]  Wei Shen,et al.  Text detection in scene images based on exhaustive segmentation , 2017, Signal Process. Image Commun..

[12]  Weilin Huang,et al.  Text-Attentional Convolutional Neural Network for Scene Text Detection , 2015, IEEE Transactions on Image Processing.

[13]  Mausumi Acharyya,et al.  M-Band Wavelets: Application to Texture Segmentation for Real Life Image Analysis , 2003, Int. J. Wavelets Multiresolution Inf. Process..

[14]  Huchuan Lu,et al.  Scene text detection via stroke width , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[15]  Nam Ik Cho,et al.  Feeding Hand-Crafted Features for Enhancing the Performance of Convolutional Neural Networks , 2018, ArXiv.

[16]  Yong Yang,et al.  Method for unconstrained text detection in natural scene image , 2017, IET Comput. Vis..

[17]  Chunheng Wang,et al.  Scene text detection using graph model built upon maximally stable extremal regions , 2013, Pattern Recognit. Lett..

[18]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Jean-Marc Odobez,et al.  Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[20]  Janusz Kacprzyk,et al.  Entropy for intuitionistic fuzzy sets , 2001, Fuzzy Sets Syst..

[21]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[22]  Guoyou Wang,et al.  Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification , 2015, Pattern Recognit. Lett..

[23]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[24]  Xiaolin Li,et al.  Single Shot Text Detector with Regional Attention , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[25]  Weilin Huang,et al.  Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees , 2014, ECCV.

[26]  George G. Coghill,et al.  Text analysis using local energy , 2001, Pattern Recognit..

[27]  Minh N. Do,et al.  Ieee Transactions on Image Processing the Contourlet Transform: an Efficient Directional Multiresolution Image Representation , 2022 .

[28]  Mahesh Viswanathan,et al.  A prototype document image analysis system for technical journals , 1992, Computer.

[29]  Pramod K. Varshney,et al.  Performance Limit of Image Segmentation Algorithms , 2011 .

[30]  Pan He,et al.  Detecting Text in Natural Image with Connectionist Text Proposal Network , 2016, ECCV.

[31]  Seok-Woo Jang,et al.  Texture feature-based text region segmentation in social multimedia data , 2015, Multimedia Tools and Applications.

[32]  Dimosthenis Karatzas,et al.  Multi-script Text Extraction from Natural Scenes , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[33]  Minh N. Do,et al.  The Nonsubsampled Contourlet Transform: Theory, Design, and Applications , 2006, IEEE Transactions on Image Processing.

[34]  E. Candès,et al.  New tight frames of curvelets and optimal representations of objects with piecewise C2 singularities , 2004 .

[35]  Xiang Bai,et al.  Scene text detection and recognition: recent advances and future trends , 2015, Frontiers of Computer Science.

[36]  Zhuowen Tu,et al.  Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .

[37]  Pramod K. Varshney,et al.  On performance limits of image segmentation algorithms , 2015, Comput. Vis. Image Underst..

[38]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  C. A. Murthy,et al.  Histogram thresholding by minimizing graylevel fuzziness , 1992, Inf. Sci..

[40]  Fei Yin,et al.  A Seed-Based Segmentation Method for Scene Text Extraction , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[41]  Malay Kumar Kundu,et al.  Accurate segmentation of complex document image using digital shearlet transform with neutrosophic set as uncertainty handling tool , 2017, Appl. Soft Comput..

[42]  Humberto Bustince,et al.  Restricted equivalence functions , 2006, Fuzzy Sets Syst..

[43]  Shaswati Roy,et al.  Rough-fuzzy clustering and multiresolution image analysis for text-graphics segmentation , 2015, Appl. Soft Comput..

[44]  E. Candès New tight frames of curvelets and optimal representations of objects with C² singularities , 2002 .

[45]  Hyung Jeong Yang,et al.  Automatic detection and recognition of Korean text in outdoor signboard images , 2010, Pattern Recognit. Lett..

[46]  Fernando López-García,et al.  Fast Surface Grading Using Color Statistics in the CIE Lab Space , 2005, IbPRIA.

[47]  Mausumi Acharyya,et al.  Document image segmentation using wavelet scale-space features , 2002, IEEE Trans. Circuits Syst. Video Technol..

[48]  Xin He,et al.  Scene Text Detection and Recognition: The Deep Learning Era , 2018, International Journal of Computer Vision.

[49]  Thanarat H. Chalidabhongse,et al.  Thai text localization in natural scene images using Convolutional Neural Network , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[50]  Rita Cucchiara,et al.  Layout analysis and content enrichment of digitized books , 2014, Multimedia Tools and Applications.

[51]  Weilin Huang,et al.  Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors , 2013, 2013 IEEE International Conference on Computer Vision.

[52]  Mausumi Acharyya,et al.  Adaptive basis selection for multi texture segmentation by M-band wavelet packet frames , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[53]  Tien Dat Nguyen,et al.  Combining Deep and Handcrafted Image Features for Presentation Attack Detection in Face Recognition Systems Using Visible-Light Camera Sensors , 2018, Sensors.

[54]  Alexander Hauptmann,et al.  The Best of BothWorlds: Combining Data-Independent and Data-Driven Approaches for Action Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).