论文信息 - Detection and recognition via adaptive binarization and fuzzy clustering

Detection and recognition via adaptive binarization and fuzzy clustering

Detection and identification of text in natural scene images pose major challenges: image quality varies as scenes are taken under different conditions (lighting, angle and resolution) and the contained text entities can be in any form (size, style and orientation). In this paper, a robust approach is proposed to localize, extract and recognize scene texts of different sizes, fonts and orientations from images of varying quality. The proposed method consists of the following steps: preprocessing and enhancement of input image using the National Television System Committee (NTSC) color mapping and the contrast enhancement via mean histogram stretching; candidate text regions detection using hybrid adaptive segmentation and fuzzy c-means clustering techniques; a two-stage text extraction from the candidate text regions to filter out false text regions include local character filtering according to a rule-based approach using shape and statistical features and text region filtering via stroke width transform (SWT); and finally, text recognition using Tesseract OCR engine. The proposed method was evaluated using two benchmark datasets: ICDAR2013 and KAIST image datasets. The proposed method effectively dealt with complex scene images containing texts of various font sizes, colors, and orientation; and outperformed state-of-the-art methods, achieving >80% in both precision and recall measures.

Siti Norul Huda Sheikh Abdullah | Fariza Fauzi | Saad M. Ismail | S. Abdullah | F. Fauzi

[1] Rajib Ghosh,et al. Devanagari text extraction from natural scene images , 2014, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[2] Dimosthenis Karatzas,et al. Multi-script Text Extraction from Natural Scenes , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[3] Juraj Horváth,et al. Image Segmentation Using Fuzzy C-Means , 2005 .

[4] Tony X. Han,et al. Scene text detection based on component-level fusion and region-level verification , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[5] Huizhong Chen,et al. Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[6] Guan Gui,et al. Template Matching-Based Method for Intelligent Invoice Information Identification , 2019, IEEE Access.

[7] Palaiahnakote Shivakumara,et al. A robust arbitrary text detection system for natural scene images , 2014, Expert Syst. Appl..

[8] Manoj Kumar,et al. Text Detection using Multilayer Separation in Real Scene Images , 2010, 2010 10th IEEE International Conference on Computer and Information Technology.

[9] Kumary R Soumya,et al. TEXT EXTRACTION FROM IMAGES: A SURVEY , 2014 .

[10] Palaiahnakote Shivakumara,et al. Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[11] Humera Tariq,et al. K-Means Cluster Analysis for Image Segmentation , 2014 .

[12] Daijin Kim,et al. Scene text detection with robust character candidate extraction method , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[13] Shiliang Sun,et al. Text detection in nature scene images using two-stage nontext filtering , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[14] E.-M. Nosal,et al. Flood-fill algorithms used for passive acoustic detection and tracking , 2008, 2008 New Trends for Environmental Monitoring Using Passive Systems.

[15] Jiri Matas,et al. Real-Time Lexicon-Free Scene Text Localization and Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16] Kaizhu Huang,et al. Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Saeed Mian Qaisar,et al. Scene to Text Conversion and Pronunciation for Visually Impaired People , 2019, 2019 Advances in Science and Engineering Technology International Conferences (ASET).

[18] Chunhua Shen,et al. Toward End-to-End Car License Plate Detection and Recognition With Deep Neural Networks , 2019, IEEE Transactions on Intelligent Transportation Systems.

[19] Manoj Kumar,et al. Automatic text location from complex natural scene images , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[20] Chunheng Wang,et al. Scene Text Recognition Using Part-Based Tree-Structured Character Detection , 2013, CVPR 2013.

[21] Yang Wu,et al. An efficient coarse-to-fine scheme for text detection in videos , 2011, The First Asian Conference on Pattern Recognition.

[22] Tatiana Novikova,et al. Large-Lexicon Attribute-Consistent Text Recognition in Natural Images , 2012, ECCV.

[23] T. Santhanam,et al. A SURVEY ON VARIOUS APPROACHES OF TEXT EXTRACTION IN IMAGES , 2012 .

[24] Natsuda Kaothanthong,et al. ACCURACY IMPROVEMENT OF A PROVINCE NAME RECOGNITION ON THAI LICENSE PLATE , 2018, 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP).

[25] Reza Safabakhsh,et al. Text Detection in Natural Scenes using Fully Convolutional DenseNets , 2018, 2018 4th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS).

[26] Hassan Aghaeinia,et al. Robust color image segmentation using fuzzy c-means with weighted hue and intensity , 2016, Digit. Signal Process..

[27] Fei Yin,et al. Scene Text Localization Using Gradient Local Correlation , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[28] Rahul Dagade,et al. Survey on Text Detection, Segmentation and Recognition from a Natural Scene Images , 2014 .

[29] Yi Zhang,et al. End-to-End Vessel Plate Number Detection and Recognition Using Deep Convolutional Neural Networks and LSTMs , 2018, 2018 11th International Symposium on Computational Intelligence and Design (ISCID).

[30] Khairuddin Omar,et al. Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions , 2019, J. Imaging.

[31] Yi-Ping Yang,et al. Locating text based on connected component and SVM , 2007, 2007 International Conference on Wavelet Analysis and Pattern Recognition.

[32] Jun Guo,et al. Text extraction from natural scene image: A survey , 2013, Neurocomputing.

[33] Anil K. Jain,et al. Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[34] Palaiahnakote Shivakumara,et al. A Laplacian Method for Video Text Detection , 2000, 2009 10th International Conference on Document Analysis and Recognition.

[35] Chew Lim Tan,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .