Text Spotting for Curved Metal Surface: Clustering, Fitting, and Rectifying

In the machining system, how to recognize text on the curved metal surface of parts is still a challenging question. In this article, a text spotting framework for curved metal surface based on image rectification and CNN is proposed. First, a curved text rectification method through CNN-based detection and text curve fitting is investigated. The CNN model for text detection is trained on the original images, which is used to obtain the location of characters of the text. The characters are clustered based on location and size, and the text line is fitted based on the centers of characters that belong to the same cluster. The text image is expanded according to the fitting text line, which finishes the rectification of the curve text image. Second, plenty of original text images are rectified by the proposed rectification method to construct the training data set. Third, in the recognition stage, an original text image is rectified and used as the input of the CNN recognition model. In the experiments, after comparing the detection performance between YOLOv3, Faster R-CNN, SSD-ResNet101, and RetinaNet101, RetinaNet101 was chosen as the text detector. Also, the recognition results on the rectification training set and test set generated by the proposed framework are better than those on the original image data sets. The experimental results show that the proposed method can achieve competitive results text spotting for curved metal surface.

[1]  Xiang Bai,et al.  TextBoxes++: A Single-Shot Oriented Scene Text Detector , 2018, IEEE Transactions on Image Processing.

[2]  Alessandro Ferrero,et al.  Camera as the instrument: the rising trend of vision based measurement , 2014, IEEE Instrumentation & Measurement Magazine.

[3]  Bijan Shirinzadeh,et al.  A vision-based measurement algorithm for micro/nano manipulation , 2013, 2013 IEEE/ASME International Conference on Advanced Intelligent Mechatronics.

[4]  Shuchang Zhou,et al.  EAST: An Efficient and Accurate Scene Text Detector , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Shuigeng Zhou,et al.  AON: Towards Arbitrarily-Oriented Text Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  Lei Sun,et al.  Mask R-CNN With Pyramid Attention Network for Scene Text Detection , 2018, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[7]  Jiri Matas,et al.  Scene Text Localization and Recognition with Oriented Stroke Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[8]  Yuwaldi Away,et al.  Visual inventory inspection using optical character recognition , 2004, Proceedings. International Conference on Computer Graphics, Imaging and Visualization, 2004. CGIV 2004..

[9]  Kai Wang,et al.  End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[10]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[11]  Xiang Bai,et al.  Detecting Oriented Text in Natural Images by Linking Segments , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Xiang Bai,et al.  Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ankush Gupta,et al.  Synthetic Data for Text Localisation in Natural Images , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Andrew Zisserman,et al.  Deep Features for Text Spotting , 2014, ECCV.

[16]  Raquel Frizera Vassallo,et al.  Character Recognition for Automatic Draught Survey , 2018, 2018 13th IEEE International Conference on Industry Applications (INDUSCON).

[17]  Pan He,et al.  Detecting Text in Natural Image with Connectionist Text Proposal Network , 2016, ECCV.

[18]  Kai Chen,et al.  A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling , 2017, IEEE Transactions on Multimedia.

[19]  Wei Feng,et al.  TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[20]  Junjie Yan,et al.  FOTS: Fast Oriented Text Spotting with a Unified Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Dawei Li,et al.  Automatic Detection and Classification of Sewer Defects via Hierarchical Deep Learning , 2019, IEEE Transactions on Automation Science and Engineering.

[22]  R. Joe Stanley,et al.  Adaptable Ring for Vision-Based Measurements and Shape Analysis , 2017, IEEE Transactions on Instrumentation and Measurement.

[23]  Sang Woo Kim,et al.  Recognition of Slab Identification Numbers Using a Deep Convolutional Neural Network , 2016, 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA).

[24]  Usman Tariq,et al.  Application of Machine Learning to Evaluate Insulator Surface Erosion , 2020, IEEE Transactions on Instrumentation and Measurement.

[25]  Umapada Pal,et al.  Multi-oriented Text Recognition in Graphical Documents Using HMM , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[26]  Alessandro Bissacco,et al.  Towards Unconstrained End-to-End Text Spotting , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[27]  Xiang Bai,et al.  Robust Scene Text Recognition with Automatic Rectification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Fei Gao,et al.  A Novel VBM Framework of Fiber Recognition Based on Image Segmentation and DCNN , 2020, IEEE Transactions on Instrumentation and Measurement.

[29]  Zong-Yi Wang,et al.  A vision-based system for post-welding quality measurement and defect detection , 2016 .

[30]  Youbao Tang,et al.  Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification , 2018, IEEE Transactions on Multimedia.

[31]  Shah Nawaz,et al.  Meter Digit Recognition Via Faster R-CNN , 2019, 2019 International Conference on Robotics and Automation in Industry (ICRAI).

[32]  Changhyun Park,et al.  Development of recognition system for billet identification , 2007, SICE Annual Conference 2007.

[33]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[34]  Sang Woo Kim,et al.  A text localization algorithm in images for automatic recognition of slab identification numbers , 2009, 2009 European Control Conference (ECC).

[35]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[36]  Pan He,et al.  Reading Scene Text in Deep Convolutional Sequences , 2015, AAAI.

[37]  Xiang Bai,et al.  ASTER: An Attentional Scene Text Recognizer with Flexible Rectification , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Abdulmotaleb El Saddik,et al.  Modified Syntactic Method to Recognize Bengali Handwritten Characters , 2007 .

[39]  Wei Zhou,et al.  TextField: Learning a Deep Direction Field for Irregular Scene Text Detection , 2018, IEEE Transactions on Image Processing.

[40]  Shuigeng Zhou,et al.  Focusing Attention: Towards Accurate Text Recognition in Natural Images , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[41]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Xiangyang Xue,et al.  Arbitrary-Oriented Scene Text Detection via Rotation Proposals , 2017, IEEE Transactions on Multimedia.

[43]  Li-kun Lu,et al.  The application of computer vision algorithms in touching characters recognition of industry products , 2010, 2010 International Conference on Mechanic Automation and Control Engineering.

[44]  Yisu Ge,et al.  A Mechanism and Method of Leak Detection for Pressure Vessel: Whether, When, and How , 2020, IEEE Transactions on Instrumentation and Measurement.

[45]  Rassoul Amirfattahi,et al.  A new binarization method for high accuracy handwritten digit recognition of slabs in steel companies , 2017, 2017 10th Iranian Conference on Machine Vision and Image Processing (MVIP).

[46]  Wenyu Liu,et al.  TextBoxes: A Fast Text Detector with a Single Deep Neural Network , 2016, AAAI.

[47]  Han Hu,et al.  WordSup: Exploiting Word Annotations for Character Based Text Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[48]  Xin He,et al.  TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes , 2018, ECCV.

[49]  Alfredo Paolillo,et al.  Metrological Characterization of a Vision-Based Measurement System for the Online Inspection of Automotive Rubber Profile , 2009, IEEE Transactions on Instrumentation and Measurement.