Object Detection in Natural Scene Images Using Thresholding Techniques

A lot of attention has emerged regarding the aspects of text detection and identification as OCR has generated a lot of prominence over the years. There has been a number of experiments conducted in this field to make the results more and more accurate. Most of the experiments carried out have paid attention to only a few attributes and not a lot of trails have been done for unusual scenarios, like a lot of techniques produces accurate results only for horizontal textual orientation. So there should different techniques for analyzing such images which have a complex background, different font styles, colors, textual orientations. Text detection on images containing texts of different orientations, different font types, and images with complex backgrounds is taken for the proposed work. There are mainly 3 steps in the algorithm proposed, the Canny edge detection approach for gradient filtering is applied in the first stage to detect the skeletal structure of various objects in the image. In the next stage textual threshold-based object filtering is carried out using the convolution technique with a heuristic thresholding model. The textual object filtering after convolution is subject to the last stage called post enhancement technique. In this stage, partial non-textual objects being filtered out are employed for removal based on geometrical properties of gradients of images, thus retaining only the textual objects. Finally, the textual object filtered gradient image is considered as a mask image for mapping it to the original image for text detection. Experimentations are conducted on Google Street View Datasets for which a subjective evaluation procedure is adapted to validate the results resulting in promising outcomes for more than 50% of images.

[1]  Maya R. Gupta,et al.  OCR binarization and image pre-processing for searching historical documents , 2007, Pattern Recognit..

[2]  C. Garcia,et al.  Text detection and segmentation in complex color images , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[3]  N. Shobha Rani,et al.  An Efficient Technique for Detection and Removal of Lines with Text Stroke Crossings in Document Images , 2018 .

[4]  Christof Koch,et al.  AdaBoost for Text Detection in Natural Scene , 2011, 2011 International Conference on Document Analysis and Recognition.

[5]  Albert Gordo,et al.  Rosetta: Large Scale System for Text Detection and Recognition in Images , 2018, KDD.

[6]  Nobuo Ezaki,et al.  Text detection from natural scene images: towards a system for visually impaired persons , 2004, ICPR 2004.

[7]  D. Adlakha,et al.  Analytical Comparison between Sobel and Prewitt Edge Detection Techniques , 2016 .

[8]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Shih-Ming Yang,et al.  A fast method for image noise estimation using Laplacian operator and adaptive edge detection , 2008, 2008 3rd International Symposium on Communications, Control and Signal Processing.

[10]  Mubarak Shah,et al.  Image Geo-Localization Based on MultipleNearest Neighbor Feature Matching UsingGeneralized Graphs , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Huizhong Chen,et al.  Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[12]  N. R. Shetty,et al.  Emerging Research in Computing, Information, Communication and Applications , 2016 .

[13]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  N. Shobha Rani,et al.  A font style classification system for English OCR , 2017, 2017 International Conference on Intelligent Computing and Control (I2C2).

[15]  Xiang Bai,et al.  Symmetry-based text line detection in natural scenes , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  오랑 디알라메,et al.  Augmented reality panorama supporting visually imparired individuals , 2011 .

[17]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[18]  Lei Yang,et al.  An improved Sobel edge detection , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[19]  Shijian Lu,et al.  Text Flow: A Unified Text Detection System in Natural Scene Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[20]  Mohamed Ali,et al.  Using the Canny edge detector for feature extraction and enhancement of remote sensing images , 2001, IGARSS 2001. Scanning the Present and Resolving the Future. Proceedings. IEEE 2001 International Geoscience and Remote Sensing Symposium (Cat. No.01CH37217).

[21]  Lei Sun,et al.  Robust Text Detection in Natural Scene Images by Generalized Color-Enhanced Contrasting Extremal Region and Neural Networks , 2014, 2014 22nd International Conference on Pattern Recognition.

[22]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Shuchang Zhou,et al.  EAST: An Efficient and Accurate Scene Text Detector , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Zhuowen Tu,et al.  Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .