Identification and Recognition of Printed Distorted Characters Using Proposed DCR Method

Technological advancement has been developed immensely in two areas over the past few years; these are computer and communication networks. This breakthrough has provided an opportunity for people around the world to access literature and important information. If the database of these manuscripts can be developed and make them available on the internet, the people will access those information as desired. The documents can be well-preserved in digital library so that they cannot be degraded. They are trying to improve the recognition rate in order to get the precise meaning of texts. Distorted character recognition is complicated in contrast with simple plain text images. Sometimes distorted character is seemed to be other character and the accuracy rate drops down drastically. As a result significant information is misplaced. In this paper, we are proposing an algorithm which will improve character recognition rate however it is distorted document image. The accuracy rate of our recommended algorithm is 97%.

[1]  Haddan Mostafavier,et al.  Image Correlation with Geometric Distortion Part 1: Acquisition Performance , 1978, IEEE Transactions on Aerospace and Electronic Systems.

[2]  J. Howley,et al.  Book-Burning and the Uses of Writing in Ancient Rome: Destructive Practice between Literature and Document , 2017, Journal of Roman Studies.

[3]  Krys J. Kochut,et al.  A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques , 2017, ArXiv.

[4]  Yoichi Tomiura,et al.  Improving OCR for Historical Documents by Modeling Image Distortion , 2019, ICADL.

[5]  Qiang Huo,et al.  High performance Chinese OCR based on Gabor features, discriminative feature extraction and model training , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[6]  Matthew N. Dailey,et al.  Automatic Radial Distortion Estimation from a Single Image , 2012, Journal of Mathematical Imaging and Vision.

[7]  Xiang-Gen Xia,et al.  A multiresolution watermark for digital images , 1997, Proceedings of International Conference on Image Processing.

[8]  S. Erturk,et al.  Digital image stabilization with sub-image phase correlation based global motion estimation , 2003, IEEE Trans. Consumer Electron..

[9]  Atul Negi,et al.  Robust Text Line, Word And Character Extraction from Telugu Document Image , 2009, 2009 Second International Conference on Emerging Trends in Engineering & Technology.

[10]  Subhadip Basu,et al.  Design of an Optical Character Recognition System for Camera-based Handheld Devices , 2011, ArXiv.

[11]  Lei Zhang,et al.  Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index , 2013, IEEE Transactions on Image Processing.

[12]  Rohit Prasad,et al.  Improvements in hidden Markov model based Arabic OCR , 2008, 2008 19th International Conference on Pattern Recognition.

[13]  Hermann Ney,et al.  RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts , 2012 .

[14]  M. Nasipuri,et al.  Text line extraction from handwritten document pages using spiral run length smearing algorithm , 2012, 2012 International Conference on Communications, Devices and Intelligent Systems (CODIS).

[15]  S. V. Seeri,et al.  A novel approach for Kannada text extraction , 2012, International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012).

[16]  Xiaojun Zhai,et al.  OCR-based neural network for ANPR , 2012, 2012 IEEE International Conference on Imaging Systems and Techniques Proceedings.

[17]  Yafang Xue,et al.  Optical Character Recognition , 2022 .

[18]  Klas Josephson,et al.  Pose estimation with radial distortion and unknown focal length , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.