Development of OCR system on android platforms to aid reading with a refreshable braille display in real time

Abstract Individuals with visual impairment are limited in terms of communication, interaction and personal autonomy due to the lack of literature in Braille which is mainly attributable to economic reasons. This paper proposes a reading system for visually impaired persons using a portable device. This work proposes and evaluates a combination of segmentation, feature extraction and machine learning techniques to achieve the best conversion of text to braille as quickly and accurately as possible. The experiments showed that the Central Moments extractor with Multi Layer Perceptron were the best combination for the OCR system developed with 99.86% accuracy and 99.93% specificity. Furthermore, we assess the portable device usability with elementary teachers and with teachers and students in an association of the blind. The results of this system can contribute to improved socialization between visually impaired persons and stimulate their intellectual health.

[1]  João Paulo Papa,et al.  Ultrasonic Sensor Signals and Optimum Path Forest Classifier for the Microstructural Characterization of Thermally-Aged Inconel 625 Alloy , 2015, Sensors.

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  Seong-Hwan Yoon,et al.  Development of Braille Block for Visually-Impaired Persons using Unsaturated Polyester Resin☆ , 2016 .

[4]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[5]  Majid Mirmehdi,et al.  Recognizing Text-Based Traffic Signs , 2015, IEEE Transactions on Intelligent Transportation Systems.

[6]  Gretchen A. Stevens,et al.  Global prevalence of vision impairment and blindness: magnitude and temporal trends, 1990-2010. , 2013, Ophthalmology.

[7]  Alan M. Braga,et al.  A new approach to calculate the nodule density of ductile cast iron graphite using a Level Set , 2016 .

[8]  Yingli Tian,et al.  Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons , 2014, IEEE/ASME Transactions on Mechatronics.

[9]  Jon Atli Benediktsson,et al.  Retrieval of the Height of Buildings From WorldView-2 Multi-Angular Imagery Using Attribute Filters and Geometric Invariant Moments , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[10]  Bruce W. Suter,et al.  The multilayer perceptron as an approximation to a Bayes optimal discriminant function , 1990, IEEE Trans. Neural Networks.

[11]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[12]  João Paulo Papa,et al.  A path- and label-cost propagation approach to speedup the training of the optimum-path forest classifier , 2014, Pattern Recognit. Lett..

[13]  Rizqi Andry Ardiansyah,et al.  Design of An Electronic Narrator on Assistant Robot for Blind People , 2016 .

[14]  João Paulo Papa,et al.  Supervised pattern classification based on optimum-path forest , 2009 .

[15]  Dominik Spinczyk,et al.  Tutoring math platform accessible for visually impaired people , 2017, Comput. Biol. Medicine.

[16]  Naixue Xiong,et al.  A Fingerprint Recognition Scheme Based on Assembling Invariant Moments for Cloud Computing Communications , 2011, IEEE Systems Journal.

[17]  Gao Pengcheng,et al.  Fast Chinese calligraphic character recognition with large-scale data , 2015, Multimedia Tools and Applications.

[18]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[19]  Lei Han,et al.  A Novel Character Recognition Algorithm Based on Hidden Markov Models , 2009, AICI.

[20]  S. Mad Saad,et al.  Development of piezoelectric Braille cell control system using microcontroller unit (MCU) , 2010 .

[21]  Victor Hugo C. de Albuquerque,et al.  A novel Vickers hardness measurement technique based on Adaptive Balloon Active Contour Method , 2016, Expert Syst. Appl..

[22]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[24]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[25]  Nuno Fonseca,et al.  Camera Reading for Blind People , 2014 .

[26]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[27]  Fatih Basçiftçi,et al.  An interactive and multi-functional refreshable Braille device for the visually impaired , 2016, Displays.

[28]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[29]  Roland T. Chin,et al.  On the Detection of Dominant Points on Digital Curves , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Hanan Samet,et al.  Efficient Component Labeling of Images of Arbitrary Dimension Represented by Linear Bintrees , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Mohammed Zaki,et al.  Design of an Embedded Arabic Optical Character Recognition , 2013, J. Signal Process. Syst..

[32]  S. Sathiya Keerthi,et al.  Which Is the Best Multiclass SVM Method? An Empirical Study , 2005, Multiple Classifier Systems.

[33]  Banshidhar Majhi,et al.  Mammogram classification using two dimensional discrete wavelet transform and gray-level co-occurrence matrix for detection of breast cancer , 2015, Neurocomputing.

[34]  H. Taylor,et al.  Number of People Blind or Visually Impaired by Cataract Worldwide and in World Regions, 1990 to 2010. , 2015, Investigative ophthalmology & visual science.

[35]  Seth R Flaxman,et al.  Visual impairment and blindness due to macular diseases globally: a systematic review and meta-analysis. , 2014, American journal of ophthalmology.

[36]  João Paulo Papa,et al.  Efficient supervised optimum-path forest classification for large datasets , 2012, Pattern Recognit..

[37]  Hans Limburg,et al.  Global Vision Impairment and Blindness Due to Uncorrected Refractive Error, 1990–2010 , 2016, Optometry and vision science : official publication of the American Academy of Optometry.

[38]  Jürgen Beyerer,et al.  Performance improvement of character recognition in industrial applications using prior knowledge for more reliable segmentation , 2013, Expert Syst. Appl..

[39]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Jan Flusser,et al.  Projection Operators and Moment Invariants to Image Blurring , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  R.M. Haralick,et al.  Statistical and structural approaches to texture , 1979, Proceedings of the IEEE.

[42]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[43]  K. Ragavi,et al.  Portable Text to Speech Converter for the Visually Impaired , 2016 .

[44]  Mohammed S. Sayed,et al.  An efficient algorithm for Arabic optical font recognition using scale-invariant detector , 2015, International Journal on Document Analysis and Recognition (IJDAR).

[45]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[46]  João Paulo Papa,et al.  EEG signal classification for epilepsy diagnosis via optimum path forest - A systematic assessment , 2014, Neurocomputing.

[47]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[48]  Gernot A. Fink,et al.  Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models , 2016, Pattern Recognit..

[49]  Victor Hugo C. de Albuquerque,et al.  Brazilian vehicle identification using a new embedded plate recognition system , 2015 .