Development of a character CAPTCHA recognition system for the visually impaired community using deep learning

This study proposed an assistive system to recognize the special character CAPTCHAs for the visually impaired community in the Chinese region. To improve the recognition precision, a convolutional neural network (CNN), which is named Captchanet for recognition, was proposed. Firstly, a ten-layer network architecture was designed and three improved training strategies were proposed for the deep learning model. Secondly, a customized Chinese character training set was designed using a novel and fast method, with the view of overcoming the limitation in labeled data collection and uneven data distribution. Finally, the experiments were conducted on the test set gathered from public websites to test the effectiveness of the proposed Captchanet. The statistical results demonstrated that the Captchanet has better classification performance and has obtained higher success rates of recognition than the well-known machine learning approaches and CNN-based approaches.

[1]  Kai Chen,et al.  A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling , 2017, IEEE Transactions on Multimedia.

[2]  Gaoliang Peng,et al.  A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load , 2018, Mechanical Systems and Signal Processing.

[3]  Hui Gao,et al.  Segmentation of connected characters in text-based CAPTCHAs for intelligent character recognition , 2016, Multimedia Tools and Applications.

[4]  Fan Lin,et al.  Chinese Character CAPTCHA Recognition and performance estimation via deep neural network , 2018, Neurocomputing.

[5]  Jing Xu,et al.  Image Enhancement for Surveillance Video of Coal Mining Face Based on Single-Scale Retinex Algorithm Combined with Bilateral Filtering , 2017, Symmetry.

[6]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[7]  Hang Joon Kim,et al.  On-line recognition of handwritten chinese characters based on hidden markov models , 1997, Pattern Recognit..

[8]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Shie Mannor,et al.  A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[10]  Ching Y. Suen,et al.  A fast parallel algorithm for thinning digital patterns , 1984, CACM.

[11]  Oleg Starostenko,et al.  Breaking text-based CAPTCHAs with variable word and character orientation , 2015, Pattern Recognit..

[12]  Jiri Matas,et al.  Systematic evaluation of CNN advances on the ImageNet , 2016, ArXiv.

[13]  Laurent Wendling,et al.  Character recognition based on non-linear multi-projection profiles measure , 2015, Frontiers of Computer Science.

[14]  Weiming Jiang,et al.  DenseNet with Up-Sampling block for recognizing texts in images , 2019, Neural Computing and Applications.

[15]  Yike Guo,et al.  A machine learning attack against variable-length Chinese character CAPTCHAs , 2018, Applied Intelligence.

[16]  Xiangyang Luo,et al.  Selective Learning Confusion Class for Text-Based CAPTCHA Recognition , 2019, IEEE Access.

[17]  Jun Sun,et al.  Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition , 2017, Pattern Recognit..

[18]  Fei Yin,et al.  Discriminative quadratic feature learning for handwritten Chinese character recognition , 2016, Pattern Recognit..

[19]  Xiangyang Luo,et al.  A Survey on Breaking Technique of Text-Based CAPTCHA , 2017, Secur. Commun. Networks.

[20]  Jing Wang,et al.  CAPTCHA recognition based on deep convolutional neural network. , 2019, Mathematical biosciences and engineering : MBE.