论文信息 - A Parallel Neuromorphic Text Recognition System and Its Implementation on a Heterogeneous High-Performance Computing Cluster

A Parallel Neuromorphic Text Recognition System and Its Implementation on a Heterogeneous High-Performance Computing Cluster

Given the recent progress in the evolution of high-performance computing (HPC) technologies, the research in computational intelligence has entered a new era. In this paper, we present an HPC-based context-aware intelligent text recognition system (ITRS) that serves as the physical layer of machine reading. A parallel computing architecture is adopted that incorporates the HPC technologies with advances in neuromorphic computing models. The algorithm learns from what has been read and, based on the obtained knowledge, it forms anticipations of the word and sentence level context. The information processing flow of the ITRS imitates the function of the neocortex system. It incorporates large number of simple pattern detection modules with advanced information association layer to achieve perception and recognition. Such architecture provides robust performance to images with large noise. The implemented ITRS software is able to process about 16 to 20 scanned pages per second on the 500 trillion floating point operations per second (TFLOPS) Air Force Research Laboratory (AFRL)/Information Directorate (RI) Condor HPC after performance optimization.

[1] Anil K. Jain,et al. Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[2] B. N. Chatterji. Feature Extraction Methods for Character Recognition , 1986 .

[3] Darko Brodic,et al. Preprocessing of binary document images by morphological operators , 2011, 2011 Proceedings of the 34th International Convention MIPRO.

[4] Yuxin Peng,et al. Using Multiple Frame Integration for the Text Recognition of Video , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[5] Majid Ahmadi,et al. Pattern recognition with moment invariants: A comparative study and new results , 1991, Pattern Recognit..

[6] Majid Ahmadi,et al. Handwritten numeral recognition with multiple features and multistage classifiers , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.

[7] Peter Weinstein,et al. Towards a Complete, Multi-level Cognitive Architecture , 2007 .

[8] Robert Hecht-Nielsen. Confabulation theory - the mechanism of thought , 2007 .

[9] Mohamad H. Hassoun,et al. Associative neural memories , 1993 .

[10] R. Smith,et al. An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[11] Eddy Muntina Dharma,et al. Japanese character (Kana) pattern recognition application using neural network , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[12] Alireza Khotanzad,et al. Invariant Image Recognition by Zernike Moments , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[13] W. Bruce Croft,et al. Probabilistic Retrieval of OCR Degraded Text Using N-Grams , 1997, ECDL.

[14] Mindy Bokser,et al. Omnidocument technologies , 1992, Proc. IEEE.

[15] Craig A. Knoblock,et al. Recognition of Multi-oriented, Multi-sized, and Curved Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[16] Thomas M. Breuel,et al. The OCRopus open source OCR system , 2008, Electronic Imaging.

[17] Venu Govindaraju,et al. Topic based language models for OCR correction , 2008, AND '08.

[18] Silke Wagner,et al. Using web search engines to improve text recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[19] Faisal Shafait,et al. Document image analysis with OCRopus , 2009, 2009 IEEE 13th International Multitopic Conference.

[20] Stephen A. Ritz,et al. Distinctive features, categorical perception, and probability learning: some applications of a neural model , 1977 .

[21] J. Mantas,et al. An overview of character recognition methodologies , 1986, Pattern Recognit..

[22] Ching Y. Suen,et al. Historical review of OCR research and development , 1992, Proc. IEEE.

[23] Patrick van der Smagt,et al. Introduction to neural networks , 1995, The Lancet.

[24] Klaus Kofler,et al. Performance and Scalability of GPU-Based Convolutional Neural Networks , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.

[25] Simon M. Lucas,et al. A Comparison of Syntactic and Statistical Techniques for Off-Line OCR , 1994, ICGI.

[26] Ching Y. Suen,et al. An Evaluation of Parallel Thinning Algorithms for Character Recognition , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27] László Györfi,et al. A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[28] Qing Wu,et al. Confabulation based sentence completion for machine reading , 2011, 2011 IEEE Symposium on Computational Intelligence, Cognitive Algorithms, Mind, and Brain (CCMB).

[29] Lucian-Ovidiu Fedorovici,et al. Improved neural network OCR based on preprocessed blob classes , 2010, 2010 International Joint Conference on Computational Cybernetics and Technical Informatics.

[30] Abraham Schultz. Collective recall via the brain-state-in-a-box network , 1993, IEEE Trans. Neural Networks.

[31] ChengXiang Zhai,et al. A Content-based Probabilistic Correction Model for OCR Document Retrieval , 2002 .

[32] Taskeen Nadkar,et al. OCR-based chassis-number recognition using artificial neural networks , 2009, 2009 IEEE International Conference on Vehicular Electronics and Safety (ICVES).

[33] Qing Wu,et al. Performance optimization for pattern recognition using associative neural memory , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[34] George Nagy,et al. 29 Optical character recognition - Theory and practice , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.