Toward Distributed Cursive Writing OCR Systems Based on a Combination of Complementary Approaches

Large amounts of cursive writing documents are still waiting to be computerized for several and different purposes. These documents are in general of medium to low quality; hence they require a sophisticated recognition algorithm capable of properly extracting the correct text from low quality cursive documents. The Dynamic Time Warp (DTW) algorithm is among the most effective algorithms for cursive writing optical character recognition (OCR). However, the DTW is a rather complex task requiring extensive computational capabilities, which hinders its commercial deployment on nonspecialized stand alone machines. Volunteer grids, such as XtremWeb and BOINC, provide viable infrastructures to speed up the DTW execution time. Recent experiments conducted on the Scientific Research Tunisian Grid (SRTG), an XtremWeb volunteer grid, confirmed this claim and showed a very tangible speedup along very high recognition rates. Such infrastructures present several practical advantages, such as the possibility of noncondemnation of the involved computers and the possibility of their simultaneous use by different users and/or applications. Unfortunately, volunteer grid infrastructures are inherently unable to guarantee the continuous availability of the stored data and, more importantly, the engaged processing capacities. Any involved computer may renegade and depart from the system at will, which consequently affects the application performance. Agent technology can be exploited here to solve the problem. In this chapter, we propose a service-oriented grid architecture (SOGA) based on the integration of both grid and agent technologies. An analytical study is conducted to ascertain and evaluate the key performance parameters of our proposed SOGA. The results confirm that our proposal provides a solid and viable solution for the large scale recognition of printed cursive writing based on the DTW algorithm.

[1]  Minjie Zhang,et al.  Agent-Based Grid Computing , 2008, Computational Intelligence: A Compendium.

[2]  Gilles Fedak,et al.  BitDew: a programmable environment for large-scale data management and distribution , 2008, HiPC 2008.

[3]  Richard Wolski,et al.  Dynamically forecasting network performance using the Network Weather Service , 1998, Cluster Computing.

[4]  Steven Tuecke,et al.  The Anatomy of the Grid , 2003 .

[5]  Ian T. Foster,et al.  Die Anatomie des Grid , 2002, Peer-to-Peer: Ökonomische, technische und juristische Perspektiven.

[6]  Alessandro Vinciarelli,et al.  A survey on off-line Cursive Word Recognition , 2002, Pattern Recognit..

[7]  Raphaël Couturier,et al.  Synchronous and asynchronous solution of a 3D transport model in a grid computing environment , 2006 .

[8]  King-Sun Fu,et al.  VLSI architecture for dynamic time-warp recognition of handwritten symbols , 1986, IEEE Trans. Acoust. Speech Signal Process..

[9]  Jean-Luc Gauvain,et al.  A dynamic programming processor for speech recognition , 1989 .

[10]  Abdelfettah Belghith,et al.  A multipurpose multi-agent system based on a loosely coupled architecture to speedup the DTW algorithm for Arabic printed cursive OCR , 2005, The 3rd ACS/IEEE International Conference onComputer Systems and Applications, 2005..

[11]  Nicholas R. Jennings,et al.  The Gaia Methodology for Agent-Oriented Analysis and Design , 2000, Autonomous Agents and Multi-Agent Systems.

[12]  Nicholas R. Jennings,et al.  Intelligent agents: theory and practice , 1995, The Knowledge Engineering Review.

[13]  Michael D. Brown,et al.  An algorithm for connected word recognition , 1982, ICASSP.

[14]  Frédéric Desprez,et al.  Large scale execution of a bioinformatic application on a volunteer grid , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[15]  Robert M. Haralick,et al.  A segmentation-free approach to text recognition with application to Arabic text , 1996, International Journal on Document Analysis and Recognition.

[16]  Victor R. Lesser,et al.  Cooperative Multiagent Systems: A Personal View of the State of the Art , 1999, IEEE Trans. Knowl. Data Eng..

[17]  Zhao Ji-ping Research on Mobile Agent Security , 2005 .

[18]  M. Khemakhem,et al.  A P2P Grid Architecture for Distributed Arabic Ocr Based on the Dtw Algorithm , 2009 .

[19]  Caroline C. Hayes,et al.  Agents in a Nutshell - A Very Brief Introduction , 1999, IEEE Trans. Knowl. Data Eng..

[20]  Nicholas R. Jennings,et al.  A methodology for agent-oriented analysis and design , 1999, AGENTS '99.

[21]  Nicholas R. Jennings,et al.  Brain Meets Brawn: Why Grid and Agents Need Each Other , 2004, Towards the Learning Grid.

[22]  Jacques Ferber,et al.  Les Systèmes multi-agents: vers une intelligence collective , 1995 .

[23]  Clement Jonquet,et al.  Dynamic Service Generation: Agent interactions for service exchange on the Grid. (Génération dynamique de service : Interactions entre agents pour l'échange de service Grid) , 2006 .

[24]  Edson Cáceres,et al.  Parallel dynamic programming for solving the string editing problem on a CGM/BSP , 2002, SPAA '02.

[25]  Amal El Fallah-Seghrouchni,et al.  Les systèmes multi-agents , 2006 .

[26]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[27]  Maher Khemakhem Reconnaissance de caractères imprimés par comparaison dynamique , 1987 .

[28]  S. Kanoun.,et al.  Reconnaissance d'images de textes arabes par approche affixale , 2004 .

[29]  Neil W. Bergmann,et al.  An Arabic optical character recognition system using recognition-based segmentation , 2001, Pattern Recognit..

[30]  Adnan Amin,et al.  Off-line Arabic character recognition: the state of the art , 1998, Pattern Recognit..

[31]  Piero Mella Complex Systems vs. Simplex Systems (^) The Behaviour of Collectivities following the Combinatory System View , 2002 .