Behaviour-Based Clustering of Neural Networks

The field of off-line optical character recognition (OCR) has been a topic of intensive research for many years (Bozinovic, 1989; Bunke, 2003; Plamondon, 2000; Toselli, 2004). One of the first steps in the classical architecture of a text recognizer is preprocessing, where noise reduction and normalization take place. Many systems do not require a binarization step, so the images are maintained in gray-level quality. Document enhancement not only influences the overall performance of OCR systems, but it can also significantly improve document readability for human readers. In many cases, the noise of document images is heterogeneous, and a technique fitted for one type of noise may not be valid for the overall set of documents. One possible solution to this problem is to use several filters or techniques and to provide a classifier to select the appropriate one. Neural networks have been used for document enhancement (see (Egmont-Petersen, 2002) for a review of image processing with neural networks). One advantage of neural network filters for image enhancement and denoising is that a different neural filter can be automatically trained for each type of noise. This work proposes the clustering of neural network filters to avoid having to label training data and to reduce the number of filters needed by the enhancement system. An agglomerative hierarchical clustering algorithm of supervised classifiers is proposed to do this. The technique has been applied to filter out the background noise from an office (coffee stains and footprints on documents, folded sheets with degraded printed text, etc.). BACKGROUND

[1]  Enrique Fernández-Blanco,et al.  2D-PAGE Analysis Using Evolutionary Computation , 2009, Encyclopedia of Artificial Intelligence.

[2]  Hermann Ney,et al.  Integrated Handwriting Recognition And Interpretation Using Finite-State Models , 2004, Int. J. Pattern Recognit. Artif. Intell..

[3]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Kevin Curran,et al.  Ubiquitous Developments in Ambient Computing and Intelligence: Human-Centered Applications , 2011 .

[5]  Sargur N. Srihari,et al.  Off-Line Cursive Script Word Recognition , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Alejandro Pazos Sierra,et al.  Encyclopedia of Artificial Intelligence , 2008 .

[7]  Kenji Suzuki,et al.  Neural Edge Enhancer for Supervised Edge Enhancement from Noisy Images , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Habib Hamam,et al.  DeepKøver: An Adaptive Intelligent Assistance System for Monitoring Impaired People in Smart Homes , 2011 .

[9]  Vijayan Sugumaran Intelligent, Adaptive and Reasoning Technologies: New Developments and Applications , 2011 .

[10]  Jesús Favela,et al.  Adaptive Awareness of Hospital Patient Information through Multiple Sentient Displays , 2009, Int. J. Ambient Comput. Intell..

[11]  Charles C. Willow A Neural Network-Based Agent Framework for Mail Server Management , 2005, Int. J. Intell. Inf. Technol..

[12]  David Villa,et al.  How Intelligent Are Ambient Intelligence Systems? , 2010, Int. J. Ambient Comput. Intell..

[13]  Michael Egmont-Petersen,et al.  Image processing with neural networks - a review , 2002, Pattern Recognit..

[14]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[15]  Tapas Kanungo,et al.  Estimating degradation model parameters using neighborhood pattern distributions: an optimization approach , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  V. Sugumaran The Inaugural Issue of the International Journal of Intelligent Information Technologies , 2005 .