Fuzzy Techniques for Text Localisation in Images

Summary. Text information extraction represents a fundamental issue in the context of digital image processing. Inside this wide area of research, a number of specific tasks can be identified ranging from text detection to text recognition. In this chapter, we deal with the particular problem of text localisation, which aims at determining the exact location where the text is situated inside a document image. The strict connection between text localisation and image segmentation is highlighted in the chapter and a review of methods for image segmentation is proposed. Particularly, the benefits coming from the employment of fuzzy and neuro-fuzzy techniques in this field is assessed, thus indicating a way to combine Computational Intelligence methods and document image analysis. Three peculiar methods based on image segmentation are presented to show different applications of fuzzy and neuro-fuzzy techniques in the context of text localisation.

[1]  Anil K. Jain,et al.  Text segmentation using gabor filters for automatic document processing , 1992, Machine Vision and Applications.

[2]  C. H. Chen,et al.  Handbook of Pattern Recognition and Computer Vision , 1993 .

[3]  Yi Lu,et al.  Character segmentation in handwritten words - An overview , 1996, Pattern Recognit..

[4]  Rama Chellappa,et al.  Unsupervised segmentation of polarimetric SAR data using the covariance matrix , 1992, IEEE Trans. Geosci. Remote. Sens..

[5]  Seong-Whan Lee,et al.  A New Methodology for Gray-Scale Character Segmentation and Recognition , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Anil K. Jain,et al.  Document Representation and Its Application to Page Decomposition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  P.V.C. Hough,et al.  Machine Analysis of Bubble Chamber Pictures , 1959 .

[8]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[9]  Datong Chen,et al.  Text enhancement with asymmetric filter for video OCR , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[10]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[11]  Robert Fullér,et al.  Introduction to neuro-fuzzy systems , 1999, Advances in soft computing.

[12]  Ciro Castiello,et al.  Neuro-fuzzy Analysis of Document Images by the KERNEL System , 2005, WILF.

[13]  Guanrong Chen,et al.  Introduction to Fuzzy Sets, Fuzzy Logic, and Fuzzy Control Systems , 2000 .

[14]  Karim Hadjar,et al.  Newspaper page decomposition using a split and merge approach , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[15]  Korris Fu-Lai Chung,et al.  Hybrid Chinese/English text detection in images and video frames , 2002, Object recognition supported by user interaction for service robots.

[16]  JungHyun Han,et al.  Hybrid approach to efficient text extraction in complex color images , 2004, Pattern Recognit. Lett..

[17]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[18]  José Manuel Rebordão,et al.  An amplitude segmentation method based on the distribution function of an image , 1984, Comput. Vis. Graph. Image Process..

[19]  Motoi Iwata,et al.  Segmentation of Page Images Using the Area Voronoi Diagram , 1998, Comput. Vis. Image Underst..

[20]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[21]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[22]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..

[23]  Nicolai Petkov,et al.  Comparison of texture features based on Gabor filters , 2002, IEEE Trans. Image Process..

[24]  Adnan Amin,et al.  Automatic thresholding of gray-level using multistage approach , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[25]  Yaochu Jin,et al.  Advanced fuzzy systems design and applications , 2003, Studies in Fuzziness and Soft Computing.

[26]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[28]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Jean-Pierre Desclés,et al.  EXCOM: An Automatic Annotation Engine for Semantic Information , 2006, FLAIRS.

[30]  R.M. Haralick,et al.  Statistical and structural approaches to texture , 1979, Proceedings of the IEEE.

[31]  Jean-Marc Odobez,et al.  Text segmentation and recognition in complex background based on Markov random field , 2002, Object recognition supported by user interaction for service robots.

[32]  Ciro Castiello,et al.  MULTISCALE PAGE SEGMENTATION USING WAVELET PACKET ANALYSIS , 2007 .

[33]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[34]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  S.C. Hinds,et al.  A document skew detection method using run-length encoding and the Hough transform , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[36]  Etienne E. Kerre,et al.  Defuzzification: criteria and classification , 1999, Fuzzy Sets Syst..

[37]  Chuen-Tsai Sun,et al.  Neuro-fuzzy modeling and control , 1995, Proc. IEEE.

[38]  Azriel Rosenfeld,et al.  Histogram concavity analysis as an aid in threshold selection , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[39]  Anil K. Jain,et al.  Texture Analysis , 2018, Handbook of Image Processing and Computer Vision.

[40]  Wen-Hsiang Tsai,et al.  Document image segmentation and quality improvement by moiré pattern analysis , 2000, Signal Process. Image Commun..

[41]  Ciro Castiello,et al.  Document page segmentation using neuro-fuzzy approach , 2008, Appl. Soft Comput..

[42]  Paul Scheunders,et al.  Wavelets for texture analysis, an overview , 1997 .

[43]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..

[44]  Shigeru Akamatsu,et al.  Recognizing Characters in Scene Images , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Alexiei Dingli,et al.  Automatic semantic annotation using unsupervised information extraction and integration , 2003 .

[46]  Henri Prade,et al.  What are fuzzy rules and how to use them , 1996, Fuzzy Sets Syst..

[47]  C. S. George Lee,et al.  Neural fuzzy systems: a neuro-fuzzy synergism to intelligent systems , 1996 .

[48]  Hong-Ye Gao,et al.  Applied wavelet analysis with S-plus , 1996 .

[49]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[50]  Mausumi Acharyya,et al.  Document image segmentation using wavelet scale-space features , 2002, IEEE Trans. Circuits Syst. Video Technol..

[51]  Alan Watt,et al.  The computer image , 1998 .

[52]  Giovanna Castellano,et al.  Knowledge discovery by a neuro-fuzzy modeling framework , 2005, Fuzzy Sets Syst..

[53]  L O Hall,et al.  Review of MR image segmentation techniques using pattern recognition. , 1993, Medical physics.

[54]  M. Sugeno,et al.  Structure identification of fuzzy model , 1988 .

[55]  Daniel P. Lopresti,et al.  Finding text in color images , 1998, Electronic Imaging.

[56]  Florence Rossant,et al.  A global method for music symbol recognition in typeset music sheets , 2002, Pattern Recognit. Lett..

[57]  Bart Kosko,et al.  Neural networks and fuzzy systems: a dynamical systems approach to machine intelligence , 1991 .

[58]  George J. Klir,et al.  Fuzzy Sets, Fuzzy Logic, and Fuzzy Systems - Selected Papers by Lotfi A Zadeh , 1996, Advances in Fuzzy Systems - Applications and Theory.

[59]  Dzung L. Pham,et al.  Spatial Models for Fuzzy Clustering , 2001, Comput. Vis. Image Underst..

[60]  David S. Doermann,et al.  Superresolution-based enhancement of text in digital video , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[61]  Sushmita Mitra,et al.  Neuro-fuzzy rule generation: survey in soft computing framework , 2000, IEEE Trans. Neural Networks Learn. Syst..

[62]  Shahram Latifi,et al.  Document segmentation using polynomial spline wavelets , 2001, Pattern Recognit..

[63]  Hans-Jürgen Zimmermann,et al.  Introduction to Fuzzy Sets , 1985 .

[64]  Apostolos Antonacopoulos,et al.  Two approaches for text segmentation in Web images , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[65]  Mahesh Viswanathan,et al.  A prototype document image analysis system for technical journals , 1992, Computer.

[66]  Wan-Chi Siu,et al.  Multimedia Information Retrieval and Management: Technological Fundamentals and Applications , 2010 .

[67]  Daniel P. Lopresti,et al.  OCR for World Wide Web images , 1997, Electronic Imaging.

[68]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[69]  Hong Yan,et al.  Text region extraction in a document image based on the Delaunay tessellation , 2003, Pattern Recognit..

[70]  Andries P. Engelbrecht,et al.  Computational Intelligence: An Introduction , 2002 .

[71]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[72]  Włodzisław Duch,et al.  Quo vadis, computational intelligence? , 2004 .

[73]  Goh Wee Leng,et al.  Text segmentation for automatic document processing , 1995 .

[74]  Sing-Tze Bow,et al.  Pattern recognition and image preprocessing , 1992 .

[75]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[76]  Rafael C. González,et al.  Digital image processing, 3rd Edition , 2008 .

[77]  C. V. Jawahar,et al.  Fuzzy statistics of digital images , 1996, IEEE Signal Processing Letters.

[78]  Constantin Orasan,et al.  Automatic Annotation of Corpora for Text Summarisation: A Comparative Study , 2005, CICLing.

[79]  Rama Chellappa,et al.  Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[80]  David Doermann,et al.  Text enhancement in digital video , 1999, Electronic Imaging.

[81]  Ronald R. Coifman,et al.  Wavelet analysis and signal processing , 1990 .

[82]  Ching Y. Suen,et al.  Chinese document layout analysis based on adaptive split-and-merge and qualitative spatial reasoning , 1997, Pattern Recognit..

[83]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[84]  Fuhui Long,et al.  Fundamentals of Content-Based Image Retrieval , 2003 .

[85]  John W. Sammon,et al.  An Optimal Discriminant Plane , 1970, IEEE Transactions on Computers.

[86]  Henry S. Baird,et al.  Image segmentation by shape-directed covers , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[87]  Yan Solihin,et al.  Integral Ratio: A New Class of Global Thresholding Techniques for Handwriting Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[88]  Venu Govindaraju,et al.  Analysis of textual images using the Hough transform , 1989, Machine Vision and Applications.

[89]  Lina J. Karam,et al.  Morphological text extraction from images , 2000, IEEE Trans. Image Process..

[90]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[91]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[92]  Alberto Del Bimbo,et al.  Semantics in Visual Information Retrieval , 1999, IEEE Multim..

[93]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .