Retraction Note: Document image analysis: issues, comparison of methods and remaining problems

Image analysis is an interesting research area with a large variety of challenging applications. Researchers have worked on this topic for decades, as the scientific literature attests. Document image analysis, however, is a special case of image analysis, because the spatial properties of document images differ from those of natural images. The main focus of this paper is therefore to describe image denoising issues in general and document image issues in particular. Because the field of document processing is relatively new and still dynamic, current methods have room for improvement and innovations continue to appear. Several algorithms proposed in the literature are described. The current status of the field is discussed critically, and open problems are highlighted. It is also demonstrated that there is rarely a definitive technique for all cases of a given problem. We surveyed the state of the art, analyzed recent trends, and tried to identify challenges for future research in this field.
