Efficient Removal of Noisy Borders of Monochromatic Documents

Very often the digitalization process using automatically fed production line scanners yields monochromatic images framed by a noisy border. This paper presents a pre-processing scheme based on sub sampling which speeds up the border removal process. The technique introduced was tested on over 20,000 images and provided same quality images than the best algorithm in the literature and amongst commercial tools with an average speed-up around 50%.

[1]  Lawrence O'Gorman,et al.  Executive briefing : document image analysis , 1997 .

[2]  Henry S. Baird,et al.  Document image defect models and their uses , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[3]  D.X. Le,et al.  Automated borders detection and adaptive segmentation for binary document images , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[4]  Marc Berger,et al.  Computer Graphics With Pascal , 1986 .

[5]  Lawrence O'Gorman,et al.  Document Image Analysis , 1996 .

[6]  Rafael Dueire Lins,et al.  An environment for processing images of historical documents , 1994, Microprocess. Microprogramming.

[7]  Rafael Dueire Lins,et al.  Comparative study of file formats for image storage and transmission , 2004, J. Electronic Imaging.

[8]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[9]  Lawrence O'Gorman,et al.  Document Image Analysis: An Executive Briefing , 1997 .

[10]  Kuo-Chin Fan,et al.  Marginal noise removal of document images , 2002, Pattern Recognit..

[11]  Mohamed S. Kamel,et al.  Image Analysis and Recognition , 2014, Lecture Notes in Computer Science.

[12]  Rafael Dueire Lins,et al.  A new algorithm for removing noisy borders from monochromatic documents , 2004, SAC '04.

[13]  Rafael Dueire Lins,et al.  BigBatch: a document processing platform for clusters and grids , 2008, SAC '08.

[14]  Robert M. Haralick,et al.  Global and local document degradation models , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[15]  Clifford Stein,et al.  Introduction to Algorithms, 2nd edition. , 2001 .