Adaptive Methods for Robust Document Image Understanding

A vast amount of digital document material is continuously being produced as part of major digitization efforts around the world. In this context, generic and efficient automatic solutions for document image understanding represent a stringent necessity. We propose a generic framework for document image understanding systems, usable for practically any document types available in digital form. Following the introduced workflow, we shift our attention to each of the following processing stages in turn: quality assurance, image enhancement, color reduction and binarization, skew and orientation detection, page segmentation and logical layout analysis. We review the state of the art in each area, identify current deficiencies, point out promising directions and give specific guidelines for future investigation. We address some of the identified issues by means of novel algorithmic solutions putting special focus on generality, computational efficiency and the exploitation of all available sources of information. More specifically, we introduce the following original methods: a fully automatic detection of color reference targets in digitized material, accurate foreground extraction from color historical documents, font enhancement for hot metal typesetted prints, a theoretically optimal solution for the document binarization problem from both computational complexityand threshold selection point of view, a layout-independent skew and orientation detection, a robust and versatile page segmentation method, a semi-automatic front page detection algorithm and a complete framework for article segmentation in periodical publications. The proposed methods are experimentally evaluated on large datasets consisting of real-life heterogeneous document scans. The obtained results show that a document understanding system combining these modules is able to robustly process a wide variety of documents with good overall accuracy.

[1]  Karl-Michael Schneider,et al.  Information extraction from calls for papers with conditional random fields and layout features , 2006, Artificial Intelligence Review.

[2]  Jeffrey J. Rodriguez,et al.  Image classification based on focus , 2008, 2008 15th IEEE International Conference on Image Processing.

[3]  Thomas M. Breuel,et al.  Performance Comparison of Six Algorithms for Page Segmentation , 2006, Document Analysis Systems.

[4]  Christoph Seibert,et al.  Constant-Time Locally Optimal Adaptive Binarization , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[5]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[6]  Abdel Belaïd,et al.  Page segmentation by segment tracing , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[7]  Haiping Lu,et al.  Distance-reciprocal distortion measure for binary document images , 2004, IEEE Signal Processing Letters.

[8]  Yizhou Wang,et al.  Decomposing Document Images by Heuristic Search , 2007, EMMCVPR.

[9]  Patrick Hébert,et al.  Median Filtering in Constant Time , 2007, IEEE Transactions on Image Processing.

[10]  Changjun Li,et al.  The CIECAM02 Color Appearance Model , 2002, CIC.

[11]  Tim Ritchings,et al.  Representation and classification of complex-shaped printed regions using white tiles , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[12]  Faisal Shafait Document Image Dewarping Contest , 2007 .

[13]  Thomas M. Breuel,et al.  Resolution independent skew and orientation detection for document images , 2009, Electronic Imaging.

[14]  T. Kanade,et al.  Color information for region segmentation , 1980 .

[15]  Mark S. Drew,et al.  Spatio-chromatic decorrelation for color image compression , 2008, Signal Process. Image Commun..

[16]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[17]  Naohiro Amamoto,et al.  Block segmentation and text area extraction of vertically/horizontally written document , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[18]  Bidyut Baran Chaudhuri,et al.  An improved document skew angle estimation technique , 1996, Pattern Recognit. Lett..

[19]  Yang Xue,et al.  Uniform color spaces based on CIECAM02 and IPT color difference equations , 2008 .

[20]  Dan S. Bloomberg,et al.  Measuring document image skew and orientation , 1995, Electronic Imaging.

[21]  Venu Govindaraju,et al.  Large scale address recognition systems Truthing, testing, tools, and other evaluation issues , 2002, International Journal on Document Analysis and Recognition.

[22]  Robert M. Haralick,et al.  Recursive X-Y cut using bounding boxes of connected components , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[23]  Robert M. Haralick,et al.  Document image understanding: geometric and logical layout , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Philip N. Klein,et al.  Recognition of shapes by editing their shock graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  L. O'Gorman Image and document processing techniques for the RightPages electronic library system , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[26]  Cecilia Di Ruberto,et al.  Recognition of shapes by attributed skeletal graphs , 2004, Pattern Recognit..

[27]  Jonathan J. Hull Document Image skew Detection: Survey and Annotated Bibliography , 1996, DAS.

[28]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[29]  A. Carbonaro,et al.  A comprehensive approach to image-contrast enhancement , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[30]  Hrishikesh B. Aradhye A generic method for determining up/down orientation of text in roman and non-roman scripts , 2005, Pattern Recognit..

[31]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[32]  A. Chalechale,et al.  Edge image description using angular radial partitioning , 2004 .

[33]  Michelangelo Ceci,et al.  Machine learning methods for automatically processing historical documents: from paper acquisition to XML transformation , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[34]  Luc Vincent,et al.  Google Book Search: Document Understanding on a Massive Scale , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[35]  Daniel Cremers,et al.  Fast Matching of Planar Shapes in Sub-cubic Runtime , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[36]  Andrew McCallum,et al.  Collective Segmentation and Labeling of Distant Entities in Information Extraction , 2004 .

[37]  R. Furmaniak Unsupervised Newspaper Segmentation Using Language Context , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[38]  Christian Bauckhage,et al.  The Good, the Bad, and the Ugly: Predicting Aesthetic Image Labels , 2010, 2010 20th International Conference on Pattern Recognition.

[39]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[40]  David S. Doermann,et al.  Stroke-Like Pattern Noise Removal in Binary Document Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[41]  Hamid K. Aghajan,et al.  Estimation of skew angle in text-image analysis bySLIDE: Subspace-based line detection , 2005, Machine Vision and Applications.

[42]  Nikos A. Nikolaou,et al.  Color reduction for complex document images , 2009, Int. J. Imaging Syst. Technol..

[43]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[44]  Henry S. Baird,et al.  Towards Versatile Document Analysis Systems , 2006, Document Analysis Systems.

[45]  Mohammed Atiquzzaman,et al.  A Robust Hough Transform Technique for Complete Line Segment Description , 1995, Real Time Imaging.

[46]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Baozong Yuan,et al.  Isothetic polygon representation for contours , 1992, CVGIP Image Underst..

[48]  R. Hunter Photoelectric Color Difference Meter , 1958 .

[49]  Abdel Belaïd,et al.  Document Logical Structure Analysis Based on Perceptive Cycles , 2006, Document Analysis Systems.

[50]  C. Clausner,et al.  Historical Document Layout Analysis Competition , 2011, 2011 International Conference on Document Analysis and Recognition.

[51]  Apostolos Antonacopoulos,et al.  ICDAR 2009 Page Segmentation Competition , 2003, 2009 10th International Conference on Document Analysis and Recognition.

[52]  Robert Geist,et al.  Re‐coloring Images for Gamuts of Lower Dimension , 2005, Comput. Graph. Forum.

[53]  Andrew D. Bagdanov,et al.  Projection profile based skew estimation algorithm for JBIG compressed images , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[54]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[55]  W. Guitang,et al.  A new method for image segmentation , 2009, 2009 Asia-Pacific Conference on Computational Intelligence and Industrial Applications (PACIIA).

[56]  Haruo Asada,et al.  Major components of a complete text reading system , 1992 .

[57]  Richard Rogers,et al.  UW-ISL document image analysis toolbox: an experimental environment , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[58]  Patrick Kelly,et al.  Quality assessment and restoration of typewritten document images , 1999, International Journal on Document Analysis and Recognition.

[59]  Horng-Jinh Chang,et al.  On Sample Size in Using Central Limit Theorem for Gamma Distribution , 2008 .

[60]  Gerhard Paass,et al.  Machine Learning for Document Structure Recognition , 2012, Modeling, Learning, and Processing of Text Technological Data Structures.

[61]  Sargur N. Srihari Document Image Understanding , 1986, FJCC.

[62]  Nikos Papamarkos,et al.  A technique for fuzzy document binarization , 2001, DocEng '01.

[63]  Andrew D. Bagdanov,et al.  Evaluation of document image skew estimation techniques , 1996, Electronic Imaging.

[64]  Stefano Messelodi,et al.  Geometric Layout Analysis Techniques for Document Image Understanding: a Review , 2008 .

[65]  Dan Liu,et al.  A new approach to document analysis based on modified fractal signature , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[66]  Vinciane Lacroix,et al.  Automatic Palette Identification of Colored Graphics , 2009, GREC.

[67]  Venu Govindaraju,et al.  Character image enhancement by selective region-growing , 1996, Pattern Recognit. Lett..

[68]  Thomas M. Breuel,et al.  High Performance Document Layout Analysis , 2003 .

[69]  Seiichi Uchida,et al.  Dewarping of document image by global optimization , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[70]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[71]  Michael Randolph Garey,et al.  The complexity of the generalized Lloyd - Max problem , 1982, IEEE Trans. Inf. Theory.

[72]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[73]  K. Martin,et al.  Vector filtering for color imaging , 2005, IEEE Signal Processing Magazine.

[74]  Anil K. Jain,et al.  Page segmentation using tecture analysis , 1996, Pattern Recognit..

[75]  Henry S. Baird,et al.  Image segmentation by shape-directed covers , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[76]  Hiromichi Fujisawa,et al.  Machine Learning in Document Analysis and Recognition , 2008, Studies in Computational Intelligence.

[77]  Bodin Dresevic,et al.  Book Layout Analysis: TOC Structure Extraction Engine , 2008, INEX.

[78]  R. Smith,et al.  An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[79]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[80]  Andrew McCallum,et al.  An Introduction to Conditional Random Fields for Relational Learning , 2007 .

[81]  Yalin Wang,et al.  Document zone content classification and its performance evaluation , 2006, Pattern Recognit..

[82]  Hong Yan,et al.  Newspaper document analysis featuring connected line segmentation , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[83]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[84]  Paul A. Viola,et al.  Efficient geometric algorithms for parsing in two dimensions , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[85]  Sung-Il Chien,et al.  Improvement of Binarization Method Using a Water Flow Model for Document Images with Complex Backgrounds , 2004, PRICAI.

[86]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[87]  William H. Press,et al.  Numerical recipes in C , 2002 .

[88]  Dov Dori,et al.  From Raster to Vectors: Extracting Visual Information from Line Drawings , 1999, Pattern Analysis & Applications.

[89]  Thomas M. Breuel,et al.  Efficient implementation of local adaptive thresholding techniques using integral images , 2008, Electronic Imaging.

[90]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[91]  Xiaofan Lin Quality assurance in high volume document digitization: a survey , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[92]  Rafael Dueire Lins,et al.  A new rotation algorithm for monochromatic images , 2005, DocEng '05.

[93]  Bin Fu,et al.  A Model Based Book Dewarping Method to Handle 2D Images Captured by a Digital Camera , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[94]  Elisa H. Barney Smith,et al.  An analysis of binarization ground truthing , 2010, DAS '10.

[95]  Horng-Jinh Chang,et al.  Determination of sample size in using central limit theorem for weibull distribution , 2006 .

[96]  Yasubumi Sakakibara,et al.  RNA secondary structural alignment with conditional random fields , 2005, ECCB/JBI.

[97]  Hong Yan,et al.  Skew Correction of Document Images Using Interline Cross-Correlation , 1993, CVGIP Graph. Model. Image Process..

[98]  Seong-Whan Lee,et al.  Reference line extraction from form documents with complicated backgrounds , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[99]  Basilios Gatos,et al.  ICDAR 2003 page segmentation competition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[100]  Frank Nielsen,et al.  Statistical region merging , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[101]  O. Cuisenaire Distance transformations: fast algorithms and applications to medical image processing , 1999 .

[102]  Martial Hebert,et al.  Man-made structure detection in natural images using a causal multiscale random field , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[103]  Anil K. Jain,et al.  Learning Texture Discrimination Masks , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[104]  Stefan Eickeler,et al.  A new quality assessment and improvement system for print media , 2012, EURASIP J. Adv. Signal Process..

[105]  Sung-Bae Cho,et al.  Geometric Structure Analysis of Document Images: A Knowledge-Based Approach , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[106]  Hsi-Jian Lee,et al.  Efficiently extracting and classifying objects for analyzing color documents , 2009, Machine Vision and Applications.

[107]  Andrew McCallum,et al.  Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data , 2004, J. Mach. Learn. Res..

[108]  Anna Tonazzini,et al.  Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[109]  Joann M. Taylor,et al.  Digital Color Imaging Handbook , 2004 .

[110]  Mark D. Fairchild,et al.  Meet iCAM: A Next-Generation Color Appearance Model , 2002, Color Imaging Conference.

[111]  Syed Saqib Bukhari,et al.  Dewarping of Document Images using Coupled-Snakes , 2009 .

[112]  Nadia Bali,et al.  Automatic accurate broken character restoration for patrimonial documents , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[113]  Touraj Tajbakhsh,et al.  Semiautomatic color checker detection in distorted images , 2008 .

[114]  Marco Aiello,et al.  Combining linguistic and spatial information for document analysis , 2000, RIAO.

[115]  Abdel Belaïd,et al.  XML Data Representation in Document Image Analysis , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[116]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[117]  Henry S. Baird,et al.  Iterated Document Content Classification , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[118]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[119]  Peter Shirley,et al.  Fundamentals of computer graphics , 2018 .

[120]  Apostolos Antonacopoulos Local skew angle estimation from background space in text regions , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[121]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[122]  Mohamed Cheriet,et al.  A New Approach for Skew Correction of Documents Based on Particle Swarm Optimization , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[123]  Basilios Gatos,et al.  ICDAR2005 page segmentation competition , 2007, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[124]  Panos E. Trahanias,et al.  Directional processing of color images: theory and experimental results , 1996, IEEE Trans. Image Process..

[125]  B. Messmer Efficient graph matching algorithms , 1995 .

[126]  Apostolos Antonacopoulos,et al.  A Realistic Dataset for Performance Evaluation of Document Layout Analysis , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[127]  Apostolos Antonacopoulos,et al.  The PAGE (Page Analysis and Ground-Truth Elements) Format Framework , 2010, 2010 20th International Conference on Pattern Recognition.

[128]  Andrew McCallum,et al.  Information extraction from research papers using conditional random fields , 2006, Inf. Process. Manag..

[129]  Thomas M. Breuel,et al.  Document cleanup using page frame detection , 2008, International Journal of Document Analysis and Recognition (IJDAR).

[130]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[131]  Mahdi Nezamabadi,et al.  Color Appearance Models , 2014, J. Electronic Imaging.

[132]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[133]  Jian Lu,et al.  Signal Recovery and Noise Reduction with Wavelets , 1995 .

[134]  Hung-Ming Sun,et al.  Page segmentation for Manhattan and non-Manhattan layout documents via selective CRLA , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[135]  Henry S. Baird,et al.  Language-free layout analysis , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[136]  Esko Ukkonen,et al.  Algorithms for Approximate String Matching , 1985, Inf. Control..

[137]  J.-H. Lee,et al.  Digital color halftoning , 2005, IEEE Signal Processing Magazine.

[138]  Christian Bauckhage,et al.  The Snippet Statistics of Font Recognition , 2010, 2010 20th International Conference on Pattern Recognition.

[139]  Sandy Irani,et al.  Greedy Algorithm for Local Contrast Enhancement of Images , 2005, ICIAP.

[140]  Karim Hadjar,et al.  Newspaper page decomposition using a split and merge approach , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[141]  Thomas M. Breuel,et al.  An algorithm for finding maximal whitespace rectangles at arbitrary orientations for document layout analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[142]  Venu Govindaraju,et al.  Analysis of textual images using the Hough transform , 1989, Machine Vision and Applications.

[143]  Sargur N. Srihari,et al.  Knowledge-based derivation of document logical structure , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[144]  Donato Malerba,et al.  A knowledge-based approach to the layout analysis , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[145]  A. Lawrence Spitz,et al.  Correcting for variable skew in document images , 2003, Document Analysis and Recognition.

[146]  Matti Pietikäinen,et al.  Skew Angle Detection Using Texture Direction Analysis , 1995 .

[147]  Jingying Chen,et al.  Noisy logo recognition using line segment Hausdorff distance , 2003, Pattern Recognit..

[148]  Thomas M. Breuel,et al.  Document image zone classification - a simple high-performance approach , 2007, VISAPP.

[149]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[150]  Ioannis Pratikakis,et al.  Automatic Table Detection in Document Images , 2005, ICAPR.

[151]  Matti Pietikäinen,et al.  Page Segmentation and Zone Classification: The State of the Art , 1999 .

[152]  King-Sun Fu,et al.  Subgraph error-correcting isomorphisms for syntactic pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[153]  Adnan Amin,et al.  Automatic thresholding of gray-level using multistage approach , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[154]  Otfried Cheong,et al.  Euclidean minimum spanning trees and bichromatic closest pairs , 1990, SCG '90.

[155]  David S. Doermann,et al.  Clutter noise removal in binary document images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[156]  Song Mao,et al.  Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[157]  P. Danielsson Euclidean distance mapping , 1980 .

[158]  Azriel Rosenfeld,et al.  Document structure analysis algorithms: a literature survey , 2003, IS&T/SPIE Electronic Imaging.

[159]  S. Chen,et al.  Simultaneous Layout Style and Logical Entity Recognition in a Heterogeneous Collection of Documents , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[160]  Luc Vincent,et al.  Pink Panther: A Complete Environment For Ground-Truthing And Benchmarking Document Page Segmentation , 1998, Pattern Recognit..

[161]  K JainAnil,et al.  Document Representation and Its Application to Page Decomposition , 1998 .

[162]  Syed Saqib Bukhari,et al.  An Image Based Performance Evaluation Method for Page Dewarping Algorithms Using SIFT Features , 2011, CBDAR.

[163]  M. Orlowski,et al.  A new algorithm for the largest empty rectangle problem , 1990, Algorithmica.

[164]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..

[165]  Ruiheng Qiu,et al.  Comprehensive Global Typography Extraction System for Electronic Book Documents , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[166]  Rae-Hong Park,et al.  Document image binarization based on topographic analysis using a water flow model , 2002, Pattern Recognit..

[167]  Gerhard Paass,et al.  Exploiting Semantic Constraints for Estimating Supersenses with CRFs , 2009, SDM.

[168]  Robert M. Haralick,et al.  An automatic algorithm for text skew estimation in document images using recursive morphological transforms , 1994, Proceedings of 1st International Conference on Image Processing.

[169]  Andy M. Yip,et al.  Photometric and Geometric Restoration of Document Images Using Inpainting and Shape-from-Shading , 2007, AAAI.

[170]  B. Kapralos,et al.  I An Introduction to Digital Image Processing , 2022 .

[171]  Tin Kam Ho,et al.  Enhancing degraded document images via bitmap clustering and averaging , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[172]  Bülent Sankur,et al.  Statistical evaluation of image quality measures , 2002, J. Electronic Imaging.

[173]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[174]  George Nagy,et al.  Automated Evaluation of OCR Zoning , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[175]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[176]  Ajai Jain,et al.  The Handbook of Pattern Recognition and Computer Vision , 1993 .

[177]  Apostolos Antonacopoulos,et al.  Methodology for flexible and efficient analysis of the performance of page segmentation algorithms , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[178]  Josef Kittler,et al.  Minimum error thresholding , 1986, Pattern Recognit..

[179]  Kristen Maria Summers Near-wordless document structure classification , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[180]  Yue Lu,et al.  Improved nearest neighbor based approach to accurate document skew estimation , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[181]  Yalin Wang,et al.  Zone content classification and its performance evaluation , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[182]  Jean-Michel Jolion,et al.  Text localization, enhancement and binarization in multimedia documents , 2002, Object recognition supported by user interaction for service robots.

[183]  George Nagy,et al.  Performance metrics for document understanding systems , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[184]  Jean-Luc Meunier Automated Quality Assurance for Document Logical Analysis , 2010, 2010 20th International Conference on Pattern Recognition.

[185]  Thomas M. Breuel,et al.  Pixel-Accurate Representation and Evaluation of Page Segmentation in Document Images , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[186]  Raymond W. Smith Hybrid Page Layout Analysis via Tab-Stop Detection , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[187]  Jean-Luc Meunier,et al.  On tables of contents and how to recognize them , 2009, International Journal of Document Analysis and Recognition (IJDAR).

[188]  Reinhard Klette,et al.  Handbook of image processing operators , 1996 .

[189]  Jaime G. Carbonell,et al.  Segmentation Conditional Random Fields (SCRFs): A New Approach for Protein Fold Recognition , 2005, RECOMB.

[190]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..

[191]  K. S. Baird,et al.  Anatomy of a versatile page reader , 1992, Proc. IEEE.

[192]  Michael Gervautz,et al.  A simple method for color quantization: octree quantization , 1990 .

[193]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[194]  Gerhard Rigoll,et al.  Recognition of JPEG compressed face images based on statistical methods , 2000, Image Vis. Comput..

[195]  David S. Doermann,et al.  A model-based line detection algorithm in documents , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[196]  Sargur N. Srihari,et al.  An integrated approach to document decomposition and structural analysis , 1996, Int. J. Imaging Syst. Technol..

[197]  E. R. Davies,et al.  On the noise suppression and image enhancement characteristics of the median, truncated median and mode filters , 1988, Pattern Recognit. Lett..

[198]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[199]  Anil K. Jain,et al.  Document Representation and Its Application to Page Decomposition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[200]  Franklin C. Crow,et al.  Summed-area tables for texture mapping , 1984, SIGGRAPH.

[201]  Changsong Liu,et al.  Form frame line detection with directional single-connected chain , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[202]  Christopher R. Dance,et al.  Perspective estimation for document images , 2001, IS&T/SPIE Electronic Imaging.

[203]  Edward R. Dougherty,et al.  Enhancement and Restoration of Digital Documents: Statistical Design of Nonlinear Algorithms , 1997 .

[204]  Calvin R. Maurer,et al.  A Linear Time Algorithm for Computing Exact Euclidean Distance Transforms of Binary Images in Arbitrary Dimensions , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[205]  Leonidas J. Guibas,et al.  Primitives for the manipulation of general subdivisions and the computation of Voronoi diagrams , 1983, STOC.

[206]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[207]  Rama Chellappa,et al.  Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[208]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[209]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[210]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[211]  Paul S. Heckbert Color image quantization for frame buffer display , 1982, SIGGRAPH.

[212]  Chun-Jen Chen,et al.  A linear-time component-labeling algorithm using contour tracing technique , 2004, Comput. Vis. Image Underst..

[213]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[214]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[215]  Apostolos Antonacopoulos,et al.  Flexible Text Recovery from Degraded Typewritten Historical Documents , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[216]  David S. Doermann,et al.  Automatic Document Logo Detection , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[217]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..

[218]  Apostolos Antonacopoulos,et al.  Performance Analysis Framework for Layout Analysis Methods , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[219]  Rangachar Kasturi,et al.  Generation Of A Line Description File For Graphics Recognition , 1988, Defense, Security, and Sensing.

[220]  Stavros J. Perantonis,et al.  Automatic page analysis for the creation of a digital library from newspaper archives , 2000, International Journal on Digital Libraries.

[221]  Nils J. Nilsson,et al.  A Formal Basis for the Heuristic Determination of Minimum Cost Paths , 1968, IEEE Trans. Syst. Sci. Cybern..

[222]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[223]  Francesca Cesarini,et al.  Encoding of modified X-Y trees for document classification , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[224]  Motoi Iwata,et al.  Segmentation of Page Images Using the Area Voronoi Diagram , 1998, Comput. Vis. Image Underst..

[225]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[226]  Po-Rong Chang,et al.  Color correction for scanner and printer using B-spline CMAC neural networks , 1994, Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems.

[227]  George Nagy,et al.  HIERARCHICAL REPRESENTATION OF OPTICALLY SCANNED DOCUMENTS , 1984 .

[228]  Wencheng Wu,et al.  The CIEDE2000 color-difference formula: Implementation notes, supplementary test data, and mathematical observations , 2005 .

[229]  Andrzej Cichocki,et al.  Adaptive blind signal and image processing , 2002 .

[230]  Apostolos Antonacopoulos,et al.  Scenario Driven In-depth Performance Evaluation of Document Layout Analysis Methods , 2011, 2011 International Conference on Document Analysis and Recognition.

[231]  H. Fischer A History of the Central Limit Theorem: From Classical to Modern Probability Theory , 2010 .

[232]  Henry S. Baird,et al.  The skew angle of printed documents , 1995 .

[233]  Per-Erik Forssén,et al.  Maximally Stable Colour Regions for Recognition and Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[234]  Boris Chidlovskii,et al.  Stacked dependency networks for layout document structuring , 2008, SAC '08.

[235]  Wenyin Liu,et al.  An online composite graphics recognition approach based on matching of spatial relation graphs , 2004, Document Analysis and Recognition.

[236]  Apostolos Antonacopoulos,et al.  Ground Truth for Layout Analysis Performance Evaluation , 2006, Document Analysis Systems.

[237]  Venu Govindaraju,et al.  Document image analysis: A primer , 2002 .

[238]  Thomas M. Breuel,et al.  Example-Based Logical Labeling of Document Title Page Images , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[239]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[240]  Sahin Albayrak,et al.  Automated Ground Truth Data Generation for Newspaper Document Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[241]  Wen Gao,et al.  Thresholding technique with adaptive window selection for uneven lighting image , 2005, Pattern Recognit. Lett..

[242]  Patrick Gallinari,et al.  Relaxation Labeling for Selecting and Exploiting Efficiently Non-local Dependencies in Sequence Labeling , 2007, PKDD.

[243]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[244]  Neil A. Dodgson,et al.  Decolorize: Fast, contrast enhancing, color to grayscale conversion , 2007, Pattern Recognit..

[245]  Anna Tonazzini,et al.  Multichannel Blind Separation and Deconvolution of Images for Document Analysis , 2010, IEEE Transactions on Image Processing.

[246]  Anil K. Jain,et al.  A robust and fast skew detection algorithm for generic documents , 1996, Pattern Recognit..

[247]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[248]  Anand Rangarajan,et al.  Graph matching by graduated assignment , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[249]  Hanan Samet,et al.  A general approach to connected-component labeling for arbitrary image representations , 1992, JACM.

[250]  Anna Tonazzini,et al.  Independent component analysis for document restoration , 2004, Document Analysis and Recognition.

[251]  C. Chow,et al.  Automatic boundary detection of the left ventricle from cineangiograms. , 1972, Computers and biomedical research, an international journal.

[252]  Rafael Dueire Lins,et al.  A fast orientation and skew detection algorithm for monochromatic document images , 2005, DocEng '05.

[253]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[254]  J. A. López del Val,et al.  Principal Components Analysis , 2018, Applied Univariate, Bivariate, and Multivariate Statistics Using Python.

[255]  Kazuhito Murakami,et al.  High speed line detection by Hough transform in local area , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[256]  Christoph Seibert,et al.  Fast Seamless Skew and Orientation Detection in Document Images , 2010, 2010 20th International Conference on Pattern Recognition.

[257]  Jian Fan,et al.  Robust Color Image Enhancement of Digitized Books , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[258]  Balas K. Natarajan,et al.  Sparse Approximate Solutions to Linear Systems , 1995, SIAM J. Comput..

[259]  Hanning Zhou,et al.  Page frame segmentation for contextual advertising in print on demand books , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.