Automatic Document Layout Analysis through Relational Machine Learning

The current spread of digital documents raised the need of effective content-based retrieval techniques. Since manual indexing is infeasible and subjective, automatic techniques are the obvious solution. In particular, the ability of properly identifying and understanding a document’s structure is crucial, in order to focus on the most significant components only. At a geometrical level, this task is known as Layout Analysis, and thoroughly studied in the literature. On suitable descriptions of the document layout, Machine Learning techniques can be applied to automatically infer models of classes of documents and of their components. Indeed, organizing the documents on the grounds of the knowledge they contain is fundamental for being able to correctly access them according to the user’s needs.

[1]  Daniel P. Lopresti,et al.  Document Analysis Systems V , 2002, Lecture Notes in Computer Science.

[2]  Luc De Raedt,et al.  Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[3]  Sharad C. Seth,et al.  A trainable, single-pass algorithm for column segmentation , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[4]  Sargur N. Srihari,et al.  Classification of newspaper image blocks using texture analysis , 1989, Comput. Vis. Graph. Image Process..

[5]  Kevin Laven,et al.  A statistical learning approach to document image analysis , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[6]  Yupin Luo,et al.  A new component based algorithm for newspaper layout analysis , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[7]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..

[8]  Donato Malerba,et al.  A Logic Framework for the Incremental Inductive Synthesis of Datalog Theories , 1997, LOPSTR.

[9]  Max J. Egenhofer,et al.  Advances in Spatial Databases , 1997, Lecture Notes in Computer Science.

[10]  Thomas M. Breuel,et al.  Two Geometric Algorithms for Layout Analysis , 2002, Document Analysis Systems.

[11]  Max J. Egenhofer,et al.  Reasoning about Binary Topological Relations , 1991, SSD.

[12]  Ming Chen,et al.  Analysis, understanding and representation of Chinese newspaper with complex layout , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[13]  Seong-Whan Lee,et al.  Parameter-Free Geometric Document Layout Analysis , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Stefano Ferilli,et al.  Machine Learning for Digital Document Processing: from Layout Analysis to Metadata Extraction , 2008, Machine Learning in Document Analysis and Recognition.

[15]  Michelangelo Ceci,et al.  Correcting the document layout: a machine learning approach , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[16]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  A. Peter Johnson,et al.  A Fast Algorithm for Bottom-Up Document Layout Analysis , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Mahesh Viswanathan,et al.  A prototype document image analysis system for technical journals , 1992, Computer.

[19]  C. Viard-Gaudin,et al.  A background based adaptive page segmentation algorithm , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[20]  Masayuki Okamoto,et al.  A hybrid page segmentation method , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[21]  Dan Liu,et al.  A new approach to document analysis based on modified fractal signature , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[22]  Ryszard S. Michalski Inferential Theory of Learning and Inductive Databases , 2003 .

[23]  Alan Bundy,et al.  Logic Program Synthesis via Proof Planning , 1992, LOPSTR.

[24]  Rama Chellappa,et al.  Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Anil K. Jain,et al.  Page segmentation using tecture analysis , 1996, Pattern Recognit..

[26]  Koichi Kise,et al.  Page segmentation based on thinning of background , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[27]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[28]  Ching Y. Suen,et al.  Chinese document layout analysis based on adaptive split-and-merge and qualitative spatial reasoning , 1997, Pattern Recognit..

[29]  Shin-Ywan Wang,et al.  Block selection: a method for segmenting a page image of various editing styles , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[30]  Ryszard S. Michalski,et al.  Inferential Theory of Learning: Developing Foundations for Multistrategy Learning , 1992 .

[31]  Jianming Hu,et al.  Page segmentation of Chinese newspapers , 2002, Pattern Recognit..

[32]  Dimitris Papadias,et al.  Spatial Relations, Minimum Bounding Rectangles, and Spatial Data Structures , 1997, Int. J. Geogr. Inf. Sci..

[33]  George Nagy,et al.  HIERARCHICAL REPRESENTATION OF OPTICALLY SCANNED DOCUMENTS , 1984 .

[34]  Chien-Hsing Chou,et al.  A machine-learning approach for analyzing document layout structures with two reading orders , 2008, Pattern Recognit..

[35]  Hiromichi Fujisawa,et al.  Machine Learning in Document Analysis and Recognition , 2008, Studies in Computational Intelligence.

[36]  Anil K. Jain,et al.  Text segmentation using gabor filters for automatic document processing , 1992, Machine Vision and Applications.

[37]  Matti Pietikäinen,et al.  Page segmentation and classification using fast feature extraction and connectivity analysis , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[38]  Mahesh Viswanathan,et al.  Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Andreas Dengel,et al.  Computer understanding of document structure , 1996 .

[40]  Henry S. Baird,et al.  Language-free layout analysis , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[41]  Nicola Fanizzi,et al.  Incremental multistrategy learning for document processing , 2003, Appl. Artif. Intell..

[42]  Frank Y. Shih,et al.  Adaptive document block segmentation and classification , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[43]  Jean-Daniel Zucker,et al.  Semantic Abstraction for Concept Representation and Learning , 2001 .

[44]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..