Thick 2D relations for document understanding

We use a propositional language of qualitative rectangle relations to detect the reading order from document images. To this end, we define the notion of a document encoding rule and analyze possible formalisms for expressing such rules, such as LaTeX and SGML. Document encoding rules expressed in the propositional language of rectangles are then used to build a reading order detector for document images. To achieve robustness and avoid brittleness when applying the system to real-life document images, we introduce the notion of a thick boundary interpretation for a qualitative relation. The framework is tested on a collection of heterogeneous document images, showing recall rates of up to 89%.
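To make the idea of a thick boundary interpretation concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a qualitative rectangle relation is a pair of Allen-style interval relations, one per axis, and that "thick" means two boundary coordinates count as coincident when they differ by at most a tolerance `t`. The names `thick_allen`, `rect_relation`, and `t` are hypothetical, and the sketch further assumes `t` is small compared with the side lengths of the rectangles.

```python
from typing import Tuple

Interval = Tuple[float, float]           # (start, end), with start < end
Rect = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def _cmp(a: float, b: float, t: float) -> int:
    """Thick comparison: 0 if a and b lie within t of each other, else -1 or 1."""
    if abs(a - b) <= t:
        return 0
    return -1 if a < b else 1


def thick_allen(a: Interval, b: Interval, t: float = 0.0) -> str:
    """Allen-style relation between intervals a and b with boundary tolerance t.

    With t == 0 this is the usual crisp interpretation.
    """
    a1, a2 = a
    b1, b2 = b
    end_start = _cmp(a2, b1, t)  # end of a vs. start of b
    start_end = _cmp(a1, b2, t)  # start of a vs. end of b
    if end_start < 0:
        return "before"
    if end_start == 0:
        return "meets"
    if start_end > 0:
        return "after"
    if start_end == 0:
        return "met_by"
    # From here on the two intervals share an inner region.
    starts = _cmp(a1, b1, t)
    ends = _cmp(a2, b2, t)
    if starts == 0 and ends == 0:
        return "equals"
    if starts == 0:
        return "starts" if ends < 0 else "started_by"
    if ends == 0:
        return "finishes" if starts > 0 else "finished_by"
    if starts < 0 and ends < 0:
        return "overlaps"
    if starts > 0 and ends > 0:
        return "overlapped_by"
    if starts > 0 and ends < 0:
        return "during"
    return "contains"


def rect_relation(r: Rect, s: Rect, t: float = 0.0) -> Tuple[str, str]:
    """Rectangle relation as the pair (relation on x axis, relation on y axis)."""
    return (
        thick_allen((r[0], r[2]), (s[0], s[2]), t),
        thick_allen((r[1], r[3]), (s[1], s[3]), t),
    )


# Two text blocks stacked in one column, with slightly noisy bounding boxes.
upper = (0, 0, 100, 50)
lower = (1, 52, 99, 100)
crisp = rect_relation(upper, lower, t=0)
thick = rect_relation(upper, lower, t=3)
```

Under the crisp interpretation the two blocks are related by `("contains", "before")`, while a tolerance of 3 units yields `("equals", "meets")`. The second reading is the one a rule such as "a block directly below another block of the same width is read next" would need, which illustrates why small segmentation errors make crisp boundaries brittle.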
