Bidimensional relations for reading order detection

We use a propositional language of qualitative rectangle relations to detect the reading order from document images. Document encoding rules are introduced and, expressed in the propositional language of rectangles, are used to build a reading order detector for document images. Results of testing the framework on a collection of heterogeneous document images are reviewed.

[1]  Ruari McLean,et al.  The Thames and Hudson Manual of Typography , 1980 .

[2]  Thomas Kieninger,et al.  Document Structure Analysis Based on Layout and Textual Features , 2000 .

[3]  Hanno Walischewski,et al.  Automatic knowledge acquisition for spatial document interpretation , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[4]  Donato Malerba,et al.  Transforming paper documents into XML format with WISDOM++ , 2001, International Journal on Document Analysis and Recognition.

[5]  Francesca Cesarini,et al.  A two level knowledge approach for understanding documents of a multi-class domain , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[6]  Thomas Kieninger,et al.  Rule-based document structure understanding with a fuzzy combination of layout and textual features , 2001, International Journal on Document Analysis and Recognition.

[7]  Marco Aiello,et al.  Thick 2D relations for document understanding , 2004, Inf. Sci..

[8]  Marco Aiello,et al.  Document understanding for a broad class of documents , 2002, Int. J. Document Anal. Recognit..

[9]  Donato Malerba,et al.  Machine Learning for Intelligent Processing of Printed Documents , 2000, Journal of Intelligent Information Systems.

[10]  Sargur N. Srihari,et al.  Using domain knowledge to derive the logical structure of documents , 1996, Electronic Imaging.

[11]  Haruo Asada,et al.  Major components of a complete text reading system , 1992 .

[12]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.