The Delaunay Document Layout Descriptor

Security applications related to document authentication require an exact match between an authentic copy and the original of a document. This implies that the documents analysis algorithms that are used to compare two documents (original and copy) should provide the same output. This kind of algorithm includes the computation of layout descriptors from the segmentation result, as the layout of a document is a part of its semantic content. To this end, this paper presents a new layout descriptor that significantly improves the state of the art. The basic of this descriptor is the use of a Delaunay triangulation of the centroids of the document regions. This triangulation is seen as a graph and the adjacency matrix of the graph forms the descriptor. While most layout descriptors have a stability of 0% with regard to an exact match, our descriptor has a stability of 74% which can be brought up to 100% with the use of an appropriate matching algorithm. It also achieves 100% accuracy and retrieval in a document retrieval scheme on a database of 960 document images. Furthermore, this descriptor is extremely efficient as it performs a search in constant time with respect to the size of the document database and it reduces the size of the index of the database by a factor 400.

[1]  Quynh H. Dang,et al.  Secure Hash Standard | NIST , 2015 .

[2]  Ernest Valveny,et al.  A Rotation Invariant Page Layout Descriptor for Document Classification and Retrieval , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[3]  Ronald L. Rivest,et al.  The MD5 Message-Digest Algorithm , 1992, RFC.

[4]  Bertrand Coüasnon DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[5]  Masakazu Iwamura,et al.  Use of Affine Invariants in Locally Likely Arrangement Hashing for Camera-Based Document Image Retrieval , 2006, Document Analysis Systems.

[6]  Donato Malerba,et al.  Multistrategy Learning for Document Recognition , 1994, Appl. Artif. Intell..

[7]  Geoff Leach,et al.  Improving Worst-Case Optimal Delaunay Triangulation Algorithms , 1992 .

[8]  Bidyut B. Chaudhuri Digital Document Processing , 2007 .

[9]  Richard Zanibbi,et al.  A shape-based layout descriptor for classifying spatial relationships in handwritten math , 2013, ACM Symposium on Document Engineering.

[10]  Akio Yamada,et al.  The MPEG-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[11]  B. S. Manjunath,et al.  Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  David S. Doermann,et al.  Page classification through logical labelling , 2002, Object recognition supported by user interaction for service robots.

[13]  B. B. Chaudhuri Digital document processing : major directions and recent advances , 2006 .

[14]  Kai Chen,et al.  Hybrid Page Segmentation with Efficient Whitespace Rectangles Extraction and Grouping , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[15]  Francesca Cesarini,et al.  Encoding of modified X-Y trees for document classification , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[16]  Motoi Iwata,et al.  Segmentation of Page Images Using the Area Voronoi Diagram , 1998, Comput. Vis. Image Underst..

[17]  Marcel Worring,et al.  First order Gaussian graphs for efficient structure classification , 2003, Pattern Recognit..

[18]  Apostolos Antonacopoulos,et al.  A Realistic Dataset for Performance Evaluation of Document Layout Analysis , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[19]  Robert M. Haralick,et al.  Global and local document degradation models , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).