Utilisation de la couleur pour l'extraction de tableaux dans des images de documents

Tables are complex elements that can disturb the automatic analysis of the structure of an image of a document. In this article, we present a method based on the alternation of the color of lines to extract color tables that are not materialized by physical rulings. Experimental results, obtained on a dataset of document images with various layouts, enable to validate the interest of this approach. MOTS-CLES : Analyse d'images de documents, extraction de tableaux, detection de couleurs dominantes, segmentation d'images, croissance de regions.

[1]  Thomas Kieninger,et al.  Applying the T-Recs table recognition system to the business letter domain , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[2]  Alexandra Branzan Albu,et al.  Texture sparseness for pixel classification of business document images , 2014, International Journal on Document Analysis and Recognition (IJDAR).

[3]  Jean-Yves Ramel,et al.  Detection, extraction and representation of tables , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[4]  Ioannis Pratikakis,et al.  Automatic Table Detection in Document Images , 2005, ICAPR.

[5]  Faisal Shafait,et al.  Table detection in heterogeneous documents , 2010, DAS '10.

[6]  Yalin Wang,et al.  Document zone content classification and its performance evaluation , 2006, Pattern Recognit..

[7]  Sekhar Mandal,et al.  A simple and effective table detection system from document images , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[8]  Francesca Cesarini,et al.  Trainable Table Location in Document Images , 2002, ICPR.