Picture extraction from digitized historical manuscripts

In this work we propose a system for automatic document segmentation to extract graphical elements from historical manuscripts and then to identify significant pictures from them, removing floral and abstract decorations. The system performs a block based analysis by means of color and texture features. The Gradient Spatial Dependency Matrix, a new texture operator particularly effective for this task, is proposed. The feature vectors are processed by an embedding procedure which allows increased performance in later SVM classification. Results for both feature extraction and embedding based classification are reported, supporting the effectiveness of the proposal.

[1]  Thierry Paquet,et al.  Document Image Segmentation Using a 2D Conditional Random Field Model , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[2]  Rita Cucchiara,et al.  Enhancing HSV histograms with achromatic points detection for video retrieval , 2007, CIVR '07.

[3]  Sergios Theodoridis,et al.  Keyword-guided word spotting in historical printed documents using synthetic data and user feedback , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[4]  Paolo Frasconi,et al.  Hidden Tree Markov Models for Document Image Classification , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Jean-Yves Ramel,et al.  Document image characterization using a multiresolution analysis of the texture: application to old documents , 2008, International Journal of Document Analysis and Recognition (IJDAR).

[6]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[7]  Rita Cucchiara,et al.  Describing texture directions with Von Mises distributions , 2008, 2008 19th International Conference on Pattern Recognition.

[8]  Véronique Eglin,et al.  Document images analysis solutions for digital libraries , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[9]  Ergina Kavallieratou A binarization algorithm specialized on document images and photos , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[10]  Nanning Zheng,et al.  Document Images Retrieval Based on Multiple Features Combination , 2007 .

[11]  Jean-Yves Ramel,et al.  AGORA: the interactive document image analysis tool of the BVH project , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[12]  Simone Calderara,et al.  "Inside the bible": segmentation, annotation and retrieval for a new browsing experience , 2008, MIR '08.

[13]  Hanan Samet,et al.  Properties of Embedding Methods for Similarity Searching in Metric Spaces , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  H. Gabriela,et al.  Cluster-preserving Embedding of Proteins , 1999 .

[15]  Kaizhong Zhang,et al.  An Index Structure for Data Mining and Clustering , 2000, Knowledge and Information Systems.

[16]  Frank Lebourgeois,et al.  DEBORA: Digital AccEss to BOoks of the RenAissance , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[17]  Dorothea Blostein,et al.  A survey of document image classification: problem statement, classifier architecture and performance evaluation , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[18]  Jianying Hu,et al.  Document classification using layout analysis , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[19]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[20]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[21]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[22]  J. Ogier,et al.  Madonne: Document Image Analysis Techniques for Cultural Heritage Documents , 2006 .

[23]  George Nagy,et al.  Twenty Years of Document Image Analysis in PAMI , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  J. Bourgain On lipschitz embedding of finite metric spaces in Hilbert space , 1985 .

[25]  Ching Y. Suen,et al.  Content analysis in document images: a scale space approach , 2002, Object recognition supported by user interaction for service robots.

[26]  Kinji Ono,et al.  Digital bleaching and content extraction for the digital archive of rare books , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).