Paper information

Recognizing Scene Categories of Historical Postcards

Recognizing visual scene categories is a challenging problem in computer vision, with applications such as organizing and tagging private or public photo collections. While most approaches focus on web image collections, some of the largest unorganized image collections consist of historical images held by archives and museums. This paper considers the problem of recognizing categories in historical images. More specifically, it presents a new dataset that addresses the analysis of a challenging collection of postcards from the period of World War I, delivered by the German military postal service. Categorizing these postcards is of great interest to historians, who seek insights into society during these years. For computer vision research, the postcards pose various new challenges, such as heavy degradation, varying visual domains (sketches, photographs, or colorized images), and incorrect orientations caused by an image-in-image problem. The incorrect orientation is addressed by a pre-processing step that classifies each image as portrait or landscape. To cope with the different visual domains, an ensemble is used that incorporates global feature representations and features derived from detection results. Experiments on a development set and a large unexplored test set show that the proposed methods improve recognition on the historical postcards compared to a Bag-of-Features based scene categorization.
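The abstract does not state how the ensemble members are combined, so the following is only a minimal sketch of one common option, late fusion by weighted averaging of per-category scores. The function names, category labels, and score values are all hypothetical and are not taken from the paper; one member stands in for a classifier on global features, the other for a classifier on detection-derived features.

```python
from typing import Dict, List, Optional

# Per-category scores produced by one ensemble member (hypothetical format).
Scores = Dict[str, float]


def fuse_scores(member_scores: List[Scores],
                weights: Optional[List[float]] = None) -> Scores:
    """Weighted average of per-category scores across ensemble members."""
    if not member_scores:
        raise ValueError("need at least one ensemble member")
    if weights is None:
        weights = [1.0] * len(member_scores)  # equal weighting by default
    if len(weights) != len(member_scores):
        raise ValueError("one weight per ensemble member is required")
    total = sum(weights)
    categories = member_scores[0].keys()
    return {
        c: sum(w * s[c] for w, s in zip(weights, member_scores)) / total
        for c in categories
    }


def predict_category(member_scores: List[Scores],
                     weights: Optional[List[float]] = None) -> str:
    """Return the category with the highest fused score."""
    fused = fuse_scores(member_scores, weights)
    return max(fused, key=fused.get)


# Hypothetical scores: one member uses global features, the other uses
# features derived from detection results (assumed, not from the paper).
global_scores = {"building": 0.5, "landscape": 0.3, "people": 0.2}
detection_scores = {"building": 0.1, "landscape": 0.1, "people": 0.8}

print(predict_category([global_scores, detection_scores]))
```

Passing `weights` lets one member count more than the other, for example when one representation is known to be more reliable on a given visual domain; with equal weights the sketch reduces to a plain average.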

Gernot A. Fink | René Grzeszick
