Paper information - CNRS TELECOM ParisTech at ImageCLEF 2015 Scalable Concept Image Annotation Task: Concept Detection with Blind Localization Proposals

We present our participation in the ImageCLEF 2015 scalable concept detection and localization task. This edition focuses not only on generating annotations (concept detections) but also on localizing concepts in a large image collection. The concept detection part of our runs is based on standard nonlinear support vector machines (SVMs). The localization part is blind and based on a priori learned statistics that generate multiple localization proposals. Despite its blindness, the performance of this concept localization framework is promising.

Hichem Sahbi
