论文信息 - Find you wherever you are: geographic location and environment context-based pedestrian detection

Find you wherever you are: geographic location and environment context-based pedestrian detection

Most existing approaches to pedestrian detection only use the visual appearances as the main source in real world images. However, the visual information cannot always provide reliable guidance since pedestrians often change pose or wear different clothes under different conditions. In this work, by leveraging a vast amount of Web images, we first construct a contextual image database, in which each image is automatically attached with geographic location (i.e., latitude and longitude) and environment information (i.e., season, time and weather condition), assisted by image metadata and a few pre-trained classifiers. For the further pedestrian detection, an annotation scheme is presented which can sharply decrease manual labeling efforts. Several properties of the contextual image database are studied including whether the database is authentic and helpful for pedestrian detection. Moreover, we propose a context-based pedestrian detection approach by jointly exploring visual and contextual cues in a probabilistic model. Encouraging results are reported on our contextual image database.

[1] Mor Naaman,et al. From Where to What: Metadata Sharing for Digital Photographs with Geographic Coordinates , 2003, OTM.

[2] Dong Liu,et al. Tag ranking , 2009, WWW '09.

[3] A. Torralba,et al. The role of context in object recognition , 2007, Trends in Cognitive Sciences.

[4] John R. Smith,et al. Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[5] Tao Mei,et al. CrowdReranking: exploring multiple search engines for visual search reranking , 2009, SIGIR.

[6] Alexei A. Efros,et al. What Does the Sky Tell Us about the Camera? , 2008, ECCV.

[7] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[8] Martial Hebert,et al. A hierarchical field framework for unified context-based classification , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9] Alexei A. Efros,et al. An empirical study of context in object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10] David Gerónimo Gómez,et al. Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Mor Naaman,et al. How flickr helps us make sense of the world: context and content in community-contributed media collections , 2007, ACM Multimedia.

[12] Serge J. Belongie,et al. Context based object categorization: A critical survey , 2010, Comput. Vis. Image Underst..

[13] Alexei A. Efros,et al. IM2GPS: estimating geographic information from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15] Jitendra Malik,et al. Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[16] Dariu Gavrila,et al. An Experimental Study on Pedestrian Classification , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Meng Wang,et al. Typicality-Based Visual Search Reranking , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[18] Chong-Wah Ngo,et al. Context-based friend suggestion in online photo-sharing community , 2011, ACM Multimedia.

[19] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[20] Antonio Torralba,et al. Object Recognition by Scene Alignment , 2007, NIPS.

[21] Chong-Wah Ngo,et al. Co-reranking by mutual reinforcement for image search , 2010, CIVR '10.

[22] Jiebo Luo,et al. Leveraging probabilistic season and location context models for scene understanding , 2008, CIVR '08.

[23] B. Schiele,et al. Multi-cue onboard pedestrian detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.