论文信息 - A scalable service for photo annotation, sharing, and search

A scalable service for photo annotation, sharing, and search

In this work we present the details of the implementation of Fotofiti(FF), a website that provides automatic semantic annotation of digital photographs, event management and social network integration. We describe our technique for real-time online semantic annotation using global features from both content and context. Classification experiments using various learning techniques were performed on a realworld data-set. Additionally, a scalable landmark recognition system which utilizes local features is discussed.

[1] Nikos Karampatziakis,et al. Probabilistic Outputs for SVMs and Comparisons to Regularized Likelihood Methods , 2007 .

[2] E. Y. Chang,et al. Toward perception-based image retrieval , 2000, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries.

[3] Piotr Indyk,et al. Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[4] Qiang Yang,et al. A unified framework for semantics and feature based relevance feedback in image retrieval systems , 2000, ACM Multimedia.

[5] Marc Gelgon,et al. Organizing a personal image collection with statistical model-based ICL clustering on spatio-temporal camera phone meta-data , 2004, Journal of Visual Communication and Image Representation.

[6] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[7] Jiebo Luo,et al. Beyond pixels: Exploiting camera metadata for photo classification , 2005, Pattern Recognit..

[8] Dingxing Wang,et al. Boosting image classification with LDA-based feature combination for digital photograph management , 2005, Pattern Recognit..

[9] Anil K. Jain,et al. Content-based hierarchical classification of vacation images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[10] Edward Y. Chang,et al. Multimodal metadata fusion using causal strength , 2005, ACM Multimedia.

[11] Leonidas J. Guibas,et al. The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[12] Edward Y. Chang,et al. PBIR-MM: multimodal image retrieval and annotation , 2002, MULTIMEDIA '02.

[13] Alexander G. Hauptmann,et al. Text, Speech, and Vision for Video Segmentation: The InformediaTM Project , 1995 .