Semantic analysis and retrieval in personal and social photo collections

Semantic understanding of images has been an important topic in the research community for a long time as it is an important prerequisite to build meaningful retrieval systems which are accessible by both users and automatic reasoning algorithms. Recently, especially with the growing trend to share photos online, the social aspect of image retrieval becomes more and more prevalent and image retrieval more and more focusses specifically on photos and their special characteristics, especially on information outside the image itself. Researchers are starting to explore how and why photos are shot, shared and used and try to incorporate this additional knowledge to aid image analysis and retrieval. Several survey papers have been written in the past reviewing works in the general field of image analysis and retrieval. However, the social aspect of image retrieval and the focus on digital photos has not sufficiently been addressed in these works. In this article we give an overview over the current research field of semantic photo understanding, annotation and retrieval. We review over 160 contributions in the field and identify trending topics and implications for future directions of research.

[1]  Ebroul Izquierdo,et al.  Semantic structuring and retrieval of event chapters in social photo collections , 2010, MIR '10.

[2]  Rik Van de Walle,et al.  The MPEG-21 Book , 2006 .

[3]  R. Chalfen Snapshot versions of life , 1987 .

[4]  J. C. Platt AutoAlbum: clustering digital photographs using probabilistic model merging , 2000, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries.

[5]  Mor Naaman,et al.  Generating summaries for large collections of geo-referenced photographs , 2006, WWW '06.

[6]  Jiebo Luo,et al.  Photo classification by integrating image content and camera metadata , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[7]  HongJiang Zhang,et al.  Detecting image orientation based on low-level visual content , 2004, Comput. Vis. Image Underst..

[8]  Alan C. Bovik,et al.  No-reference quality assessment using natural scene statistics: JPEG2000 , 2005, IEEE Transactions on Image Processing.

[9]  Reiner Fageth,et al.  Employing a photo's life cycle for multimedia retrieval , 2008, MS '08.

[10]  Mor Naaman,et al.  Requirements for mobile photoware , 2010, Personal and Ubiquitous Computing.

[11]  Yanfeng Sun,et al.  MiAlbum - a system for home photo managemet using the semi-automatic image annotation approach , 2000, MM 2000.

[12]  Paul Clough,et al.  Creating a test collection to evaluate diversity in image retrieval , 2008, SIGIR 2008.

[13]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[14]  Dragutin Petkovic,et al.  Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review , 1996 .

[15]  Jiebo Luo,et al.  Beyond pixels: Exploiting camera metadata for photo classification , 2005, Pattern Recognit..

[16]  Stefan M. Rüger,et al.  Automated Image Annotation Using Global Features and Robust Nonparametric Density Estimation , 2005, CIVR.

[17]  Nenghai Yu,et al.  Annotating personal albums via web mining , 2008, ACM Multimedia.

[18]  Marcel Worring,et al.  Benchmarking image and video retrieval: an overview , 2006, MIR '06.

[19]  Susanne Boll,et al.  Emergent Semantics in Personalized Multimedia Content , 2007, J. Digit. Inf. Manag..

[20]  Y. Tsymbalenko,et al.  Using HTML Metadata to Find Relevant Images on the World Wide Web , 2001 .

[21]  James Ze Wang,et al.  Studying Aesthetics in Photographic Images Using a Computational Approach , 2006, ECCV.

[22]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[23]  Djemel Ziou,et al.  Image Retrieval from the World Wide Web: Issues, Techniques, and Systems , 2004, CSUR.

[24]  John L. Arnott,et al.  Interface metaphor design and instant messaging for older adults , 2008, CHI Extended Abstracts.

[25]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[26]  James Ze Wang,et al.  Algorithmic inferencing of aesthetics and emotion in natural images: An exposition , 2008, 2008 15th IEEE International Conference on Image Processing.

[27]  John Adcock,et al.  Simplifying the Management of Large Photo Collections , 2003, INTERACT.

[28]  Xin Li,et al.  Blind image quality assessment , 2002, Proceedings. International Conference on Image Processing.

[29]  Randal E. Bryant,et al.  Data-Intensive Supercomputing: The case for DISC , 2007 .

[30]  Andreas E. Savakis,et al.  Automatic image event segmentation and quality screening for albuming applications , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[31]  R. Manmatha,et al.  A Model for Learning the Semantics of Pictures , 2003, NIPS.

[32]  Rob Procter,et al.  Supporting informality: team working and integrated care records , 2004, CSCW.

[33]  Jonathon S. Hare,et al.  Semantic spaces revisited: investigating the performance of auto-annotation and semantic retrieval using semantic spaces , 2008, CIVR '08.

[34]  Daniel Tretter,et al.  Consumer image retrieval by estimating relation tree from family photo collections , 2010, CIVR '10.

[35]  R. Doyle The American terrorist. , 2001, Scientific American.

[36]  James Ze Wang,et al.  Learning the consensus on visual quality for next-generation image management , 2007, ACM Multimedia.

[37]  Bernhard Schölkopf,et al.  Advances in Neural Information Processing Systems 16: Proceedings of the 2003 Conference , 2004, NIPS 2004.

[38]  Ramesh C. Jain,et al.  Towards an ecosystem for semantics , 2007, MS '07.

[39]  Edoardo Ardizzone,et al.  Mean shift clustering for personal photo album organization , 2008, 2008 15th IEEE International Conference on Image Processing.

[40]  John Tait,et al.  Browsing Personal Images Using Episodic Memory (Time + Location) , 2006, ECIR.

[41]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[42]  Andreas Girgensohn,et al.  Temporal event clustering for digital photo collections , 2003, ACM Multimedia.

[43]  S LewMichael,et al.  Content-based multimedia information retrieval , 2006 .

[44]  Clement T. Yu,et al.  Diogenes: a web search agent for person images , 2000, ACM Multimedia.

[45]  Thomas Hofmann,et al.  Map-Reduce for Machine Learning on Multicore , 2007 .

[46]  Mathias Lux,et al.  Lire: lucene image retrieval: an extensible java CBIR library , 2008, ACM Multimedia.

[47]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[48]  Sharad Mehrotra,et al.  WebMARS: a multimedia search engine , 1999, Electronic Imaging.

[49]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[50]  Susanne Boll,et al.  From usage to annotation: analysis of personal photo albums for semantic photo understanding , 2009, WSM '09.

[51]  Paul S. Fisher,et al.  Image quality measures and their performance , 1995, IEEE Trans. Commun..

[52]  Nick Reid,et al.  Photo LOI: browsing multi-user photo collections , 2005, MULTIMEDIA '05.

[53]  Pat Mohlot Art and architecture thesaurus (abstract only) , 1981, CHI '81.

[54]  Oded Nov,et al.  Motivational, Structural and Tenure Factors that Impact Online Community Photo Sharing , 2009, ICWSM.

[55]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  Peng Wu,et al.  Close & closer: social cluster and closeness from photo collections , 2009, MM '09.

[57]  Angelo Chianese,et al.  Managing Uncertainties in Image Databases: A Fuzzy Approach , 2004, Multimedia Tools and Applications.

[58]  Thierry Pun,et al.  Performance evaluation in content-based image retrieval: overview and proposals , 2001, Pattern Recognit. Lett..

[59]  Jonathon S. Hare,et al.  Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and Bottom-up approaches , 2006 .

[60]  R. Manmatha,et al.  Automatic Image Annotation and Retrieval using CrossMedia Relevance Models , 2003 .

[61]  Yan Ke,et al.  The Design of High-Level Features for Photo Quality Assessment , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[62]  Mor Naaman,et al.  Generating diverse and representative image search results for landmarks , 2008, WWW.

[63]  Simone Santini,et al.  Emergent Semantics through Interaction in Image Databases , 2001, IEEE Trans. Knowl. Data Eng..

[64]  Michael J. Swain,et al.  WebSeer: An Image Search Engine for the World Wide Web , 1996 .

[65]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[66]  Ethan V. Munson To Search for Images on the Web , Look at the Text , Then Look at the Images , 2001 .

[67]  Marc Gelgon,et al.  Building and tracking hierarchical geographical & temporal partitions for image collection management on mobile devices , 2005, MULTIMEDIA '05.

[68]  Andreas Paepcke,et al.  Time as essence for photo browsing through personal digital libraries , 2002, JCDL '02.

[69]  Alex S. Taylor,et al.  Collocated social practices surrounding photos , 2008, CHI Extended Abstracts.

[70]  Alan F. Smeaton,et al.  Mobile access to personal digital photograph archives , 2005, Mobile HCI.

[71]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[72]  Marc Davis,et al.  The Social Uses of Personal Photography : Methods for Projecting Future Imaging Applications , 2004 .

[73]  Y. Mori,et al.  Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[74]  David M. Shotton,et al.  Building a Semantic Web Image Repository for Biological Research Images , 2008, ESWC.

[75]  Thierry Pun,et al.  A Framework for Benchmarking in CBIR , 2003, Multimedia Tools and Applications.

[76]  John Adcock,et al.  Leveraging face recognition technology to find and organize photos , 2004, MIR '04.

[77]  Jiejun Xu,et al.  Multimodal photo annotation and retrieval on a mobile phone , 2008, MIR '08.

[78]  Mohan S. Kankanhalli,et al.  SmartAlbum: a multi-modal photo annotation system , 2002, MULTIMEDIA '02.

[79]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[80]  Thierry Pun,et al.  The Truth about Corel - Evaluation in Image Retrieval , 2002, CIVR.

[81]  Kobus Barnard,et al.  Evaluating image retrieval , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[82]  Rongrong Ji,et al.  Photo assessment based on computational visual attention model , 2009, ACM Multimedia.

[83]  Antoine Pigeau MyOwnLife: incremental and hierarchical classification of a personal image collection on mobile devices , 2009, Multimedia Tools and Applications.

[84]  Daniel Tretter,et al.  Managing and searching personal photo collections , 2003, IS&T/SPIE Electronic Imaging.

[85]  James A. Hendler,et al.  A Flexible Approach for Managing Digital Images on the Semantic Web , 2005, SemAnnot@ISWC.

[86]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[87]  Ahmet M. Eskicioglu,et al.  Quality measurement for monochrome compressed images in the past 25 years , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[88]  Alex S. Taylor,et al.  Editorial: Collocated social practices surrounding photos , 2009 .

[89]  Nigel Shadbolt,et al.  Image annotation with Photocopain , 2006 .

[90]  Jérôme Gensel,et al.  PhotoMap - Automatic Spatiotemporal Annotation for Mobile Photos , 2007, W2GIS.

[91]  Mohan S. Kankanhalli,et al.  Using Camera Settings Templates ("Scene Modes") for Image Scene Classification of Photographs Taken on Manual/Expert Settings , 2007, PCM.

[92]  Mathias Lux,et al.  Caliph & Emir: MPEG-7 photo annotation and retrieval , 2009, ACM Multimedia.

[93]  Clement Yu,et al.  Diogenes: A Web Search Agent for Content Based Indexing of Personal Images , 2000, SIGIR 2000.

[94]  Marc Davis,et al.  The uses of personal networked digital imaging: an empirical study of cameraphone photos and sharing , 2005, CHI Extended Abstracts.

[95]  Frank Bentley,et al.  Personal vs. commercial content: the similarities between consumer use of photos and music , 2006, CHI.

[96]  Janko Calic,et al.  FreeEye: interactive intuitive interface for large-scale image browsing , 2009, MM '09.

[97]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[98]  Marco La Cascia,et al.  Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web , 1999, Comput. Vis. Image Underst..

[99]  Xing Xie,et al.  Photo-to-search: using multimodal queries to search the web from mobile devices , 2005, MIR '05.

[100]  Fergal Monaghan,et al.  Leveraging Ontologies, Context and Social Networks to Automate Photo Annotation , 2007, SAMT.

[101]  Lynda Hardman,et al.  Canonical processes of semantically annotated media production , 2008, Multimedia Systems.

[102]  Susanne Boll,et al.  Processes of photo book production , 2008, Multimedia Systems.

[103]  Alan F. Smeaton,et al.  My digital photos: where and when? , 2005, MULTIMEDIA '05.

[104]  Mor Naaman,et al.  Adventures in Space and Time: Browsing Personal Collections of Geo-Referenced Digital Photographs , 2004 .

[105]  Geoffrey C. Bowker,et al.  Work and infrastructure , 1995, CACM.

[106]  Kun Li,et al.  iScope: personalized multi-modality image search for mobile devices , 2009, MobiSys '09.

[107]  Michael S. Lew Next-Generation Web Searches for Visual Content , 2000, Computer.

[108]  Ramesh C. Jain,et al.  Classification and annotation of digital photos using optical context data , 2008, CIVR '08.

[109]  Cláudio de Souza Baptista,et al.  PhotoGeo: A Self-Organizing System for Personal Photo Collections , 2008, 2008 Tenth IEEE International Symposium on Multimedia.

[110]  Nuria Oliver,et al.  The role of tags and image aesthetics in social image search , 2009, WSM '09.

[111]  Kerry Rodden,et al.  How do people manage their digital photographs? , 2003, CHI '03.

[112]  Tao Mei,et al.  Probabilistic Multimodality Fusion for Event based Home Photo Clustering , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[113]  Zhou Wang,et al.  Why is image quality assessment so difficult? , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[114]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[115]  Takahiro Hara,et al.  Image classification for mobile web browsing , 2006, WWW '06.

[116]  Jiebo Luo,et al.  Pictures Are Not Taken in a Vacuum , 2006 .

[117]  Mor Naaman,et al.  How flickr helps us make sense of the world: context and content in community-contributed media collections , 2007, ACM Multimedia.

[118]  King-Sun Fu,et al.  Query-by-Pictorial-Example , 1980, IEEE Trans. Software Eng..

[119]  Mary Czerwinski,et al.  PhotoTOC: automatic clustering for browsing personal photographs , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[120]  Jingrui He,et al.  Classification of Digital Photos Taken by Photographers or Home Users , 2004, PCM.

[121]  Rong Yan,et al.  Large-scale multimedia semantic concept modeling using robust subspace bagging and MapReduce , 2009, LS-MMRM '09.

[122]  Zhiguo Gong,et al.  Web Image Semantic Clustering , 2005, OTM Conferences.

[123]  Yiannis S. Boutalis,et al.  img(Anaktisi): A Web Content Based Image Retrieval System , 2009, 2009 Second International Workshop on Similarity Search and Applications.

[124]  Wei-Ying Ma,et al.  Clustering and searching WWW images using link and page layout analysis , 2007, TOMCCAP.

[125]  Eytan Ruppin,et al.  Facial Attractiveness: Beauty and the Machine , 2006, Neural Computation.

[126]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[127]  Brian L. Evans,et al.  Unsupervised automation of photographic composition rules in digital still cameras , 2004, IS&T/SPIE Electronic Imaging.

[128]  Joo-Hwee Lim,et al.  Home Photo Retrieval: Time Matters , 2003, CIVR.

[129]  Andreas E. Savakis,et al.  Automated event clustering and quality screening of consumer pictures for digital albuming , 2003, IEEE Trans. Multim..

[130]  Ramesh C. Jain,et al.  Semantics In Digital Photos: A Contenxtual Analysis , 2008, 2008 IEEE International Conference on Semantic Computing.

[131]  Jiebo Luo,et al.  Exploiting context for semantic scene classification , 2005 .

[132]  Jiebo Luo,et al.  Annotating photo collections by label propagation according to multiple similarity cues , 2008, ACM Multimedia.

[133]  Andrew D. Miller,et al.  Give and take: a study of consumer photo-sharing culture and practice , 2007, CHI.

[134]  Allan Kuchinsky,et al.  Requirements for photoware , 2002, CSCW '02.

[135]  Miska M. Hannuksela,et al.  Perceptual quality assessment based on visual attention analysis , 2009, ACM Multimedia.

[136]  Letizia Tanca,et al.  Towards a definition of an Image Ontology , 2007, 18th International Workshop on Database and Expert Systems Applications (DEXA 2007).

[137]  Wilson S. Geisler,et al.  Image quality assessment based on a degradation model , 2000, IEEE Trans. Image Process..

[138]  Antonio Torralba,et al.  Scene-Centered Description from Spatial Envelope Properties , 2002, Biologically Motivated Computer Vision.

[139]  Li Cun-he,et al.  Hyperlink Classification: A New Approach to Improve PageRank , 2007 .

[140]  Danah Boyd,et al.  Social Network Sites: Definition, History, and Scholarship , 2007, J. Comput. Mediat. Commun..

[141]  Dragutin Petkovic,et al.  Content-based representation and retrieval of visual media: A state-of-the-art review , 1996, Multimedia Tools and Applications.

[142]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[143]  W. Wagenaar My memory: A study of autobiographical memory over six years , 1986, Cognitive Psychology.

[144]  Abigail Sellen,et al.  Understanding photowork , 2006, CHI.

[145]  Jonathon S. Hare,et al.  Semantic facets: an in-depth analysis of a semantic image retrieval system , 2007, CIVR '07.

[146]  Fergal Monaghan,et al.  Automating Photo Annotation using Services and Ontologies , 2006, 7th International Conference on Mobile Data Management (MDM'06).

[147]  Masashi Inoue,et al.  Image retrieval: Research and use in the information explosion , 2009 .

[148]  Bülent Sankur,et al.  Statistical evaluation of image quality measures , 2002, J. Electronic Imaging.

[149]  S. Sclaroff,et al.  Combining textual and visual cues for content-based image retrieval on the World Wide Web , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[150]  Ajmal S. Mian,et al.  A Hybrid Image Quality Measure for Automatic Image Quality Assessment , 2009, SCIA.

[151]  Ramesh C. Jain,et al.  Automatic Person Annotation of Family Photo Album , 2006, CIVR.

[152]  Anil K. Jain,et al.  Content-based hierarchical classification of vacation images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[153]  Jiebo Luo,et al.  Bayesian fusion of camera metadata cues in semantic scene classification , 2004, CVPR 2004.

[154]  Lynda Hardman,et al.  Towards a syntax for multimedia semantics , 2002 .

[155]  Tom Rodden,et al.  Collaborating around collections: informing the continued development of photoware , 2004, CSCW.

[156]  Steffen Staab,et al.  Semantic Annotation of Images and Videos for Multimedia Analysis , 2005, ESWC.

[157]  Jiebo Luo,et al.  Pictures are not taken in a vacuum - an overview of exploiting context for semantic scene content understanding , 2006, IEEE Signal Processing Magazine.

[158]  James Ze Wang,et al.  Real-Time Computerized Annotation of Pictures , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[159]  Jane Hunter,et al.  Adding Multimedia to the Semantic Web: Building an MPEG-7 ontology , 2001, SWWS.

[160]  H. Garcia-Molina,et al.  Automatic organization for digital photographs with geographic coordinates , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[161]  Daniel Gatica-Perez,et al.  Analyzing Flickr groups , 2008, CIVR '08.

[162]  L. D. Couprie Iconclass: an iconographic classification system , 1983 .

[163]  A. T. Schreiber,et al.  Semantic Annotation of Image Collections , 2003 .

[164]  Rik Van de Walle,et al.  The MPEG-21 Book: Burnett/The MPEG-21 Book , 2006 .

[165]  Raimondo Schettini,et al.  Image annotation using SVM , 2003, IS&T/SPIE Electronic Imaging.

[166]  Hila Becker,et al.  Learning similarity metrics for event identification in social media , 2010, WSDM '10.