An enhanced personal photo recommendation system by fusing contextual and textual features on mobile device

As a main means to record scene in personal daily life, personal photos convey high-level semantic information (e.g., who, what, when, where) of an activity user engaged in. Different from other information retrieval tasks, personal photo recommendation depends on the measure of activity relevancy which is implicitly embedded in photos. Spurred by this observation, an enhanced recommendation approach by fusing both contextual and textual features is proposed. First, contextual relevancy is incrementally refined with an enhanced temporal and spatial clustering method respectively. Second, textual similarity of photo annotations is calculated using WordNet to augment the activity relevancy. Third, a fuzzy decision based multi-criteria ranking algorithm i.e., Preference Ranking Organization Method of Enrichment Evaluation (PROMETHEE) is adopted to make recommendations when giving an entry photo. A prototype has been developed on mobile device to illustrate this concept. Experiment results on a real dataset which contains 10,827 photos collected from 50 volunteers during 12 months demonstrate that our approach is more accurate than traditional schemes.

[1]  Nigel Shadbolt,et al.  Image annotation with Photocopain , 2006 .

[2]  Jérôme Gensel,et al.  PhotoMap - Automatic Spatiotemporal Annotation for Mobile Photos , 2007, W2GIS.

[3]  Yanfeng Sun,et al.  MiAlbum - a system for home photo managemet using the semi-automatic image annotation approach , 2000, MM 2000.

[4]  R. Manmatha,et al.  Automatic Image Annotation and Retrieval using CrossMedia Relevance Models , 2003 .

[5]  Alan F. Smeaton,et al.  MediAssist: Using Content-Based Analysis and Context to Manage Personal Photo Collections , 2006, CIVR.

[6]  Jian Ma,et al.  Topic detection and organization of mobile text messages , 2010, CIKM '10.

[7]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[8]  Simon Dobson,et al.  Ontology-based query recommendation as a support to image retrieval , 2008 .

[9]  Lin-Shan Lee,et al.  Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Liang-Gee Chen,et al.  Architecture design of full HD JPEG XR encoder for digital photography applications , 2008, IEEE Transactions on Consumer Electronics.

[11]  Jiebo Luo,et al.  Pictures are not taken in a vacuum - an overview of exploiting context for semantic scene content understanding , 2006, IEEE Signal Processing Magazine.

[12]  Setareh Rafatirad,et al.  MEDIALIFE: from images to a life chronicle , 2009, SIGMOD Conference.

[13]  Nor Ashidi Mat Isa,et al.  Adaptive fuzzy-K-means clustering algorithm for image segmentation , 2010, IEEE Transactions on Consumer Electronics.

[14]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[15]  Eun Yi Kim,et al.  Automatic textile image annotation by predicting emotional concepts from visual features , 2010, Image Vis. Comput..

[16]  Shao-Yi Chien,et al.  Photo Retrieval Based on Spatial Layout with Hardware Acceleration for Mobile Devices , 2011, IEEE Transactions on Mobile Computing.

[17]  Mor Naaman,et al.  Why we tag: motivations for annotation in mobile and online media , 2007, CHI.