Enhancing the detection of concepts for visual lifelogs using contexts instead of ontologies

Automatic detection of semantic concepts in visual media is typically achieved by an automatic mapping from low-level features to higher level semantics and progress in automatic detection within narrow domains has now reached a satisfactory performance level. In visual lifelogging, part of the quantified-self movement, wearable cameras can automatically record most aspects of daily living. The resulting images have a diversity of everyday concepts which severely degrades the performance of concept detection. In this paper, we present an algorithm based on non-negative matrix refactorization which exploits inherent relationships between everyday concepts in domains where context is more prevalent, such as lifelogging. Results for initial concept detection are factorized and adjusted according to their patterns of appearance, and absence. In comparison to using an ontology to enhance concept detection, we use underlying contextual semantics to improve overall detection performance. Results are demonstrated in experiments to show the efficacy of our algorithm.

[1]  Deborah Estrin,et al.  Image browsing, processing, and clustering for participatory sensing: lessons from a DietSense prototype , 2007, EmNets '07.

[2]  Shahram Izadi,et al.  SenseCam: A Retrospective Memory Aid , 2006, UbiComp.

[3]  Alan F. Smeaton,et al.  Constructing a SenseCam visual diary as a media process , 2008, Multimedia Systems.

[4]  Alan F. Smeaton,et al.  Semantics-based selection of everyday concepts in visual lifelogging , 2012, International Journal of Multimedia Information Retrieval.

[5]  Jenny Benois-Pineau,et al.  The IMMED project: wearable video monitoring of people with age dementia , 2010, ACM Multimedia.

[6]  Steve Hodges,et al.  SenseCam improves memory for recent events and quality of life in a patient with memory retrieval difficulties , 2011, Memory.

[7]  Alan F. Smeaton,et al.  Everyday concept detection in visual lifelogs: validation, relationships and trends , 2010, Multimedia Tools and Applications.

[8]  Alan F. Smeaton,et al.  Using visual lifelogs to automatically characterize everyday activities , 2013, Inf. Sci..

[9]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[10]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[11]  Alan F. Smeaton,et al.  Measuring the Influence of Concept Detection on Video Retrieval , 2009, CAIP.

[12]  Paul Over,et al.  High-level feature detection from video in TRECVid: a 5-year retrospective of achievements , 2009 .

[13]  P. Kelly,et al.  Feasibility of a SenseCam-assisted 24-h recall to reduce under-reporting of energy intake , 2013, European Journal of Clinical Nutrition.