Transfer Learning for Improving Lifelog Image Retrieval

With lifelogging devices; such as wearable camera, smart watches, audio recorder or standalone smartphone applications; capturing daily moments becomes easier. In recent years, many workshops and panels have emerged and proposed benchmarks to face challenges in organizing, analyzing, managing, indexing and retrieving specific moments in the huge amount of multi-modal lifelog dataset. Recent advances in deep neural networks have given rise to new approaches to deep learning-based image retrieval. However, using deep neural networks in lifelog context systems is continuously rising challenges: relying on a convolutional neural network which is trained on images not related to the retrieval dataset reduced the performance to extract features. In this paper, we propose a novel fine-tuned Convolutional Neural Network approach based on a Long Short Term Memory processing for improving lifelog image retrieval. The experimental results show the feasibility and effectiveness of our approach with encouraging performance by reaching third place in the ImageCLEF Lifelog Moment Retrieval Task 2018.

[1]  Vinh-Tiep Nguyen,et al.  Lifelog Moment Retrieval with Visual Concept Fusion and Text-based Query Expansion , 2018, CLEF.

[2]  Victor S. Lempitsky,et al.  Neural Codes for Image Retrieval , 2014, ECCV.

[3]  Carlos R. del-Blanco,et al.  Retrieving Events in Life Logging , 2018, CLEF.

[4]  W. Bruce Croft,et al.  Estimating Embedding Vectors for Queries , 2016, ICTIR.

[5]  Georges Quénot,et al.  LIG-MRIM at NTCIR-12 Lifelog Semantic Access Task , 2016, NTCIR.

[6]  Bogdan Ionescu,et al.  Multimedia Lab @ ImageCLEF 2018 Lifelog Moment Retrieval Task , 2018, CLEF.

[7]  Jenny Benois-Pineau,et al.  The IMMED project: wearable video monitoring of people with age dementia , 2010, ACM Multimedia.

[8]  Chokri Ben Amar,et al.  Regim Lab Team at ImageCLEF Lifelog Moment Retrieval Task 2018 , 2018, CLEF.

[9]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Mandar Mitra,et al.  Word Embedding based Generalized Language Model for Information Retrieval , 2015, SIGIR.

[11]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[12]  Ryan Calo,et al.  There is a blind spot in AI research , 2016, Nature.

[13]  James Ze Wang,et al.  Content-based image retrieval: approaches and trends of the new age , 2005, MIR '05.

[14]  Petia Radeva,et al.  LEMoRe: A Lifelog Engine for Moments Retrieval at the NTCIR-Lifelog LSAT Task , 2016, NTCIR.

[15]  Chokri Ben Amar,et al.  Multilevel Deep Learning-Based Processing for Lifelog Image Retrieval Enhancement , 2018, 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[16]  Vigneshwaran Subbaraju,et al.  VCI2R at the NTCIR-13 Lifelog-2 Lifelog Semantic Access Task , 2017, NTCIR Conference on Evaluation of Information Access Technologies.

[17]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[18]  Zhenghao Chen,et al.  Layer Removal for Transfer Learning with Deep Convolutional Neural Networks , 2017, ICONIP.

[19]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20]  Duc-Tien Dang-Nguyen,et al.  LIFER: An Interactive Lifelog Retrieval System , 2018, LSC@ICMR.

[21]  Rami Albatal,et al.  Overview of NTCIR-13 Lifelog-2 Task , 2017, NTCIR.

[22]  Petia Radeva,et al.  Leveraging Activity Indexing for Egocentric Image Retrieval , 2017, IbPRIA.

[23]  Michael Riegler,et al.  Organizer Team at ImageCLEFlifelog 2017: Baseline Approaches for Lifelog Retrieval and Summarization , 2017, CLEF.

[24]  Steve Hodges,et al.  SenseCam improves memory for recent events and quality of life in a patient with memory retrieval difficulties , 2011, Memory.

[25]  Chokri Ben Amar,et al.  A new model driven architecture for deep learning-based multimodal lifelog retrieval , 2018 .

[26]  W. Bruce Croft,et al.  Improving Language Estimation with the Paragraph Vector Model for Ad-hoc Retrieval , 2016, SIGIR.

[27]  G. O'loughlin,et al.  Using a wearable camera to increase the accuracy of dietary analysis. , 2013, American journal of preventive medicine.

[28]  Mohammad Reza Zare,et al.  Comparative Analysis of Image Retrieval Approaches , 2008 .