An Interactive Watershed-Based Approach for Lifelog Moment Retrieval

Recently, terminologies "lifelogging" and "lifelog" became to represent the activity of continuously recording people everyday experiences and dataset contained these recorded experiences, respectively. Hence, providing an excellent tool to retrieve life moments from lifelogs to fast and accurately bring a memory back to a human when required, become a challenging but exciting task for researchers. In this paper, a new method to meet this challenge by utilizing the hypothesis that a sequence of images taken during a specific period can share the same context and content is introduced. This hypothesis can be explained in another way that if there is one image satisfies a given query (i.e., seed); then a certain number of its spatiotemporal neighbors probably can share the same content and context (i.e., watershed). Hence, an interactive watershed-based approach is applied to build the proposed method that is evaluated on the imageCLEFlifelog 2019 dataset and compared to participants joined this event. The experimental results confirm the high productivity of the proposed method in both stable and accuracy aspects as well as the advantage of having an interactive schema to push the accuracy when there is a conflict between a query and how to interpret such a query.

[1]  Vigneshwaran Subbaraju,et al.  VCI2R at the NTCIR-13 Lifelog-2 Lifelog Semantic Access Task , 2017, NTCIR Conference on Evaluation of Information Access Technologies.

[2]  Lei Zhang,et al.  Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Chen Xu,et al.  The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding , 2014, International Journal of Computer Vision.

[4]  Hsin-Hsi Chen,et al.  Visual Concept Selection with Textual Knowledge for Understanding Activities of Daily Living and Life Moment Retrieval , 2018, CLEF.

[5]  Michael Riegler,et al.  Smart Lifelogging: Recognizing Human Activities using PHASOR , 2017, ICPRAM.

[6]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[7]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Vinh-Tiep Nguyen,et al.  Lifelog Moment Retrieval with Visual Concept Fusion and Text-based Query Expansion , 2018, CLEF.

[9]  Marc Langheinrich,et al.  Remembering through lifelogging: A survey of human memory augmentation , 2016, Pervasive Mob. Comput..

[10]  Michael Riegler,et al.  Overview of ImageCLEFlifelog 2019: Solve My Life Puzzle and Lifelog Moment Retrieval , 2019, CLEF.

[11]  Michael Riegler,et al.  Organizer Team at ImageCLEFlifelog 2017: Baseline Approaches for Lifelog Retrieval and Summarization , 2017, CLEF.

[12]  Alejandro Cartas,et al.  Activity recognition from visual lifelogs: State of the art and future challenges , 2019, Multimodal Behavior Analysis in the Wild.

[13]  Chokri Ben Amar,et al.  Regim Lab Team at ImageCLEF Lifelog Moment Retrieval Task 2018 , 2018, CLEF.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.