Top-down Spatial Attention for Visual Search: Novelty Detection-Tracking Using Spatial Memory with a Mobile Robot

Assistive robotics technologies have been growing impact on at-home monitoring services to support daily life. One of the main research fields is to develop an autonomous mobile robot with the tasks detection, tracking, observation and analysis of the subject of interest in the indoor environment. The main challenges in such daily monitoring application, thus in visual search, are that the robot should track the subject successfully in several severe varying conditions. Recent color and depth image based visual search methods can help to handle part of the problems, such as changing illumination, occlusion, and etc. but these methods generally use large amount of training data by checking the whole scene with high redundancy to find the region of interest. Therefore, inspired by the idea that spatial memory can reveal novelty regions for finding the attention points as in Human Visual System (HVS), we proposed a simple and novel algorithm that integrates Kinect and Lidar(Light Detection And Ranging) sensor data to detect and track novelties using the environment map of the robot as a top-down approach without the necessity of large amount of training data. Then, novelty detection and tracking is achieved based on space based saliency map representing the novelty on the scene. Experimental results demonstrated that the proposed visual attention based scene analysis can handle various conditions stated and achieve high accuracy of novelty detection and tracking.

[1]  Shuang Wang,et al.  Robust human detecting and tracking using varying scale template matching , 2012, 2012 IEEE International Conference on Information and Automation.

[2]  Min-Shik Kim,et al.  The role of spatial working memory in visual search efficiency , 2004, Psychonomic bulletin & review.

[3]  Michael C. Mozer,et al.  Space-and object-based attention , 2005 .

[4]  C D Frith,et al.  Space-based and object-based visual attention: shared and specific neural domains. , 1997, Brain : a journal of neurology.

[5]  Bhaskara Marthi,et al.  An object-based semantic world model for long-term change detection and semantic querying , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Simone Frintrop,et al.  A Component-Based Approach to Visual Person Tracking from a Mobile Platform , 2010, Int. J. Soc. Robotics.

[7]  Simone Frintrop,et al.  A Computational Framework for Attentional 3D Object Detection , 2013, CogSci.

[8]  D. Zhang,et al.  Scale and orientation adaptive mean shift tracking , 2012 .

[9]  Kevin J. Hawley,et al.  Attention capture by novel stimuli. , 1990, Journal of experimental psychology. General.

[10]  David A. Bell,et al.  NavChair: An Assistive Wheelchair Navigation System with Automatic Adaptation , 1998, Assistive Technology and Artificial Intelligence.

[11]  K. Nakayama,et al.  On the Functional Role of Implicit Visual Memory for the Adaptive Deployment of Attention Across Scenes , 2000 .

[12]  Weisi Lin,et al.  Saliency detection for stereoscopic images , 2013, 2013 Visual Communications and Image Processing (VCIP).

[13]  Heinz Hügli,et al.  Computing visual attention from scene depth , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[14]  Huosheng Hu,et al.  Towards ROS Based Multi-robot Architecture for Ambient Assisted Living , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[15]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[16]  Wenwei Yu,et al.  Human Activity Recognition Using Body Contour Parameters Extracted from Depth Images , 2013 .

[17]  Weisi Lin,et al.  A Saliency Detection Model Using Low-Level Features Based on Wavelet Transform , 2013, IEEE Transactions on Multimedia.

[18]  Wanqing Li,et al.  Object detection using Non-Redundant Local Binary Patterns , 2010, 2010 IEEE International Conference on Image Processing.

[19]  Alex Mihailidis,et al.  A Survey on Ambient-Assisted Living Tools for Older Adults , 2013, IEEE Journal of Biomedical and Health Informatics.

[20]  Ali Borji,et al.  Adaptive object tracking by learning background context , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[21]  C. Datta,et al.  Design, implementation and field tests of a socially assistive robot for the elderly: HealthBot version 2 , 2012, 2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob).

[22]  Christel Chamaret,et al.  Adaptive 3D rendering based on region-of-interest , 2010, Electronic Imaging.

[23]  Weisi Lin,et al.  Saliency Detection in the Compressed Domain for Adaptive Image Retargeting , 2012, IEEE Transactions on Image Processing.

[24]  Junle Wang,et al.  Computational Model of Stereoscopic 3D Visual Saliency , 2013, IEEE Transactions on Image Processing.

[25]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[26]  Ren C. Luo,et al.  Multisensor Fusion and Integration: A Review on Approaches and Its Applications in Mechatronics , 2012, IEEE Transactions on Industrial Informatics.

[27]  Fakhri Karray,et al.  Object- and space-based visual attention: An integrated framework for autonomous robots , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[29]  M. Shah,et al.  Object tracking: A survey , 2006, CSUR.

[30]  Honghai Liu,et al.  Intelligent Video Systems and Analytics: A Survey , 2013, IEEE Transactions on Industrial Informatics.

[31]  Ming-Hsuan Yang,et al.  Top-down visual saliency via joint CRF and dictionary learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Rustam Stolkin,et al.  Particle Filter Tracking of Camouflaged Targets by Adaptive Fusion of Thermal and Visible Spectra Camera Data , 2014, IEEE Sensors Journal.

[33]  Gordon D. Logan,et al.  THE CODE THEORY OF VISUAL ATTENTION : AN INTEGRATION OF SPACE-BASED AND OBJECT-BASED ATTENTION , 1996 .

[34]  Nanning Zheng,et al.  Key Object Discovery and Tracking Based on Context-Aware Saliency , 2013, International Journal of Advanced Robotic Systems.

[35]  Jingchuan Wang,et al.  Semantic Mapping for Smart Wheelchairs Using RGB-D Camera , 2013 .

[36]  Wolfram Burgard,et al.  Improved Techniques for Grid Mapping With Rao-Blackwellized Particle Filters , 2007, IEEE Transactions on Robotics.

[37]  Honggang Zhang,et al.  Region-based high-level semantics extraction with CEDD , 2010, 2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content.

[38]  Joachim Hertzberg,et al.  Towards semantic maps for mobile robots , 2008, Robotics Auton. Syst..

[39]  Armin B. Cremers,et al.  Adaptive Multi-cue 3D Tracking of Arbitrary Objects , 2012, DAGM/OAGM Symposium.

[40]  Horst-Michael Groß,et al.  Progress in developing a socially assistive mobile home robot companion for the elderly with mild cognitive impairment , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.