Cross-Domain Usage in Real-Time Video-Based Tracking

This chapter emphasizes on the approach to include information from different type of sensors into the visible domain real time tracking. Since any individual sensor is not able to retrieve the complete information, so it is better to use information from distinct category of sensors. The chapter firstly enlightens the significance of introducing the cross-domain treatment into video based tracking. Following this, some previous work in the literature related to this idea is briefed. The chapter introduces the categorization of the cross-domain activity usage for real time object tracking and then each category is separately discussed in detail. The advantages as well as the limitations of each type of supplemented cross domain activity will be discussed. Finally, the recommendation and concluding remarks from the authors in lieu of future development of this cutting-edge field will be presented.

[1]  Gang Xiao,et al.  A new tracking approach for visible and infrared sequences based on tracking-before-fusion , 2014, International Journal of Dynamics and Control.

[2]  Rafael Muñoz-Salinas,et al.  Adaptive multi-modal stereo people tracking without background modelling , 2008, J. Vis. Commun. Image Represent..

[3]  Josef Kittler,et al.  Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering , 2015, IEEE Transactions on Multimedia.

[4]  Xuan Song,et al.  A Bayesian Approach: Fusion of Laser and Vision for Multiple Pedestrians Tracking , 2008 .

[5]  John W. McDonough,et al.  A joint particle filter for audio-visual speaker tracking , 2005, ICMI '05.

[6]  Daw-Tung Lin,et al.  Collaborative Pedestrian Tracking and Data Fusion With Multiple Cameras , 2011, IEEE Transactions on Information Forensics and Security.

[7]  Takeo Kanade,et al.  Cell population tracking and lineage construction with spatiotemporal context , 2008, Medical Image Anal..

[8]  Munsang Kim,et al.  Human-Robot Interaction in Real Environments by Audio-Visual Integration , 2007 .

[9]  Vladimir Shin,et al.  Mobile Node Localization Using Fusion Prediction-Based Interacting Multiple Model in Cricket Sensor Network , 2012, IEEE Transactions on Industrial Electronics.

[10]  Andrew Blake,et al.  Efficient Dense Stereo with Occlusions for New View-Synthesis by Four-State Dynamic Programming , 2006, International Journal of Computer Vision.

[11]  Takeo Kanade,et al.  Boundary detection based on supervised learning , 2010, 2010 IEEE International Conference on Robotics and Automation.

[12]  M.D. Naish,et al.  Active-vision-based multisensor surveillance - an implementation , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  Mathew Legg,et al.  A Combined Microphone and Camera Calibration Technique With Application to Acoustic Imaging , 2013, IEEE Transactions on Image Processing.

[14]  Sascha Spors,et al.  Joint audio-video object localization and tracking , 2001 .

[15]  Jongsuk Choi,et al.  Speaker selection and tracking in a cluttered environment with audio and visual information , 2009, IEEE Transactions on Consumer Electronics.

[16]  Guillaume-Alexandre Bilodeau,et al.  Feedback scheme for thermal-visible video registration, sensor fusion, and people tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[17]  Sun Li,et al.  Perceptual fusion of infrared and visible images through a hybrid multi-scale decomposition with Gaussian and bilateral filters , 2016, Inf. Fusion.

[18]  Robin J. Evans,et al.  Integrated probabilistic data association , 1994, IEEE Trans. Autom. Control..

[19]  John W. McDonough,et al.  Kalman Filters for Time Delay of Arrival-Based Source Localization , 2005, EURASIP J. Adv. Signal Process..

[20]  Mohan M. Trivedi,et al.  Audio-Visual Fusion and Tracking With Multilevel Iterative Decoding: Framework and Experimental Evaluation , 2010, IEEE Journal of Selected Topics in Signal Processing.

[21]  Trevor Darrell,et al.  Multiple person and speaker activity tracking with a particle filter , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  Jean-Marc Odobez,et al.  Audio-visual speaker tracking with importance particle filters , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[23]  Frédéric Lerasle,et al.  Vision and RFID-based person tracking in crowds from a mobile robot , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  Ray A. Jarvis,et al.  Panoramic Vision and Laser Range Finder Fusion for Multiple Person Tracking , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Najla Megherbi Bouallagu,et al.  Joint audio-video people tracking using belief theory , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[26]  Moongu Jeon,et al.  Data fusion of radar and image measurements for multi-object tracking via Kalman filtering , 2014, Inf. Sci..

[27]  Robert Pless,et al.  Extrinsic calibration of a camera and laser range finder (improves camera calibration) , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[28]  Josef Kittler,et al.  Audio constrained particle filter based visual tracking , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[29]  Usman Ali,et al.  Optimized Visual and Thermal Image Fusion for Efficient Face Recognition , 2006, 2006 9th International Conference on Information Fusion.

[30]  Lipo Wang,et al.  A Novel Support Vector Machine with Class-dependent Features for Biomedical Data , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[31]  David Rees,et al.  Bayesian fusion of thermal and visible spectra camera data for region based tracking with rapid background adaptation , 2012, 2012 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI).

[32]  Grzegorz Cielniak,et al.  Active people recognition using thermal and grey images on a mobile security robot , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Geovany de Araújo Borges,et al.  Line Extraction in 2D Range Images for Mobile Robotics , 2004, J. Intell. Robotic Syst..

[34]  James R. Hopgood,et al.  Video tracking through occlusions by fast audio source localisation , 2013, 2013 IEEE International Conference on Image Processing.

[35]  Cristiano Premebida,et al.  Performance of laser and radar ranging devices in adverse environmental conditions , 2009 .

[36]  Xuan Song,et al.  An online system for multiple interacting targets tracking: Fusion of laser and vision, tracking and learning , 2013, TIST.

[37]  Giovanni B. Palmerini,et al.  Combining thermal and visual imaging in spacecraft proximity operations , 2014, 2014 13th International Conference on Control Automation Robotics & Vision (ICARCV).

[38]  Senén Barro,et al.  Particle filter robot localisation through robust fusion of laser, WiFi, compass, and a network of external cameras , 2016, Inf. Fusion.

[39]  Alan F. Smeaton,et al.  Thermo-visual feature fusion for object tracking using multiple spatiogram trackers , 2007 .

[40]  Jean-Marc Odobez,et al.  Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[41]  Dieter Fox,et al.  A Spatio-Temporal Probabilistic Model for Multi-Sensor Multi-Class Object Recognition , 2007, ISRR.

[42]  Kiho Kwak,et al.  Unknown multiple object tracking using 2D lidar and video camera , 2014 .

[43]  Olivier Strauss,et al.  Calibration of a multi-sensor system laser rangefinder/camera , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[44]  Takeo Kanade,et al.  Extrinsic calibration of a single line scanning lidar and a camera , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[45]  Alessio Del Bue,et al.  A closed form solution to the microphone position self-calibration problem , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[46]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Yunhui Liu,et al.  An algorithm for extrinsic parameters calibration of a camera and a laser range finder using line features , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[48]  Rustam Stolkin,et al.  Particle Filter Tracking of Camouflaged Targets by Adaptive Fusion of Thermal and Visible Spectra Camera Data , 2014, IEEE Sensors Journal.