Employing Shadows for Multi-Person Tracking Based on a Single RGB-D Camera

Although there are many algorithms to track people that are walking, existing methods mostly fail to cope with occluded bodies in the setting of multi-person tracking with one camera. In this paper, we propose a method to use people’s shadows as a clue to track them instead of treating shadows as mere noise. We introduce a novel method to track multiple people by fusing shadow data from the RGB image with skeleton data, both of which are captured by a single RGB Depth (RGB-D) camera. Skeletal tracking provides the positions of people that can be captured directly, while their shadows are used to track them when they are no longer visible. Our experiments confirm that this method can efficiently handle full occlusions. It thus has substantial value in resolving the occlusion problem in multi-person tracking, even with other kinds of cameras.

[1]  Chenglei Yang,et al.  Supporting Easy Physical-to-Virtual Creation of Mobile VR Maze Games: A New Genre , 2017, CHI.

[2]  Luc Van Gool,et al.  Robust tracking-by-detection using a detector confidence particle filter , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Chi-Jui Wu,et al.  Out of sight: a toolkit for tracking occluded human joint positions , 2016, Personal and Ubiquitous Computing.

[5]  Matteo Munaro,et al.  OpenPTrack: People Tracking for Heterogeneous Networks of Color-Depth Cameras , 2014 .

[6]  Vassilios Morellas,et al.  A multi-sensor visual tracking system for behavior monitoring of at-risk children , 2012, 2012 IEEE International Conference on Robotics and Automation.

[7]  JongSuk Choi,et al.  Multiple human tracking using multiple kinects for an attendance check system of a smart class , 2013, 2013 10th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI).

[8]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[9]  Alexandre Heili,et al.  Exploiting Long-Term Connectivity and Visual Motion in CRF-Based Multi-Person Tracking , 2014, IEEE Transactions on Image Processing.

[10]  Jun Miura,et al.  Person identification based on the matching of foot strike timings obtained by LRFs and a smartphone , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  Leonidas J. Guibas,et al.  Frustum PointNets for 3D Object Detection from RGB-D Data , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  Ting-En Tseng,et al.  Real-time people detection and tracking for indoor surveillance using multiple top-view depth cameras , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Le Hui,et al.  Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Robby van Delden,et al.  A Thing of Beauty: Steering Behavior in an Interactive Playground , 2017, CHI.

[15]  Pao-Chi Chang,et al.  People tracking in an environment with multiple depth cameras: A skeleton-based pairwise trajectory matching scheme , 2016, J. Vis. Commun. Image Represent..

[16]  Wei Zeng,et al.  A two-view VR shooting theater system , 2014, VRCAI '14.

[17]  Michael Rauter Reliable Human Detection and Tracking in Top-View Depth Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Shin Ishii,et al.  An occlusion-aware particle filter tracker to handle complex and persistent occlusions , 2016, Computer Vision and Image Understanding.

[19]  Matteo Munaro,et al.  Tracking people within groups with RGB-D data , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  Nicolai Marquardt,et al.  EagleSense: Tracking People and Devices in Interactive Spaces using Real-Time Top-View Depth-Sensing , 2017, CHI.

[21]  Matteo Munaro,et al.  OpenPTrack: Open source multi-camera calibration and people tracking for RGB-D camera networks , 2016, Robotics Auton. Syst..

[22]  Mohan M. Trivedi,et al.  Detecting Moving Shadows: Algorithms and Evaluation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Dimitris Samaras,et al.  Leave-One-Out Kernel Optimization for Shadow Detection and Removal , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Chuan Qin,et al.  Can smartphone sensors enhance kinect experience? , 2012, MobiHoc '12.

[25]  Wang Weijie,et al.  Multi-user identification based on double views coupling in cooperative interaction scenarios , 2015 .

[26]  Jun Miura,et al.  Specific Person Detection and Tracking by a Mobile Robot Using 3D LIDAR and ESPAR Antenna , 2014, IAS.

[27]  Xiao Ke,et al.  A Robust Moving Object Detection in Multi-Scenario Big Data for Video Surveillance , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Jun Li,et al.  Deep Alignment Network Based Multi-Person Tracking With Occlusion and Motion Reasoning , 2019, IEEE Transactions on Multimedia.

[29]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[30]  Moira C. Norrie,et al.  XDKinect: development framework for cross-device interaction using kinect , 2014, EICS.

[31]  Patrick Baudisch,et al.  VirtualSpace - Overloading Physical Space with Multiple Virtual Reality Users , 2018, CHI.

[32]  J. Krumm,et al.  Multi-camera multi-person tracking for EasyLiving , 2000, Proceedings Third IEEE International Workshop on Visual Surveillance.

[33]  Ramakant Nevatia,et al.  Tracking multiple humans in complex situations , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[35]  Roderick Murray-Smith,et al.  Virtual sensors: rapid prototyping of ubiquitous interaction with a mobile phone and a Kinect , 2011, Mobile HCI.

[36]  Yasushi Yagi,et al.  Shadow extraction and application in pedestrian detection , 2014, EURASIP J. Image Video Process..

[37]  Luc Van Gool,et al.  A distributed camera system for multi-resolution surveillance , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).

[38]  L. Davis,et al.  Background and foreground modeling using nonparametric kernel density estimation for visual surveillance , 2002, Proc. IEEE.

[39]  Peter H. N. de With,et al.  Employing a RGB-D sensor for real-time tracking of humans across multiple re-entries in a smart environment , 2012, IEEE Transactions on Consumer Electronics.

[40]  V. Beran,et al.  Depth-Based Filtration for Tracking Boost , 2015, ACIVS.

[41]  Kai Oliver Arras,et al.  People detection in RGB-D data , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42]  Krystof Litomisky Consumer RGB-D Cameras and their Applications , 2012 .

[43]  Mohammed Bennamoun,et al.  Automatic Feature Learning for Robust Shadow Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[44]  Thomas B. Moeslund,et al.  Chromatic shadow detection and tracking for moving foreground segmentation , 2015, Image Vis. Comput..

[45]  Mary C. Whitton,et al.  Walking > walking-in-place > flying, in virtual environments , 1999, SIGGRAPH.

[46]  Dimitris Samaras,et al.  Shadow Detection with Conditional Generative Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[47]  Ming-Hsuan Yang,et al.  Fast and Accurate Online Video Object Segmentation via Tracking Parts , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[48]  Shwetak N. Patel,et al.  ID-Match: A Hybrid Computer Vision and RFID System for Recognizing Individuals in Groups , 2016, CHI.