Body Parts Segmentation with Attached Props Using RGB-D Imaging

People detection is essential in a lot of different systems. Many applications nowadays tend to require people detection to achieve certain tasks. These applications come under many disciplines, such as robotics, ergonomics, biomechanics, gaming and automotive industries. This wide range of applications makes human body detection an active area of research. With the release of depth sensors or RGB-D cameras such as Micosoft Kinect, this area of research became more active, specially with their affordable price. Human body detection requires the adaptation of many scenarios and situations. Various conditions such as occlusions, background cluttering and props attached to the human body require training on custom built datasets. In this paper we present an approach to prepare training datasets to detect and track human body with attached props. The proposed approach uses rigid body physics simulation to create and animate different props attached to the human body. Three scenarios are implemented. In the first scenario the prop is closely attached to the human body, such as a person carrying a backpack. In the second scenario, the prop is slightly attached to the human body, such as a person carrying a briefcase. In the third scenario the prop is not attached to the human body, such as a person dragging a trolley bag. Our approach gives results with accuracy of 93% in identifying both the human body parts and the attached prop in all the three scenarios.

[1]  Andrew D. Wilson Using a depth camera as a touch sensor , 2010, ITS '10.

[2]  Kin Fun Li,et al.  A Web-Based Sign Language Translator Using 3D Video Processing , 2011, 2011 14th International Conference on Network-Based Information Systems.

[3]  Lei Wei,et al.  Low cost multimodal facial recognition via kinect sensors , 2012 .

[4]  Albert A. Rizzo,et al.  Development and evaluation of low cost game-based balance rehabilitation tool using the microsoft kinect sensor , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[5]  Tilak Dutta,et al.  Evaluation of the Kinect™ sensor for 3-D kinematic measurement in the workplace. , 2012, Applied ergonomics.

[6]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[7]  Roberto Cipolla,et al.  Semantic texton forests for image categorization and segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Toby Sharp,et al.  Implementing Decision Trees and Forests on a GPU , 2008, ECCV.

[9]  Saeid Nahavandi,et al.  Safety applications using Kinect technology , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[10]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.

[11]  Saeid Nahavandi,et al.  Kinect crowd interaction , 2012 .

[12]  Ligang Liu,et al.  Scanning 3D Full Human Bodies Using Kinects , 2012, IEEE Transactions on Visualization and Computer Graphics.

[13]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[14]  Günther Greiner,et al.  Automatic reconstruction of personalized avatars from 3D face scans , 2011, Comput. Animat. Virtual Worlds.

[15]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[16]  Saeid Nahavandi,et al.  Measuring depth accuracy in RGBD cameras , 2013, 2013, 7th International Conference on Signal Processing and Communication Systems (ICSPCS).

[17]  Ricardo Gutierrez-Osuna,et al.  Web GIS in practice X: a Microsoft Kinect natural user interface for Google Earth navigation , 2011, International journal of health geographics.

[18]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[19]  S. Nahavandi,et al.  Color map-based image fusion , 2008, 2008 6th IEEE International Conference on Industrial Informatics.

[20]  Joris De Schutter,et al.  An adaptable system for RGB-D based human body detection and pose estimation , 2014, J. Vis. Commun. Image Represent..

[21]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.