论文信息 - 3D reconstruction of freely moving persons for re-identification with a depth sensor

3D reconstruction of freely moving persons for re-identification with a depth sensor

In this work, we describe a novel method for creating 3D models of persons freely moving in front of a consumer depth sensor and we show how they can be used for long-term person re-identification. For overcoming the problem of the different poses a person can assume, we exploit the information provided by skeletal tracking algorithms for warping every point cloud frame to a standard pose in real time. Then, the warped point clouds are merged together to compose the model. Re-identification is performed by matching body shapes in terms of whole point clouds warped to a standard pose with the described method. We compare this technique with a classification method based on a descriptor of skeleton features and with a mixed approach which exploits both skeleton and shape features. We report experiments on two datasets we acquired for RGB-D re-identification which use different skeletal tracking algorithms and which are made publicly available to foster research in this new research branch.

[1] Wanqing Li,et al. Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[2] Ying Wu,et al. Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Andrew W. Fitzgibbon,et al. KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[4] Phil Sallee,et al. Training and feature-reduction techniques for human identification using anthropometry , 2010, 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[5] Alexander M. Bronstein,et al. Expression-invariant three-dimensional face recognition , 2005 .

[6] Stefano Soatto,et al. Multi-View Stereo Reconstruction of Dense Shape and Complex Appearance , 2005, International Journal of Computer Vision.

[7] Christophe Garcia,et al. Human activities dataset and the ICPR 2012 human activities recognition and localization competition , 2012 .

[8] Alexander M. Bronstein,et al. Topology-Invariant Similarity of Nonrigid Shapes , 2009, International Journal of Computer Vision.

[9] Jean-Luc Dugelay,et al. Improving identification by pruning: A case study on face recognition and body soft biometric , 2012, 2012 13th International Workshop on Image Analysis for Multimedia Interactive Services.

[10] Anil K. Jain,et al. Can soft biometric traits assist user recognition? , 2004, SPIE Defense + Commercial Sensing.

[11] Lynne E. Parker,et al. 4-dimensional local spatio-temporal features for human activity recognition , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12] Alexander M. Bronstein,et al. Three-Dimensional Face Recognition , 2005, International Journal of Computer Vision.

[13] Alessio Del Bue,et al. Re-identification with RGB-D Sensors , 2012, ECCV Workshops.

[14] Jiawen Chen,et al. Scalable real-time volumetric surface reconstruction , 2013, ACM Trans. Graph..

[15] Bart Selman,et al. Unstructured human activity detection from RGBD images , 2011, 2012 IEEE International Conference on Robotics and Automation.

[16] Claudia Linnhoff-Popien,et al. Gait Recognition with Kinect , 2012 .

[17] Paul J. Besl,et al. A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[18] Hai Tao,et al. Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features , 2008, ECCV.

[19] Baining Guo,et al. Kinect Identity: Technology and Experience , 2011, Computer.

[20] Luc Van Gool,et al. Real-time facial feature detection using conditional regression forests , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[22] Luc Van Gool,et al. Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[23] Fabio Roli,et al. Real-time Appearance-based Person Re-identification Over Multiple KinectTM Cameras , 2013, VISAPP.

[24] Ruzena Bajcsy,et al. Berkeley MHAD: A comprehensive Multimodal Human Action Database , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).

[25] Michael J. Black,et al. Home 3D body scans from noisy image and range data , 2011, 2011 International Conference on Computer Vision.