Three-Dimensional Integral Imaging for Gesture Recognition Under Occlusions

Three-dimensional (3D) imaging has recently been applied to human gesture recognition using depth maps from RGB-D sensors. An alternative which has been scarcely explored is 3D Integral Imaging, which has shown to give very competitive results in object reconstruction and recognition tasks, even under challenging conditions (e.g. low illumination, occlusions). Integral Imaging has some remarkable advantages over other sensors that may give 3D information (like RGB-D sensors). One of the most important ones is its long range working capability, which stands out even more when compared against other sensors that lose their capabilities for depths of 2m or more. In this paper we present results corresponding to the application of the integral imaging 3D acquisition technique for the recognition of human gestures, when there are occlusions that may hinder the recognition capability. We also present results comparing its capability against that given by an RGB-D sensor (Kinect) and that obtained when only one of the cameras in the camera array is used. Our results show that Integral Imaging compares more or less similarly to Kinect and the monocular case when there are not occlusions, but much more favorably when there are. We also show that the camera spatial resolution may be an issue to account for, when we refer to gesture recognition under occlusions, for the monocular case, but it is less sensitive for the Integral Imaging case, because the features that are extracted from Integral Imaging seem to be more descriptive and discriminative than for the monocular counterpart case.

[1]  Hong Wei,et al.  A survey of human motion analysis using depth imagery , 2013, Pattern Recognit. Lett..

[2]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.

[3]  Bahram Javidi,et al.  Mid-Wave Infrared 3D Integral Imaging at Long Range , 2013, Journal of Display Technology.

[4]  Pascal Fua,et al.  Making Action Recognition Robust to Occlusions and Viewpoint Changes , 2010, ECCV.

[5]  Bahram Javidi,et al.  Distortion-tolerant 3D recognition of occluded objects using computational integral imaging. , 2006, Optics express.

[6]  Myungjin Cho,et al.  3D passive photon counting automatic target recognition using advanced correlation filters. , 2011, Optics letters.

[7]  Myungjin Cho,et al.  Three-Dimensional Visualization of Objects in Turbid Water Using Integral Imaging , 2010, Journal of Display Technology.

[8]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[9]  Guangchun Cheng,et al.  Advances in Human Action Recognition: A Survey , 2015, ArXiv.

[10]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[11]  Ioannis A. Kakadiaris,et al.  A Review of Human Activity Recognition Methods , 2015, Front. Robot. AI.

[12]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13]  Filiberto Pla,et al.  Human gesture recognition using three-dimensional integral imaging. , 2014, Journal of the Optical Society of America. A, Optics, image science, and vision.

[14]  Lu Yang,et al.  Survey on 3D Hand Gesture Recognition , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[16]  Filiberto Pla,et al.  Three-Dimensional Integral Imaging for Gesture Recognition Under Occlusions , 2017, IEEE Signal Processing Letters.