Risk analysis for smart homes and domestic robots using robust shape and physics descriptors, and complex boosting techniques

In this paper, the notion of risk analysis within 3D scenes using vision based techniques is introduced. In particular the problem of risk estimation of indoor environments at the scene and object level is considered, with applications in domestic robots and smart homes. To this end, the proposed Risk Estimation Framework is described, which provides a quantified risk score for a given scene. This methodology is extended with the introduction of a novel robust kernel for 3D shape descriptors such as 3D HOG and SIFT3D, which aims to reduce the effects of outliers in the proposed risk recognition methodology. The Physics Behaviour Feature (PBF) is presented, which uses an object's angular velocity obtained using Newtonian physics simulation as a descriptor. Furthermore, an extension of boosting techniques for learning is suggested in the form of the novel Complex and Hyper-Complex Adaboost, which greatly increase the computation efficiency of the original technique. In order to evaluate the proposed robust descriptors an enriched version of the 3D Risk Scenes (3DRS) dataset with extra objects, scenes and meta-data was utilised. A comparative study was conducted demonstrating that the suggested approach outperforms current state-of-the-art descriptors.

[1]  G. Griebel,et al.  Risk assessment as an evolved threat detection and analysis process , 2011, Neuroscience & Biobehavioral Reviews.

[2]  Cordelia Schmid,et al.  Human Focused Action Localization in Video , 2010, ECCV Workshops.

[3]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  Sven Wachsmuth,et al.  Dynamic 3D scene analysis for acquiring articulated scene models , 2010, 2010 IEEE International Conference on Robotics and Automation.

[5]  Odemir Martinez Bruno,et al.  Texture analysis using fractal descriptors estimated by the mutual interference of color channels , 2016, Inf. Sci..

[6]  Stefan Hinz,et al.  Semantic point cloud interpretation based on optimal neighborhoods, relevant features and efficient classifiers , 2015 .

[7]  Martial Hebert,et al.  Efficient visual event detection using volumetric features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[8]  Frédo Durand,et al.  Visual vibrometry: Estimating material properties from small motions in video , 2015, CVPR.

[9]  Jian Huang,et al.  An Accurate Method for Voxelizing Polygon Meshes , 1998, VVS.

[10]  Bahram Javidi,et al.  3D Integral Imaging Reconstruction of Occluded Objects Using Independent Component Analysis-Based K-Means Clustering , 2010, Journal of Display Technology.

[11]  Katsushi Ikeuchi,et al.  Scene Understanding by Reasoning Stability and Safety , 2015, International Journal of Computer Vision.

[12]  Chris D. Nugent,et al.  A Knowledge-Driven Approach to Activity Recognition in Smart Homes , 2012, IEEE Transactions on Knowledge and Data Engineering.

[13]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[14]  Anton van den Hengel,et al.  Thrift: Local 3D Structure Recognition , 2007, 9th Biennial Conference of the Australian Pattern Recognition Society on Digital Image Computing Techniques and Applications (DICTA 2007).

[15]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[16]  Benjamin Bustos,et al.  Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes , 2011, The Visual Computer.

[17]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[18]  Pol Cirujeda,et al.  A 3D Scene Registration Method via Covariance Descriptors and an Evolutionary Stable Strategy Game Theory Solver , 2015, International Journal of Computer Vision.

[19]  Cordelia Schmid,et al.  Explicit Modeling of Human-Object Interactions in Realistic Videos , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[21]  William J. Christmas,et al.  Fast robust correlation , 2005, IEEE Transactions on Image Processing.

[22]  Balbir S. Dhillon,et al.  Robot systems reliability and safety: a review , 2002 .

[23]  Georgios Tzimiropoulos,et al.  A 3D Scene Analysis Framework and Descriptors for Risk Evaluation , 2015, 2015 International Conference on 3D Vision.

[24]  Mubarak Shah,et al.  A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.

[25]  Y. F. Yong,et al.  Robot Safety , 1985 .

[26]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[27]  James M. Keller,et al.  Histogram of Oriented Normal Vectors for Object Recognition with a Depth Sensor , 2012, ACCV.

[28]  Tülay Adali,et al.  Noncircular Principal Component Analysis and Its Application to Model Selection , 2011, IEEE Transactions on Signal Processing.

[29]  Neil D. Lawrence,et al.  Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data , 2003, NIPS.

[30]  Andreas Zell,et al.  Automatic Take Off, Tracking and Landing of a Miniature UAV on a Moving Carrier Vehicle , 2011, J. Intell. Robotic Syst..

[31]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[32]  J. Niemeyer,et al.  Contextual classification of lidar data and building object detection in urban areas , 2014 .

[33]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[34]  Georg Wiesmann,et al.  Event-driven feature analysis in a 4D spatiotemporal representation for ambient assisted living , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[35]  Marjorie Skubic,et al.  Evaluation of an inexpensive depth camera for passive in-home fall risk assessment , 2011, 2011 5th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth) and Workshops.

[36]  Tülay Adali,et al.  Complex-Valued Signal Processing: The Proper Way to Deal With Impropriety , 2011, IEEE Transactions on Signal Processing.

[37]  Sergio A. Velastin,et al.  Vehicle localisation and classification in urban CCTV streams , 2009 .

[38]  Nico Blodow,et al.  General 3D modelling of novel objects from a single view , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  Sergio A. Velastin,et al.  A Review of Computer Vision Techniques for the Analysis of Urban Traffic , 2011, IEEE Transactions on Intelligent Transportation Systems.

[40]  Katsushi Ikeuchi,et al.  BRDF Estimation of Structural Color Object by Using Hyper Spectral Image , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[41]  Jiajun Wu,et al.  Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning , 2015, NIPS.

[42]  Katsushi Ikeuchi,et al.  Detecting potential falling objects by inferring human action and natural disturbance , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[43]  Bohyung Han,et al.  Bayesian Filtering and Integral Image for Visual Tracking , 2005 .

[44]  Vasileios Argyriou,et al.  3D Voxel HOG and Risk Estimation , 2015, 2015 IEEE International Conference on Digital Signal Processing (DSP).

[45]  Silvio Savarese,et al.  3D Scene Understanding by Voxel-CRF , 2013, 2013 IEEE International Conference on Computer Vision.

[46]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[47]  Oliver Wang,et al.  Material classification using BRDF slices , 2009, CVPR.

[48]  Nassir Navab,et al.  Model globally, match locally: Efficient and robust 3D object recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[49]  S. Shankar Sastry,et al.  A vision system for landing an unmanned aerial vehicle , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[50]  Jessica B. Hamrick,et al.  Simulation as an engine of physical scene understanding , 2013, Proceedings of the National Academy of Sciences.

[51]  Ernesto Tapia,et al.  A note on the computation of high-dimensional integral images , 2011, Pattern Recognit. Lett..

[52]  Tom Drummond,et al.  Faster and Better: A Machine Learning Approach to Corner Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Henrik I. Christensen,et al.  Efficient Organized Point Cloud Segmentation with Connected Components , 2013 .

[54]  Tobias Schreck,et al.  Histograms of Oriented Gradients for 3D Object Retrieval , 2010 .

[55]  Zixiang Xiong,et al.  3D scene reconstruction by multiple structured-light based commodity depth cameras , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[56]  Afzal Godil,et al.  Salient local 3D features for 3D shape retrieval , 2011, Electronic Imaging.

[57]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.