3D Voxel HOG and Risk Estimation

In this paper, we evaluate the notion of scene analysis with regard to physical risks and potential hazards in an environment and providing a quantified risk score. A definition of risk is given incorporating different elements but mainly focusing on intrinsic risk related properties of an object (e.g sharpness). A novel 3D Voxel-HOG descriptor is introduced that aims to classify and recognise the presence of sharp characteristics and features of objects present in a given scene. Additionally we present a new dataset (3DRS) designed for risk evaluation in scene analysis. The effectiveness of our method is demonstrated by conducting experiments on all the objects in the 3DRS dataset, comparing the proposed features with state-of-the-art descriptors.

[1]  Luc Van Gool,et al.  Exploiting Physical Inconsistencies for 3D Scene Understanding , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[2]  Ali Farhadi,et al.  Describing objects by their attributes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Reinhard Koch,et al.  Dynamic 3-D Scene Analysis Through Synthesis Feedback Control , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Mubarak Shah,et al.  A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.

[5]  Tetsuya Takiguchi,et al.  3D human posture estimation using the HOG features from monocular image , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Georg Wiesmann,et al.  Event-driven feature analysis in a 4D spatiotemporal representation for ambient assisted living , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[8]  Marjorie Skubic,et al.  Evaluation of an inexpensive depth camera for passive in-home fall risk assessment , 2011, 2011 5th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth) and Workshops.

[9]  Sergio A. Velastin,et al.  Vehicle localisation and classification in urban CCTV streams , 2009 .

[10]  S. Shankar Sastry,et al.  A vision system for landing an unmanned aerial vehicle , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[11]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[12]  Katsushi Ikeuchi,et al.  Detecting potential falling objects by inferring human action and natural disturbance , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Srinivasa G. Narasimhan,et al.  Clustering Appearance for Scene Analysis , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Afzal Godil,et al.  Salient local 3D features for 3D shape retrieval , 2011, Electronic Imaging.

[15]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[16]  Tsuhan Chen,et al.  3D-Based Reasoning with Blocks, Support, and Stability , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Jessica B. Hamrick,et al.  Simulation as an engine of physical scene understanding , 2013, Proceedings of the National Academy of Sciences.

[18]  Srinivasa G. Narasimhan,et al.  Appearance Derivatives for Isonormal Clustering of Scenes , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  T. Kanade,et al.  Geometric reasoning for single image structure recovery , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[21]  Benjamin Bustos,et al.  Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes , 2011, The Visual Computer.

[22]  Katsushi Ikeuchi,et al.  Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Cordelia Schmid,et al.  Explicit Modeling of Human-Object Interactions in Realistic Videos , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Cordelia Schmid,et al.  Human Focused Action Localization in Video , 2010, ECCV Workshops.

[25]  G. Griebel,et al.  Risk assessment as an evolved threat detection and analysis process , 2011, Neuroscience & Biobehavioral Reviews.

[26]  Sven Wachsmuth,et al.  Dynamic 3D scene analysis for acquiring articulated scene models , 2010, 2010 IEEE International Conference on Robotics and Automation.

[27]  Sergio A. Velastin,et al.  A Review of Computer Vision Techniques for the Analysis of Urban Traffic , 2011, IEEE Transactions on Intelligent Transportation Systems.

[28]  Henrik I. Christensen,et al.  Efficient Organized Point Cloud Segmentation with Connected Components , 2013 .

[29]  Roni Yagel,et al.  An accurate method for voxelizing polygon meshes , 1998, IEEE Symposium on Volume Visualization (Cat. No.989EX300).

[30]  Tobias Schreck,et al.  Histograms of Oriented Gradients for 3D Object Retrieval , 2010 .

[31]  Zixiang Xiong,et al.  3D scene reconstruction by multiple structured-light based commodity depth cameras , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Chris D. Nugent,et al.  A Knowledge-Driven Approach to Activity Recognition in Smart Homes , 2012, IEEE Transactions on Knowledge and Data Engineering.

[33]  James M. Keller,et al.  Histogram of Oriented Normal Vectors for Object Recognition with a Depth Sensor , 2012, ACCV.

[34]  Luc Van Gool,et al.  Dynamic 3D Scene Analysis from a Moving Vehicle , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  David Engel,et al.  Towards robust scene analysis: A versatile mid-level feature framework , 2010 .

[36]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.