Occlusion-Aware Object Localization, Segmentation and Pose Estimation

We present a learning approach for localization and segmentation of objects in an image in a manner that is robust to partial occlusion. Our algorithm produces a bounding box around the full extent of the object and labels pixels in the interior that belong to the object. Like existing segmentation aware detection approaches, we learn an appearance model of the object and consider regions that do not fit this model as potential occlusions. However, in addition to the established use of pairwise potentials for encouraging local consistency, we use higher order potentials which capture information at the level of im- age segments. We also propose an efficient loss function that targets both localization and segmentation performance. Our algorithm achieves 13.52% segmentation error and 0.81 area under the false-positive per image vs. recall curve on average over the challenging CMU Kitchen Occlusion Dataset. This is a 42.44% decrease in segmentation error and a 16.13% increase in localization performance compared to the state-of-the-art. Finally, we show that the visibility labelling produced by our algorithm can make full 3D pose estimation from a single image robust to occlusion.

[1]  Jun Ueda,et al.  An Asymptotically Stable Pressure Observer Based on Load and Displacement Sensing for Pneumatic Actuators With Long Transmission Lines , 2017, IEEE/ASME Transactions on Mechatronics.

[2]  Aaron D. Ames,et al.  First steps toward translating robotic walking to prostheses: a nonlinear optimization based control approach , 2017, Auton. Robots.

[3]  Evangelos A. Theodorou,et al.  Model Predictive Path Integral Control: From Theory to Parallel Computation , 2017 .

[4]  Aaron D. Ames,et al.  3D multi-contact gait design for prostheses: Hybrid system models, virtual constraints and two-step direct collocation , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[5]  Huihua Zhao,et al.  FROM BIPEDAL LOCOMOTION TO PROSTHETIC WALKING: A HYBRID SYSTEM AND NONLINEAR CONTROL APPROACH , 2016 .

[6]  Jun Ueda,et al.  Observer based impedance control of a pneumatic system with long transmission lines , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[7]  James M. Rehg,et al.  Aggressive driving with model predictive path integral control , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Charles C. Kemp,et al.  Multimodal execution monitoring for anomaly detection during robot manipulation , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Aaron D. Ames,et al.  3D dynamic walking with underactuated humanoid robots: A direct collocation framework for optimizing hybrid zero dynamics , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[10]  Aaron D. Ames,et al.  Realizing dynamic and efficient bipedal locomotion on the humanoid robot DURUS , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Andrea Lockerd Thomaz,et al.  Work those arms: Toward dynamic and stable humanoid walking that optimizes full-body motion , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Victor Paredes,et al.  Upslope walking with transfemoral prosthesis using optimization based spline generation , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Aaron D. Ames,et al.  Optimal bipedal interactions with dynamic terrain: synthesis and analysis via nonlinear programming , 2016 .

[14]  Aaron D. Ames,et al.  Multicontact Locomotion on Transfemoral Prostheses via Hybrid System Models and Optimization-Based Control , 2016, IEEE Transactions on Automation Science and Engineering.

[15]  히로유키 핫토리,et al.  Motor control apparatus and motor control method , 2015 .

[16]  Charles C. Kemp,et al.  Task-centric selection of robot and environment initial configurations for assistive tasks , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  James M. Rehg,et al.  Combining tactile sensing and vision for rapid haptic mapping , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Aaron D. Ames,et al.  Multi-contact bipedal robotic locomotion , 2015, Robotica.

[19]  Aaron D. Ames,et al.  A hybrid systems and optimization-based control approach to realizing multi-contact locomotion on transfemoral prostheses , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[20]  Siavash Rezazadeh,et al.  Spring-Mass Walking With ATRIAS in 3D: Robust Gait Control Spanning Zero to 4.3 KPH on a Heavily Underactuated Bipedal Robot , 2015 .

[21]  Aaron D. Ames,et al.  Realization of stair ascent and motion transitions on prostheses utilizing optimization-based control and intent recognition , 2015, 2015 IEEE International Conference on Rehabilitation Robotics (ICORR).

[22]  Evangelos A. Theodorou,et al.  Sample Efficient Path Integral Control under Uncertainty , 2015, NIPS.

[23]  Evangelos Theodorou,et al.  Model Predictive Path Integral Control using Covariance Variable Importance Sampling , 2015, ArXiv.

[24]  Jun Ueda,et al.  Robotic Neuromuscular Facilitation for Regaining Neural Activation in Hemiparetic Limbs , 2015 .

[25]  Monica A. Daley,et al.  Do limit cycles matter in the long run? Stable orbits and sliding-mass dynamics emerge in task-optimal locomotion , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[26]  Aaron D. Ames,et al.  Hybrid zero dynamics based multiple shooting optimization with applications to robotic walking , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[27]  Evangelos Theodorou,et al.  Nonlinear Stochastic Control and Information Theoretic Dualities: Connections, Interdependencies and Thermodynamic Interpretations , 2015, Entropy.

[28]  Aaron D. Ames,et al.  Demonstration of locomotion with the powered prosthesis AMPRO utilizing online optimization-based control , 2015, HSCC.

[29]  Aaron D. Ames,et al.  Realization of nonlinear real-time optimization based controllers on self-contained transfemoral prosthesis , 2015, ICCPS.

[30]  Paulo Tabuada,et al.  First steps toward formal controller synthesis for bipedal robots , 2015, HSCC.

[31]  Eric Rombokas,et al.  GPU Based Path Integral Control with Learned Dynamics , 2015, ArXiv.

[32]  Carl E. Rasmussen,et al.  Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  E. Theodorou,et al.  A Generalized Importance Sampling Method for Model Predictive Path Integral Control , 2015 .

[34]  Monica A. Daley,et al.  TASK-LEVEL PRIORITIES IN GROUND-RUNNING BIRD LOCOMOTION: EXPERIMENTS TO MATH MODELS TO ROBOTS BY WAY OF OPTIMIZATION , 2015 .

[35]  Evangelos A. Theodorou,et al.  Scalable Reinforcement Learning via Trajectory Optimization and Approximate Gaussian Process Regression , 2015 .

[36]  Christian M. Hubicki,et al.  From running birds to walking robots : optimization as a unifying framework for dynamic bipedal locomotion , 2014 .

[37]  Yunpeng Pan,et al.  Model-based Path Integral Stochastic Control: A Bayesian Nonparametric Approach , 2014, ArXiv.

[38]  Yunpeng Pan,et al.  Probabilistic Differential Dynamic Programming , 2014, NIPS.

[39]  James M. Rehg,et al.  Learning to reach into the unknown: Selecting initial conditions when reaching in clutter , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[40]  Charles C. Kemp,et al.  Interleaving planning and control for efficient haptically-guided reaching in unknown environments , 2014, 2014 IEEE-RAS International Conference on Humanoid Robots.

[41]  Monica A. Daley,et al.  Don't break a leg: running birds from quail to ostrich prioritise leg safety and economy on uneven terrain , 2014, Journal of Experimental Biology.

[42]  Aaron D. Ames,et al.  Human‐inspired motion primitives and transitions for bipedal robotic locomotion in diverse terrain , 2014 .

[43]  Wayne J. Book,et al.  Model Predictive Control for Compliant Pneumatic Systems , 2014 .

[44]  Kostas Daniilidis,et al.  Single image 3D object detection and pose estimation for grasping , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[45]  Arnold Maliki,et al.  Robotic rehabilitation exercise in hemiparetic limbs based on functional synchronization of voluntary and involuntary nerve impulses , 2014, 2014 IEEE International Workshop on Advanced Robotics and its Social Impacts.

[46]  Monica A. Daley,et al.  Running into a trap: Numerical design of task-optimal preflex behaviors for delayed disturbance responses , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[47]  Christian M. Hubicki,et al.  Swing-Leg Trajectory of Running Guinea Fowl Suggests Task-Level Priority of Force Regulation Rather than Disturbance Rejection , 2014, PloS one.

[48]  Wayne J. Book,et al.  Predictive Friction Compensation for Control of Pneumatic Actuators , 2014 .

[49]  Aaron D. Ames,et al.  Exponential convergence of a unified CLF controller for robotic systems under parameter uncertainty , 2014, 2014 American Control Conference.

[50]  Aaron D. Ames,et al.  Quadratic program based control of fully-actuated transfemoral prosthesis for flat-ground and up-slope locomotion , 2014, 2014 American Control Conference.

[51]  Aaron D. Ames,et al.  Quadratic programming and impedance control for transfemoral prosthesis , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[52]  Aaron D. Ames,et al.  Human-inspired walking via unified PD and impedance control , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[53]  Yunpeng Pan,et al.  Nonparametric infinite horizon Kullback-Leibler stochastic control , 2014, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[54]  Aaron D. Ames,et al.  Dynamic multi-domain bipedal walking with atrias through SLIP based human-inspired control , 2014, HSCC.

[55]  Aaron D. Ames,et al.  Human-inspired multi-contact locomotion with AMBER2 , 2014, 2014 ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS).

[56]  智一 吉田,et al.  Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[57]  Monica A. Daley,et al.  GROUND-BIRD RUNNING IN NON-RIGID TERRAIN: DEDUCING TASK-LEVEL PRIORITIES IN LOCOMOTION , 2014 .

[58]  Wayne J. Book,et al.  Value of a High Fidelity Actuator Model for Dynamic Simulation of a Pneumatic Rescue Robot , 2014 .

[59]  Aaron D. Ames,et al.  Composing Dynamical Systems to Realize Dynamic Robotic Dancing , 2014, WAFR.

[60]  Silvio Savarese,et al.  Object Detection by 3D Aspectlets and Occlusion Reasoning , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[61]  Timothy Bretl,et al.  Learning impedance controller parameters for lower-limb prostheses , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[62]  M. Shinohara,et al.  Control of Voluntary and Involuntary Nerve Impulses for Hemiparesis Rehabilitation and MRI Study , 2013 .

[63]  Lena H. Ting,et al.  Stability Radius as a Method for Comparing the Dynamics of Neuromechanical Systems , 2013, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[64]  Robert J. Webster,et al.  MRI–Compatible Fluid-Powered Medical Devices , 2013 .

[65]  Lena H Ting,et al.  Defining feasible bounds on muscle activation in a redundant biomechanical task: practical implications of redundancy. , 2013, Journal of biomechanics.

[66]  J. Ueda,et al.  Analysis of an MRI Compatible Force Sensor for Sensitivity and Precision , 2013, IEEE Sensors Journal.

[67]  Torrence D. J. Welch,et al.  Statistically significant contrasts between EMG waveforms revealed using wavelet-based functional ANOVA. , 2013, Journal of neurophysiology.

[68]  E. Rayfield,et al.  Annual Meeting of the Society-for-Integrative-and-Comparative-Biology (SICB) , 2013 .

[69]  Shishir Kolathaya,et al.  Human-Inspired Walking in AMBER 1 . 0 and AMBER 2 . 0 , 2013 .

[70]  Lena H Ting,et al.  Sensorimotor feedback based on task-relevant error robustly predicts temporal recruitment and multidirectional tuning of muscle synergies. , 2013, Journal of neurophysiology.

[71]  Aaron D. Ames,et al.  Human-inspired underactuated bipedal robotic walking with AMBER on flat-ground, up-slope and uneven terrain , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[72]  Aaron D. Ames,et al.  Bipedal robotic running with partial hybrid zero dynamics and human-inspired optimization , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[73]  Aaron D. Ames,et al.  Achieving bipedal locomotion on rough terrain through human-inspired control , 2012, 2012 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR).

[74]  Jun Wang,et al.  Model Predictive Control of Unknown Nonlinear Dynamical Systems Based on Recurrent Neural Networks , 2012, IEEE Transactions on Industrial Electronics.

[75]  Maya Cakmak,et al.  Keyframe-based Learning from Demonstration , 2012, Int. J. Soc. Robotics.

[76]  Aaron D. Ames,et al.  Zeno behavior in electromechanical hybrid systems: From theory to experimental validation , 2012, 2012 American Control Conference (ACC).

[77]  Aaron D. Ames,et al.  Outputs of human walking for bipedal robotic controller design , 2012, 2012 American Control Conference (ACC).

[78]  Martial Hebert,et al.  Occlusion reasoning for object detection under arbitrary viewpoint , 2012, CVPR.

[79]  Aaron D. Ames,et al.  Motion primitives for human-inspired bipedal robotic locomotion: walking and stair climbing , 2012, 2012 IEEE International Conference on Robotics and Automation.

[80]  Vincent Lepetit,et al.  Gradient Response Maps for Real-Time Detection of Textureless Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[81]  Henrik I. Christensen,et al.  Robust 3D visual tracking using particle filtering on the special Euclidean group: A combined approach of keypoint and edge features , 2011, 2011 IEEE International Conference on Robotics and Automation.

[82]  Frank Dellaert,et al.  iSAM2: Incremental smoothing and mapping using the Bayes tree , 2012, Int. J. Robotics Res..

[83]  James Cain,et al.  Human-Inspired Bipedal Robotic Walking : From Theory to Experimental Implementation on AMBER , 2012 .

[84]  Christian M. Hubicki,et al.  RUNNING ON SOFT GROUND: SIMPLE, ENERGY-OPTIMAL DISTURBANCE REJECTION ∗ , 2012 .

[85]  Aaron D. Ames,et al.  From Formal Methods to Algorithmic Implementation of Human Inspired Control on Bipedal Robots , 2012, WAFR.

[86]  Fumitoshi Matsuno,et al.  Teleoperation system using past image records for legged robot , 2011, 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics.

[87]  David A. McAllester,et al.  Object Detection with Grammar Models , 2011, NIPS.

[88]  Jun Ueda,et al.  Design of an MRI compatible haptic interface , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[89]  Aaron D. Ames,et al.  Simulating prosthetic devices with human-inspired hybrid control , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[90]  Hannes G. Daepp Development of a multi-platform simulation for a pneumatically-actuated quadruped robot , 2011 .

[91]  Seyed A Safavynia,et al.  Muscle Synergies: Implications for Clinical Evaluation and Rehabilitation of Movement. , 2011, Topics in spinal cord injury rehabilitation.

[92]  Lena H Ting,et al.  Stability in a frontal plane model of balance requires coupled changes to postural configuration and neural feedback control. , 2011, Journal of neurophysiology.

[93]  Stephen Gould,et al.  Max-margin Learning for Lower Linear Envelope Potentials in Binary Markov Random Fields , 2011, ICML.

[94]  Daphne Koller,et al.  A segmentation-aware object detection model with occlusion handling , 2011, CVPR 2011.

[95]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[96]  Christian M. Hubicki Energy-Economical Heuristically Based Control of Compass Gait Walking on Stochastically Varying Terrain , 2011 .

[97]  Stacie A. Chvatal,et al.  Decomposing Muscle Activity in Motor TasksMethods and Interpretation , 2010 .

[98]  Jun Wang,et al.  A neurodynamic optimization approach to nonlinear model predictive control , 2010, 2010 IEEE International Conference on Systems, Man and Cybernetics.

[99]  Emilio Frazzoli,et al.  Steady-state cornering equilibria and stabilisation for a vehicle during extreme operating conditions , 2010 .

[100]  Carl E. Rasmussen,et al.  Sparse Spectrum Gaussian Process Regression , 2010, J. Mach. Learn. Res..

[101]  Richard R Neptune,et al.  Merging of healthy motor modules predicts reduced locomotor performance and muscle coordination complexity post-stroke. , 2010, Journal of neurophysiology.

[102]  Huihua Zhao,et al.  Dynamic Multi-Contact Bipedal Robotic Walking , 2010 .

[103]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[104]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[105]  Andrew Zisserman,et al.  Structured output regression for detection with partial truncation , 2009, NIPS.

[106]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[107]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[108]  Stefan Schaal,et al.  Biologically-inspired dynamical systems for movement generation: Automatic real-time goal adaptation and obstacle avoidance , 2009, 2009 IEEE International Conference on Robotics and Automation.

[109]  Maya Cakmak,et al.  Learning about objects with human teachers , 2009, 2009 4th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[110]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[111]  Stefan Schaal,et al.  Movement reproduction and obstacle avoidance with dynamic movement primitives and potential fields , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[112]  Jun Wang,et al.  Robust Model Predictive Control Using a Discrete-Time Recurrent Neural Network , 2008, ISNN.

[113]  Pushmeet Kohli,et al.  Robust Higher Order Potentials for Enforcing Label Consistency , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[114]  Jun Wang,et al.  Two neural network approaches to model predictive control , 2008, 2008 American Control Conference.

[115]  J E Scrivens,et al.  A robotic device for understanding neuromechanical interactions during standing balance control , 2008, Bioinspiration & biomimetics.

[116]  Jun Wang,et al.  Nonlinear model predictive control using a recurrent neural network , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[117]  Torrence D. J. Welch,et al.  A feedback model reproduces muscle activity during human postural responses to support-surface translations. , 2008, Journal of neurophysiology.

[118]  S. Schaal,et al.  Combining dynamic movement primitives and potential fields for online obstacle avoidance , 2008 .

[119]  L. Ting,et al.  Functional muscle synergies constrain force production during postural tasks. , 2008, Journal of biomechanics.

[120]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[121]  Lena H Ting,et al.  Neuromechanics of muscle synergies for posture and movement , 2007, Current Opinion in Neurobiology.

[122]  L. Ting,et al.  Muscle synergies characterizing human postural responses. , 2007, Journal of neurophysiology.

[123]  Keith W. Buffinton,et al.  Adaptive tabular pulse-width control of a prismatically jointed manipulator , 2007 .

[124]  Keith W. Buffinton,et al.  TABULAR PULSE-WIDTH CONTROL OF A TWO DEGREE-OF-FREEDOM MANIPULATOR , 2006 .

[125]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[126]  Vladimir Kolmogorov,et al.  An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[127]  Vladimir Kolmogorov,et al.  "GrabCut": interactive foreground extraction using iterated graph cuts , 2004, ACM Trans. Graph..

[128]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[129]  James W. Moore,et al.  Institute of Electrical and Electronics Engineers (IEEE) , 2002 .

[130]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[131]  Lakshmi N. Bairavasundaram Conference publications , 1999 .