Online human assisted and cooperative pose estimation of 2D cameras

Active and interactive method to deduct pose estimation of a fleet of robots.Pose estimation based interactive structure pose estimation.Cooperative Pose estimation when direct alignment is not possible. Autonomous robots performing cooperative tasks need to know the relative pose of the other robots in the fleet. Deducing these poses might be performed through structure from motion methods in the applications where there are no landmarks or GPS, for instance, in non-explored indoor environments. Structure from motion is a technique that deduces the pose of cameras only given only the 2D images. This technique relies on a first step that obtains a correspondence between salient points of images. For this reason, the weakness of this method is that poses cannot be estimated if a proper correspondence is not obtained due to low quality of the images or images that do not share enough salient points. We propose, for the first time, an interactive structure-from-motion method to deduce the pose of 2D cameras. Autonomous robots with embedded cameras have to stop when they cannot deduce their position because the structure-from-motion method fails. In these cases, a human interacts by simply mapping a pair of points in the robots' images. Performing this action the human imposes the correct correspondence between them. Then, the interactive structure from motion is capable of deducing the robots' lost positions and the fleet of robots can continue their high level task. From the practical point of view, the interactive method allows the whole system to achieve more complex tasks in more complex environments since the human interaction can be seen as a recovering or a reset process.

[1]  Alberto Sanfeliu,et al.  On the Graph Edit Distance Cost: Properties and Applications , 2012, Int. J. Pattern Recognit. Artif. Intell..

[2]  Anand Rangarajan,et al.  A new point matching algorithm for non-rigid registration , 2003, Comput. Vis. Image Underst..

[3]  Steven Gold,et al.  A Graduated Assignment Algorithm for Graph Matching , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  C Vollmar,et al.  Quantitative comparison of automatic and interactive methods for MRI-SPECT image registration of the brain based on 3-dimensional calculation of error. , 2000, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[6]  Hee-Hyol Lee,et al.  Cooperative behavior control of robot group using stress antibody allotment reward , 2013, Artificial Life and Robotics.

[7]  A. Ben Hamza,et al.  An information-theoretic method for multimodality medical image registration , 2012, Expert Syst. Appl..

[8]  Gonzalo Ferrer,et al.  Bayesian Human Motion Intentionality Prediction in urban environments , 2014, Pattern Recognit. Lett..

[9]  Anand Rangarajan,et al.  The Softassign Procrustes Matching Algorithm , 1997, IPMI.

[10]  Yong Luo,et al.  Group Sparse Multiview Patch Alignment Framework With View Consistency for Image Classification , 2014, IEEE Transactions on Image Processing.

[11]  Yeong Min Jang,et al.  Stereo-vision-based cooperative-vehicle positioning using OCC and neural networks , 2015 .

[12]  Francesc Serratosa,et al.  Cooperative pose estimation of a fleet of robots based on interactive points alignment , 2016, Expert Syst. Appl..

[13]  Andriy Myronenko,et al.  Point Set Registration: Coherent Point Drift , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Julie A. Shah,et al.  Challenges in Developing a Collaborative Robotic Assistant for Automotive Assembly Lines , 2015, HRI.

[15]  Francesc Serratosa,et al.  An interactive method for the image alignment problem based on partially supervised correspondence , 2015, Expert Syst. Appl..

[16]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[17]  Francesc Serratosa,et al.  Computation of graph edit distance: Reasoning about optimality and speed-up , 2015, Image Vis. Comput..

[18]  Thomas Martin Deserno,et al.  Feature description with SIFT, SURF, BRIEF, BRISK, or FREAK? A general question answered for bone age assessment , 2016, Comput. Biol. Medicine.

[19]  Francesc Serratosa,et al.  Smooth point-set registration using neighboring constraints , 2012, Pattern Recognit. Lett..

[20]  Francesc Serratosa,et al.  Speeding up Fast Bipartite Graph Matching Through a New Cost Matrix , 2015, Int. J. Pattern Recognit. Artif. Intell..

[21]  Francesc Serratosa,et al.  Interactive graph-matching using active query strategies , 2015, Pattern Recognit..

[22]  I. Jolliffe Principal Component Analysis , 2002 .

[23]  Francesc Moreno-Noguer,et al.  Efficient monocular pose estimation for complex 3D models , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Luo Jianxin,et al.  Survey of structure from motion , 2014, Proceedings of 2014 International Conference on Cloud Computing and Internet of Things.

[25]  Robin R. Murphy,et al.  Human-robot interactions during the robot-assisted urban search and rescue response at the World Trade Center , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[26]  U Pietrzyk,et al.  An interactive technique for three-dimensional image registration: validation for PET, SPECT, MRI and CT brain studies. , 1994, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[27]  Sinisa Todorovic,et al.  From contours to 3D object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[28]  Cynthia Breazeal,et al.  A Social Robot to Mitigate Stress, Anxiety, and Pain in Hospital Pediatric Care , 2015, HRI.

[29]  Francesc Serratosa,et al.  Fast computation of Bipartite graph matching , 2014, Pattern Recognit. Lett..

[30]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[31]  Pierre Vandergheynst,et al.  FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Ronen Basri,et al.  A Survey on Structure from Motion , 2017, ArXiv.

[33]  Alberto Sanfeliu,et al.  Cooperative social robots to accompany groups of people , 2012, Int. J. Robotics Res..

[34]  Jun Ota,et al.  Exploration path generation for multiple mobile robots using reaction-diffusion equation on a graph , 2004 .

[35]  Francesc Serratosa,et al.  A probabilistic integrated object recognition and tracking framework , 2009, Expert Syst. Appl..

[36]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[37]  Du Q. Huynh,et al.  Metrics for 3D Rotations: Comparison and Analysis , 2009, Journal of Mathematical Imaging and Vision.

[38]  Edwin R. Hancock,et al.  A unified framework for alignment and correspondence , 2003, Comput. Vis. Image Underst..

[39]  Gonzalo Ferrer,et al.  Robot Interactive Learning through Human Assistance , 2013, Multimodal Interaction in Image and Video Applications.

[40]  Tom Drummond,et al.  Faster and Better: A Machine Learning Approach to Corner Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Francesc Serratosa,et al.  A new graph matching method for point-set correspondence using the EM algorithm and Softassign , 2012, Comput. Vis. Image Underst..

[42]  Cecilia E. Garcia Cena,et al.  A cooperative multi-agent robotics system: Design and modelling , 2013, Expert Syst. Appl..

[43]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[44]  Paolo Napoletano,et al.  An interactive tool for manual, semi-automatic and automatic video annotation , 2015, Comput. Vis. Image Underst..

[45]  Zhengyou Zhang,et al.  Iterative point matching for registration of free-form curves and surfaces , 1994, International Journal of Computer Vision.

[46]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[47]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[48]  Bin Fang,et al.  A comparison of 3D shape retrieval methods based on a large-scale benchmark supporting multimodal queries , 2015, Comput. Vis. Image Underst..

[49]  Songsong Wu,et al.  Multi-view Intact Discriminant Space Learning for Image Classification , 2018, Neural Processing Letters.