Detection, localisation and tracking of pallets using machine learning techniques and 2D range data

The problem of autonomous transportation in industrial scenarios is receiving a renewed interest due to the way it can revolutionise internal logistics, especially in unstructured environments. This paper presents a novel architecture allowing a robot to detect, localise, and track (possibly multiple) pallets using machine learning techniques based on an on-board 2D laser rangefinder only. The architecture is composed of two main components: the first stage is a pallet detector employing a Faster Region-Based Convolutional Neural Network (Faster R-CNN) detector cascaded with a CNN-based classifier; the second stage is a Kalman filter for localising and tracking detected pallets, which we also use to defer commitment to a pallet detected in the first stage until sufficient confidence has been acquired via a sequential data acquisition process. For fine-tuning the CNNs, the architecture has been systematically evaluated using a real-world dataset containing 340 labelled 2D scans, which have been made freely available in an online repository. Detection performance has been assessed on the basis of the average accuracy over k -fold cross-validation, and it scored 99.58% in our tests. Concerning pallet localisation and tracking, experiments have been performed in a scenario where the robot is approaching the pallet to fork. Although data have been originally acquired by considering only one pallet as per specification of the use case we consider, artificial data have been generated as well to mimic the presence of multiple pallets in the robot workspace. Our experimental results confirm that the system is capable of identifying, localising and tracking pallets with a high success rate while being robust to false positives.

[1]  Bernardo Wagner,et al.  Variable Pallet Pick-Up for Automatic Guided Vehicles in Industrial Environments , 2006, 2006 IEEE Conference on Emerging Technologies and Factory Automation.

[2]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[3]  Raúl Rojas,et al.  Kalman filter for vision tracking , 2005 .

[4]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Clint Heyer Human-robot interaction and future industrial robotics applications , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Fulvio Mastrogiovanni,et al.  Context assessment strategies for Ubiquitous Robots , 2009, 2009 IEEE International Conference on Robotics and Automation.

[7]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[8]  Ieee Staff,et al.  2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) , 2013 .

[9]  Alonzo Kelly,et al.  Model based object pose refinement for terrestrial and space autonomy , 2000 .

[10]  Roger V. Bostelman,et al.  Visualization of pallets , 2006, SPIE Optics East.

[11]  Arthur Daniel Costea,et al.  Improved autonomous load handling with stereo cameras , 2015, 2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP).

[12]  Alexander Verl,et al.  Cooperation of human and machines in assembly lines , 2009 .

[13]  Weidong Zhao,et al.  Pallet Recognition and Localization Method for Vision Guided Forklift , 2012, 2012 8th International Conference on Wireless Communications, Networking and Mobile Computing.

[14]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Michael J. Seelinger,et al.  Automatic Pallet Engagment by a Vision Guided Forklift , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[16]  Martial Hebert,et al.  Outdoor scene analysis using range data , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[17]  Chih-Hsien Hsia,et al.  A computer vision assisted system for autonomous forklift vehicles in real factory environment , 2017, Multimedia Tools and Applications.

[18]  Jouni Mattila,et al.  A multistage controller with smooth switching for Autonomous Pallet Picking , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Korea Maritime,et al.  Development of Pallet Recognition System using Kinect Camera , 2014, MUE 2014.

[20]  Giovanni Garibotto,et al.  Service robotics in logistic automation: ROBOLIFT: vision based autonomous navigation of a conventional fork-lift for pallet handling , 1997, 1997 8th International Conference on Advanced Robotics. Proceedings. ICAR'97.

[21]  Fulvio Mastrogiovanni,et al.  How the Location of the Range Sensor Affects EKF-based Localization , 2012, J. Intell. Robotic Syst..

[22]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[23]  Reinhard Koch,et al.  A Comparison of PMD-Cameras and Stereo-Vision for the Task of Surface Reconstruction using Patchlets , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  L. Baglivo,et al.  An object localization and reaching method for wheeled mobile robots using laser rangefinder , 2008, 2008 4th International IEEE Conference Intelligent Systems.

[25]  Giovanni Garibotto,et al.  ROBOLIFT: a vision guided autonomous fork-lift for pallet handling , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[26]  Francesco Biral,et al.  Autonomous pallet localization and picking for industrial forklifts: a robust range and look method , 2011 .

[27]  Emilio Frazzoli,et al.  Closed-loop pallet manipulation in unstructured environments , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  Sergiu Nedevschi,et al.  Robust Pallet Detection for Automated Logistics Operations , 2016, VISIGRAPP.

[29]  Guang-zhao Cui,et al.  A robust autonomous mobile forklift pallet recognition , 2010, 2010 2nd International Asia Conference on Informatics in Control, Automation and Robotics (CAR 2010).

[30]  Anil K. Jain,et al.  Model-based classification of quadric surfaces , 1993 .

[31]  Zhu Teng,et al.  Real-time lane detection by using multiple cues , 2010, ICCAS 2010.

[32]  Fulvio Mastrogiovanni,et al.  The more the better? A discussion about line features for self-localization , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Paulo Peixoto,et al.  DepthCN: Vehicle detection using 3D-LIDAR and ConvNet , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[34]  Jean-Philippe Thiran,et al.  Combining LiDAR space clustering and convolutional neural networks for pedestrian detection , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[35]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Sergiu Nedevschi,et al.  Vision-based autonomous load handling for automated guided vehicles , 2014, 2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP).

[37]  Cristiano Premebida Segmentation and Geometric Primitives Extraction from 2D Laser Range Data for Mobile Robot Applications , 2005 .

[38]  Juan Carlos Augusto,et al.  Handbook of Ambient Intelligence and Smart Environments , 2009, HAIS 2010.

[39]  Åke Wernersson,et al.  Docking to pallets with feedback from a sheet-of-light range camera , 2000, Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000) (Cat. No.00CH37113).

[40]  Rita Cucchiara,et al.  Focus based Feature Extraction for Pallets Recognition , 2000, BMVC.

[41]  F. Mastrogiovanni,et al.  A System for Hierarchical Planning in Service Mobile Robotics , 2004 .

[42]  Nasser Kehtarnavaz,et al.  Proceedings of SPIE - The International Society for Optical Engineering , 1991 .

[43]  GlassJames,et al.  A Situationally Aware Voice-commandable Robotic Forklift Working Alongside People in Unstructured Outdoor Environments , 2015 .

[44]  Pólo de Coimbra,et al.  Segmentation and Geometric Primitives Extraction from 2D Laser Range Data for Mobile Robot Applications , 2005 .

[45]  Jouni Mattila,et al.  Position-based visual servoing for pallet picking by an articulated-frame-steering hydraulic mobile machine , 2013, 2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM).

[46]  Raffaello D'Andrea,et al.  Guest editorial: A revolution in the warehouse: a retrospective on Kiva Systems and the grand challenges ahead , 2012, IEEE Trans Autom. Sci. Eng..

[47]  Fulvio Mastrogiovanni,et al.  A 2D laser rangefinder scans dataset of standard EUR pallets , 2018, Data in brief.

[48]  Anil K. Jain,et al.  Segmentation and Classification of Range Images , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Shijun Wang,et al.  Autonomous pallet localization and picking for industrial forklifts based on the line structured light , 2016, 2016 IEEE International Conference on Mechatronics and Automation.

[50]  Christian Prasse,et al.  Automated detection of euro pallet loads by interpreting PMD camera depth images , 2013, Logist. Res..

[51]  R. Khan,et al.  Sequential Tests of Statistical Hypotheses. , 1972 .

[52]  Yin Zhou,et al.  VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53]  Tullio Vernazza,et al.  Mobile Robots in Hospital Environments: an Installation Case Study , 2011, ECMR.

[54]  Stergios I. Roumeliotis,et al.  Weighted line fitting algorithms for mobile robot map building and efficient data representation , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[55]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[56]  Fulvio Mastrogiovanni,et al.  Semantic-Aware Real-Time Scheduling in Robotics , 2013, IEEE Transactions on Robotics.

[57]  Luke Fletcher,et al.  A Situationally Aware Voice‐commandable Robotic Forklift Working Alongside People in Unstructured Outdoor Environments , 2015, J. Field Robotics.

[58]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[59]  Fulvio Mastrogiovanni,et al.  Robust Navigation in an Unknown Environment With Minimal Sensing and Representation , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[60]  Alexander Kleiner,et al.  Self-localization in dynamic environments based on laser and vision data , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[61]  Fulvio Mastrogiovanni,et al.  Flexible human-robot cooperation models for assisted shop-floor tasks , 2017, ArXiv.

[62]  Bin Yang,et al.  Deep Continuous Fusion for Multi-sensor 3D Object Detection , 2018, ECCV.

[63]  Fulvio Mastrogiovanni,et al.  From Autonomous Robots to Artificial Ecosystems , 2010, Handbook of Ambient Intelligence and Smart Environments.

[64]  Sven Behnke,et al.  Fast Edge-Based Detection and Localization of Transport Boxes and Pallets in RGB-D Images for Mobile Robot Bin Picking , 2016 .

[65]  Jouni Mattila,et al.  A macro-micro controller for pallet picking by an articulated-frame-steering hydraulic mobile machine , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[66]  Minhwan Kim,et al.  Real-Time Positioning and Orienting of Pallets Based on Monocular Vision , 2008, 2008 20th IEEE International Conference on Tools with Artificial Intelligence.

[67]  Klaus C. J. Dietmayer,et al.  Towards Safe Autonomous Driving: Capture Uncertainty in the Deep Neural Network For Lidar 3D Vehicle Detection , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[68]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[69]  Jie Liu,et al.  Feature-to-Feature Based Laser Scan Matching for Pallet Recognition , 2010, 2010 International Conference on Measuring Technology and Mechatronics Automation.

[70]  Joachim Denzler,et al.  Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding , 2015, VISAPP.

[71]  Andrei Novikov,et al.  Uniform Asymptotic Expansion of Likelihood Ratio for Markov Dependent Observations , 2001 .