Visual Subterranean Junction Recognition for MAVs based on Convolutional Neural Networks

This article proposes a novel visual framework for detecting tunnel crossings/junctions in underground mine areas towards the autonomous navigation of Micro Aerial Vehicles (MAVs). Usually mine environments have complex geometries, including multiple crossings with different tunnels that challenge the autonomous planning of aerial robots. Towards the envisioned scenario of autonomous or semi-autonomous deployment of MAVs with limited Line-of-Sight in subterranean environments, the proposed module acknowledges the existence of junctions by providing crucial information to the autonomy and planning layers of the aerial vehicle. The capability for a junction detection is necessary in the majority of mission scenarios, including unknown area exploration, known area inspection and robot homing missions. The proposed novel method has the ability to feed the image stream from the vehicles on-board forward facing camera in a Convolutional Neural Network (CNN) classification architecture, expressed in four categories: 1) left junction, 2) right junction, 3) left & right junction, and 4) no junction in the local vicinity of the vehicle. The core contribution stems for the incorporation of AlexNet in a transfer learning scheme for detecting multiple branches in a subterranean environment. The validity of the proposed method has been validated through multiple data-sets collected from real underground environments, demonstrating the performance and merits of the proposed module.

[1]  Roland Siegwart,et al.  Receding Horizon "Next-Best-View" Planner for 3D Exploration , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[3]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Christoforos Kanellakis,et al.  Survey on Computer Vision for UAVs: Current Developments and Trends , 2017, Journal of Intelligent & Robotic Systems.

[5]  Sina Sharif Mansouri,et al.  Towards MAV Navigation in Underground Mine Using Deep Learning , 2018, 2018 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[6]  K. Madhava Krishna,et al.  Towards View-Invariant Intersection Recognition from Videos using Deep Network Ensembles , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[8]  Jordan J. Bird,et al.  A Study on CNN Transfer Learning for Image Classification , 2018, UKCI.

[9]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yanning Zhang,et al.  Convolutional Neural Network-Based Robot Navigation Using Uncalibrated Spherical Images , 2017, Sensors.

[11]  Haizhong Qian,et al.  Applying CNN Classifier to Road Interchange Classification , 2018, Geoinformatics.

[12]  Emil Fresk,et al.  Autonomous visual inspection of large-scale infrastructures using aerial robots , 2019, ArXiv.

[13]  Javier Ruiz-del-Solar,et al.  Chilean underground mine dataset , 2017, Int. J. Robotics Res..

[14]  Sina Sharif Mansouri,et al.  Towards Autonomous Surveying of Underground Mine Using MAVs , 2018, Advances in Service and Industrial Robotics.

[15]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[16]  Nikolai Smolyanskiy,et al.  Toward low-flying autonomous MAV trail navigation using deep neural networks for environmental awareness , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Ashutosh Natraj,et al.  A real-time monocular vision-based frontal obstacle detection and avoidance for low cost UAVs in GPS denied environment , 2014, 2014 IEEE International Conference on Aerospace Electronics and Remote Sensing Technology.

[18]  Shipeng Xie,et al.  Alcoholism Identification Based on an AlexNet Transfer Learning Model , 2019, Front. Psychiatry.

[19]  John C. Platt,et al.  Learning Discriminative Projections for Text Similarity Measures , 2011, CoNLL.

[20]  Hyongsuk Kim,et al.  Accurate Natural Trail Detection Using a Combination of a Deep Neural Network and Dynamic Programming , 2018, Sensors.

[21]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[22]  Massimo Bertozzi,et al.  Enabling Computer Vision-Based Autonomous Navigation for Unmanned Aerial Vehicles in Cluttered GPS-Denied Environments , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[23]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Alexei A. Efros,et al.  What makes ImageNet good for transfer learning? , 2016, ArXiv.

[25]  Phil Kim,et al.  MATLAB Deep Learning , 2017, Apress.

[26]  Gavriel Salomon,et al.  T RANSFER OF LEARNING , 1992 .

[27]  S. N. Omkar,et al.  JuncNet: A Deep Neural Network for Road Junction Disambiguation for Autonomous Vehicles , 2018, ArXiv.

[28]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[29]  K. Madhava Krishna,et al.  Have i reached the intersection: A deep learning-based approach for intersection detection from monocular cameras , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).