IRDC-Net: Lightweight Semantic Segmentation Network Based on Monocular Camera for Mobile Robot Navigation

Computer vision plays a significant role in mobile robot navigation due to the wealth of information extracted from digital images. Mobile robots localize and move to the intended destination based on the captured images. Due to the complexity of the environment, obstacle avoidance still requires a complex sensor system with a high computational efficiency requirement. This study offers a real-time solution to the problem of extracting corridor scenes from a single image using a lightweight semantic segmentation model integrating with the quantization technique to reduce the numerous training parameters and computational costs. The proposed model consists of an FCN as the decoder and MobilenetV2 as the decoder (with multi-scale fusion). This combination allows us to significantly minimize computation time while achieving high precision. Moreover, in this study, we also propose to use the Balance Cross-Entropy loss function to handle diverse datasets, especially those with class imbalances and to integrate a number of techniques, for example, the Adam optimizer and Gaussian filters, to enhance segmentation performance. The results demonstrate that our model can outperform baselines across different datasets. Moreover, when being applied to practical experiments with a real mobile robot, the proposed model’s performance is still consistent, supporting the optimal path planning, allowing the mobile robot to efficiently and effectively avoid the obstacles.

[1]  N. Bui,et al.  Obstacle Avoidance Strategy for Mobile Robot Based on Monocular Camera , 2023, Electronics.

[2]  Xin Kong,et al.  GADA-SegNet: gated attentive domain adaptation network for semantic segmentation of LiDAR point clouds , 2023, The Visual Computer.

[3]  Duc-Son Pham,et al.  A real-time semantic segmentation model using iteratively shared features in multiple sub-encoders , 2023, Pattern Recognit..

[4]  K. Aizawa,et al.  Comprehensive Comparisons of Uniform Quantization in Deep Image Compression , 2023, IEEE Access.

[5]  Diogo Carneiro,et al.  Two-Stage Framework for Faster Semantic Segmentation , 2023, Sensors.

[6]  M. Ahamed,et al.  Experimental Analysis of the Behavior of Mirror-like Objects in LiDAR-Based Robot Navigation , 2023, Applied Sciences.

[7]  Qizhi Tang,et al.  A Rapid Identification Technique of Moving Loads Based on MobileNetV2 and Transfer Learning , 2023, Buildings.

[8]  C. Sivaparthipan,et al.  Fully convolutional neural networks for LIDAR–camera fusion for pedestrian detection in autonomous vehicle , 2023, Multimedia Tools and Applications.

[9]  N. Bui,et al.  Multi-Scale Fully Convolutional Network-Based Semantic Segmentation for Mobile Robot Navigation , 2023, Electronics.

[10]  Jingfeng Guo,et al.  An Improved Adam Optimization Algorithm Combining Adaptive Coefficients and Composite Gradients Based on Randomized Block Coordinate Descent , 2023, Computational intelligence and neuroscience.

[11]  Hui Li,et al.  S-ResNet: An improved ResNet neural model capable of the identification of small insects , 2022, Frontiers in Plant Science.

[12]  M. Al-Khawaldeh,et al.  Obstacles Avoidance for Mobile Robot Using Type-2 Fuzzy Logic Controller , 2022, Robotics.

[13]  Yongjie Huang,et al.  Prediction of Prospecting Target Based on ResNet Convolutional Neural Network , 2022, Applied Sciences.

[14]  Yiqin Wang Remote sensing image semantic segmentation network based on ENet , 2022, The Journal of Engineering.

[15]  Z. Ibrahim,et al.  Convolutional Neural Network featuring VGG-16 Model for Glioma Classification , 2022, JOIV : International Journal on Informatics Visualization.

[16]  Qin Dong Path Planning Algorithm Based on Visual Image Feature Extraction for Mobile Robots , 2022, Mobile Information Systems.

[17]  Leonard Rusli,et al.  Vision-based vanishing point detection of autonomous navigation of mobile robot for outdoor applications , 2021, Journal of Mechatronics, Electrical Power, and Vehicular Technology.

[18]  Yang Wang,et al.  Encoder- and Decoder-Based Networks Using Multiscale Feature Fusion and Nonlocal Block for Remote Sensing Image Semantic Segmentation , 2021, IEEE Geoscience and Remote Sensing Letters.

[19]  T. Murata,et al.  Balanced Softmax Cross-Entropy for Incremental Learning , 2021, ICANN.

[20]  Jan Flusser,et al.  Handling Gaussian blur without deconvolution , 2020, Pattern Recognit..

[21]  Suresh Muknahallipatna,et al.  Deep learning algorithm for Gaussian noise removal from images , 2020, J. Electronic Imaging.

[22]  Yang Tang,et al.  Monocular depth estimation based on deep learning: An overview , 2020, Science China Technological Sciences.

[23]  Dan Wang,et al.  HRED-Net: High-Resolution Encoder-Decoder Network for Fine-Grained Image Segmentation , 2020, IEEE Access.

[24]  Antonio J. Plaza,et al.  Image Segmentation Using Deep Learning: A Survey , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Ronghua Shang,et al.  Densely Based Multi-Scale and Multi-Modal Fully Convolutional Networks for High-Resolution Remote-Sensing Image Semantic Segmentation , 2019, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[26]  Yi Yu,et al.  Dense U-net Based on Patch-Based Learning for Retinal Vessel Segmentation , 2019, Entropy.

[27]  Angga Rusdinar,et al.  Lidar Application for Mapping and Robot Navigation on Closed Environment , 2018, Journal of Measurements, Electronics, Communications, and Systems.

[28]  Lianru Gao,et al.  Semantic Labeling of High Resolution Aerial Imagery and LiDAR Data with Fine Segmentation Network , 2018, Remote. Sens..

[29]  Murat Lüy,et al.  Initial Results of Testing a Multilayer Laser Scanner in a Collision Avoidance System for Light Rail Vehicles , 2018 .

[30]  Andreas Geiger,et al.  Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes , 2017, International Journal of Computer Vision.

[31]  Éric Marchand,et al.  Pose Estimation for Augmented Reality: A Hands-On Survey , 2016, IEEE Transactions on Visualization and Computer Graphics.

[32]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  R. Chen,et al.  Vision-Aided Path Planning Using Low-Cost Gene Encoding for a Mobile Robot , 2022, Intelligent Automation & Soft Computing.

[34]  J. Zerubia,et al.  Semantic Segmentation of Remote-Sensing Images Through Fully Convolutional Neural Networks and Hierarchical Probabilistic Graphical Models , 2022, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Kirill Krinkin,et al.  Autonomous Wheels And Camera Calibration In Duckietown Project , 2021 .

[36]  Tamer Abukhalil,et al.  ROBOT NAVIGATION SYSTEM USING LASER AND A MONOCULAR CAMERA , 2020 .