Real-Time Passable Area Segmentation With Consumer RGB-D Cameras for the Visually Impaired

Passable area segmentation is an essential component of navigation assistance for the visually impaired people. Pixelwise semantic segmentation of red, green, and blue (RGB) images can be advanced by exploiting informative features from depth modality. Simple fusion, however, may not lead to satisfactory results as actual depth data are generally noisy, which may worsen the accuracy as the networks go deeper. In this article, we construct a wearable system with a novel lightweight multimodality collaborative network (LM-MCN), which is performing real-time passable area segmentation to assist people walking alone more safely. First, a normal map estimation (NME) module, which can convert dense depth into a normal map with high accuracy and efficiency, is introduced. Moreover, we propose a multimodality feature fusion module (M-MFFM), which can recalibrate and fuse the features from both RGB images and the inferred depth information for accurate passable area segmentation. Furthermore, we publish a new passable area segmentation dataset, named SensingAI-Easy-to-Walk (SETW), collected from sidewalk views in urban areas. Extensive experiments demonstrate that depth information is more appealing since it provides a geometric counterpart to RGB representation. In addition, the experimental results report that our method achieves a 96.50% MaxF with real-time inference speed of 50 FPS for KITTI road, which demonstrates the validity and superiority of LM-MCN.

[1]  R. Stiefelhagen,et al.  CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers , 2022, IEEE Transactions on Intelligent Transportation Systems.

[2]  Wenteng Liang,et al.  Fast Road Segmentation via Uncertainty-aware Symmetric Network , 2022, 2022 International Conference on Robotics and Automation (ICRA).

[3]  Rainer Stiefelhagen,et al.  Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance , 2021, IEEE Transactions on Intelligent Transportation Systems.

[4]  Ming Liu,et al.  Dynamic Fusion Module Evolves Drivable Area and Road Anomaly Detection: A Benchmark and Algorithms , 2021, IEEE Transactions on Cybernetics.

[5]  Rui Fan,et al.  Learning Collision-Free Space Detection From Stereo Images: Homography Matrix Brings Better Data Augmentation , 2020, IEEE/ASME Transactions on Mechatronics.

[6]  Rui Fan,et al.  SNE-RoadSeg+: Rethinking Depth-Normal Translation and Deep Supervision for Freespace Detection , 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Lu Zhang,et al.  Dynamic Crosswalk Scene Understanding for the Visually Impaired , 2021, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[8]  Jian Yang,et al.  A Cascaded LiDAR-Camera Fusion Network for Road Detection , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[9]  I. Pitas,et al.  Three-Filters-to-Normal: An Accurate and Ultrafast Surface Normal Estimator , 2020, IEEE Robotics and Automation Letters.

[10]  I-Hsuan Hsieh,et al.  Outdoor walking guide for the visually-impaired people based on semantic segmentation and depth map , 2020, 2020 International Conference on Pervasive Artificial Intelligence (ICPAI).

[11]  Yuxiang Sun,et al.  Applying Surface Normal Information in Drivable Area and Road Anomaly Detection for Ground Mobile Robots , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Rui Fan,et al.  SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection , 2020, ECCV.

[13]  Xiaokang Chen,et al.  Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation , 2020, ECCV.

[14]  I. Pitas,et al.  Pothole Detection Based on Disparity Transformation and Road Surface Modeling , 2019, IEEE Transactions on Image Processing.

[15]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Shuo Gu,et al.  Two-View Fusion based Convolutional Neural Network for Urban Road Detection , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Jee-Young Sun,et al.  Reverse and Boundary Attention Network for Road Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[18]  Kaite Xiang,et al.  Importance-Aware Semantic Segmentation with Efficient Pyramidal Context Network for Navigational Assistant Systems , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[19]  Andrzej Kos,et al.  Low Energy Precise Navigation System for the Blind with Infrared Sensors , 2019, 2019 MIXDES - 26th International Conference "Mixed Design of Integrated Circuits and Systems".

[20]  Xinxin Hu,et al.  ACNET: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[21]  Quoc V. Le,et al.  Searching for MobileNetV3 , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  Hui Kong,et al.  Road Detection through CRF based LiDAR-Camera Fusion , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[23]  Roberto Cipolla,et al.  Fast-SCNN: Fast Semantic Segmentation Network , 2019, BMVC.

[24]  Gang Yu,et al.  BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation , 2018, ECCV.

[25]  Luis Miguel Bergasa,et al.  Unifying terrain awareness through real-time semantic segmentation , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[26]  Hongdong Li,et al.  Semisupervised and Weakly Supervised Road Detection Based on Generative Adversarial Networks , 2018, IEEE Signal Processing Letters.

[27]  Cang Ye,et al.  3-D Object Recognition of a Robotic Navigation Aid for the Visually Impaired , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[28]  Eduardo Romera,et al.  ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation , 2018, IEEE Transactions on Intelligent Transportation Systems.

[29]  Zhe Chen,et al.  RBNet: A Deep Neural Network for Unified Road and Road Boundary Detection , 2017, ICONIP.

[30]  Lakshmi Sudha Kondaka,et al.  Ultrasonic navigation based blind aid for the visually impaired , 2017, 2017 IEEE International Conference on Power, Control, Signals and Instrumentation Engineering (ICPCSI).

[31]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Josechu J. Guerrero,et al.  Stairs detection with odometry-aided traversal from a wearable RGB-D camera , 2017, Comput. Vis. Image Underst..

[33]  Kangru Wang,et al.  Free Space Estimation on Nonflat Plane Based on V-Disparity , 2016, IEEE Signal Processing Letters.

[34]  Ali Hayek,et al.  Smart Assistive Navigation System for Blind and Visually Impaired Individuals , 2015, 2015 International Conference on Advances in Biomedical Engineering (ICABME).

[35]  Amy Hurst,et al.  "Pray before you step out": describing personal and situational blind navigation behaviors , 2013, ASSETS.

[36]  Jannik Fritsch,et al.  A new performance measure and evaluation benchmark for road detection algorithms , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[37]  Yann LeCun,et al.  Road Scene Segmentation from a Single Image , 2012, ECCV.

[38]  Daniel Cremers,et al.  B-Spline Modeling of Road Surfaces With an Application to Free-Space Estimation , 2009, IEEE Transactions on Intelligent Transportation Systems.

[39]  Frederick Lippman,et al.  Blindness and vision impairment , 2008 .

[40]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .