A dataset for the recognition of obstacles on blind sidewalk

Computer-vision technology for assisting the navigation of visually impaired people has advanced considerably in recent years. A number of researchers have studied related problems, including indoor and outdoor object detection for blind people. However, some existing methods and datasets still have shortcomings. Our work proposes a dataset (OD) to support the detection and recognition of outdoor obstacles that blind people encounter on blind sidewalks. We classify a set of common obstacles, train state-of-the-art detectors on the dataset to obtain detection models, and then analyze and compare these models in detail. The results show that the proposed dataset is very challenging. The OD and the detection models are available at https://github.com/TW0521/Obstacle-Dataset.git.
