A Frame-work assisting the Visually Impaired People: Common Object Detection and Pose Estimation in Surrounding Environment

Many assisting systems to detect interested-objects (or obstacles) for Visually Impaired People(VIPs) have been studied for a long time. However, the practical systems built-in the real environment are still very challenging due to generic object classes in highly cluttered scenes (or complicated background). In this paper, we propose an unique frame-work to detect and estimate the full model of the common objects in the daily living of the VIP’s activities. Not only resolving the question where is the object’s query, the proposed system provides its relevant information that is presented such as size, and safety direction for grasping on a flat surface. The pipelines combine a series of the point cloud representation, table plane detection, objects detection and the full model estimation via a robust estimator. In this frame-work, we tackle that recent advantages of deep learning (e.g., RCNN, YOLO) that could be an efficient way for the detection task, while the geometry-based approaches estimate full 3-D model. This scheme does not require separating (or segmenting) the interested objects from the background of the surrounding scenes. The proposed system is compared with other approaches as well as is evaluated on the real datasets collected in common scenes such as Kitchen or cafeteria room. In these evaluations, the proposed frame-work meets requirements of high accuracy, processing time, and suitability for VIPs. The evaluation datasets are made publicity available.

[1]  Michael A. Greenspan,et al.  3D Object Recognition by Surface Registration of Interest Segments , 2013, 2013 International Conference on 3D Vision.

[2]  Gary R. Bradski,et al.  Fast 3D recognition and pose using the Viewpoint Feature Histogram , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Yingli Tian,et al.  Finding objects for assisting blind people , 2013, Network Modeling Analysis in Health Informatics and Bioinformatics.

[4]  Jörg Franke,et al.  Blind path obstacle detector using smartphone camera and line laser emitter , 2016, 2016 1st International Conference on Technology and Innovation in Sports, Health and Wellbeing (TISHW).

[5]  Jörg Stückler,et al.  Shape-Primitive Based Object Recognition and Grasping , 2012, ROBOTIK.

[6]  Thi-Lan Le,et al.  Acquiring qualified samples for RANSAC using geometrical constraints , 2018, Pattern Recognit. Lett..

[7]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Christian A. Mueller,et al.  Hierarchical graph-based discovery of non-primitive-shaped objects in unstructured environments , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  M. R. Adame,et al.  A Navigation Aid for Blind People with Walking Disabilities , 2014 .

[11]  Qingming Zhana,et al.  Color-based Segmentation of Point Clouds , 2009 .

[12]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Michael G. Strintzis,et al.  Object recognition for the blind , 2005, 2005 13th European Signal Processing Conference.

[14]  3D Object Localization using Superquadric Models with a Kinect Sensor , 2011 .

[15]  Sven Behnke,et al.  Real-Time Plane Segmentation Using RGB-D Cameras , 2012, RoboCup.

[16]  Dirk Kraft,et al.  Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[17]  Rainer Stiefelhagen,et al.  An Assistive Vision System for the Blind That Helps Find Lost Things , 2012, ICCHP.

[18]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[19]  Wilfried Philips,et al.  Real-Time Table Plane Detection Using Accelerometer Information And Organized Point Cloud Data From Kinect Sensor , 2017 .

[20]  Reinhard Klein,et al.  Efficient RANSAC for Point‐Cloud Shape Detection , 2007, Comput. Graph. Forum.

[21]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[22]  Computer Vision-Based Aid for the Visually Impaired Persons-A Survey And Proposing New Framework , 2014 .

[23]  Dieter Fox,et al.  Unsupervised feature learning for 3D scene labeling , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Dieter Fox,et al.  Object recognition with hierarchical kernel descriptors , 2011, CVPR 2011.