Toward a computer vision-based wayfinding aid for blind persons to access unfamiliar indoor environments

Independent travel is a well-known challenge for blind and visually impaired persons. In this paper, we propose a proof-of-concept computer vision-based wayfinding aid for blind people to independently access unfamiliar indoor environments. In order to find different rooms (e.g. an office, a laboratory, or a bathroom) and other building amenities (e.g. an exit or an elevator), we incorporate object detection with text recognition. First, we develop a robust and efficient algorithm to detect doors, elevators, and cabinets based on their general geometric shape, by combining edges and corners. The algorithm is general enough to handle large intra-class variations of objects with different appearances among different indoor environments, as well as small inter-class differences between different objects such as doors and door-like cabinets. Next, to distinguish intra-class objects (e.g. an office door from a bathroom door), we extract and recognize text information associated with the detected objects. For text recognition, we first extract text regions from signs with multiple colors and possibly complex backgrounds, and then apply character localization and topological analysis to filter out background interference. The extracted text is recognized using off-the-shelf optical character recognition software products. The object type, orientation, location, and text information are presented to the blind traveler as speech.

[1]  H. Tran,et al.  A Novel Approach for Text Detection in Images Using Structural Features , 2005, ICAPR.

[2]  N.P. Papanikolopoulos,et al.  Real-time door detection in cluttered environments , 2000, Proceedings of the 2000 IEEE International Symposium on Intelligent Control. Held jointly with the 8th IEEE Mediterranean Conference on Control and Automation (Cat. No.00CH37147).

[3]  Gabriel Kreiman Biological object recognition , 2008, Scholarpedia.

[4]  P. Dubey Edge Based Text Detection for Multi-purpose Application , 2006, 2006 8th international Conference on Signal Processing.

[5]  Roberto Manduchi,et al.  Search Strategies of Visually Impaired Persons Using a Camera Phone Wayfinding System , 2008, ICCHP.

[6]  A. Torralba,et al.  The role of context in object recognition , 2007, Trends in Cognitive Sciences.

[7]  Qiao Liu,et al.  Text localization in spam image using edge features , 2008, 2008 International Conference on Communications, Circuits and Systems.

[9]  Sebastian Thrun,et al.  Detecting and modeling doors with mobile robots , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[10]  25 Blind Navigation and the Role of Technology , 2008 .

[11]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[12]  Barry T. Thomas,et al.  Wearable Mobility Aid for Low Vision Using Scene Classification in a Markov Random Field Model Framework , 2003, Int. J. Hum. Comput. Interact..

[13]  Nelson H. C. Yung,et al.  Corner detector based on global and local curvature properties , 2008 .

[14]  Zhichao Chen,et al.  Visual detection of lintel-occluded doors from a single image , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Chunheng Wang,et al.  Text detection in images based on unsupervised classification of edge-based features , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[16]  Alexei A. Efros,et al.  An empirical study of context in object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[18]  G. Medioni,et al.  Piecewise Planar Modeling for Step Detection using Stereo Vision , 2008 .

[19]  Sumi Helal,et al.  The Engineering Handbook of Smart Technology for Aging, Disability, and Independence , 2008 .

[20]  B. Silverstone The Lighthouse handbook on vision impairment and visionrehabilitation. , 2000 .

[21]  R. Dornbusch,et al.  New Directions for Research , 1988 .

[22]  Nikos A. Nikolaou,et al.  Color reduction for complex document images , 2009, Int. J. Imaging Syst. Technol..

[23]  Sanghoon Sull,et al.  An Efficient Method for Text Detection in Video Based on Stroke Width Similarity , 2007, ACCV.

[24]  Palaiahnakote Shivakumara,et al.  An Efficient Edge Based Technique for Text Detection in Video Frames , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[25]  Yong Wang,et al.  Machine Vision and Applications , 2013 .

[26]  Cheng Chen,et al.  Door detection via signage context-based Hierarchical Compositional Model , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[27]  James M. Coughlan,et al.  Crosswatch: A Camera Phone System for Orienting Visually Impaired Pedestrians at Traffic Intersections , 2008, ICCHP.

[28]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Lucas Paletta,et al.  Context Based Object Detection from Video , 2003, ICVS.

[30]  Jiebo Luo,et al.  Natural object detection in outdoor scenes based on probabilistic spatial context models , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[31]  Palaiahnakote Shivakumara,et al.  A Laplacian Method for Video Text Detection , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[32]  Nikolaos G. Bourbakis,et al.  Wearable Obstacle Avoidance Electronic Travel Aids for Blind: A Survey , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[34]  Yingli Tian,et al.  Improving Computer Vision-Based Indoor Wayfinding for Blind Persons with Context Information , 2010, ICCHP.

[35]  Jana Kosecka,et al.  Visual door detection integrating appearance and shape cues , 2008, Robotics Auton. Syst..

[36]  Larry S. Davis,et al.  A video based interface to textual information for the visually impaired , 2002, Proceedings. Fourth IEEE International Conference on Multimodal Interfaces.

[37]  William Labov,et al.  New Directions in Research , 2003 .

[38]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[39]  Edward K. Wong,et al.  A new robust algorithm for video text extraction , 2003, Pattern Recognit..

[40]  Ramakant Nevatia,et al.  A method for recognition and localization of generic objects for indoor navigation , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[41]  Xiaodong Yang,et al.  Robust door detection in unfamiliar environments by combining edge and corner features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[42]  Oliver Bittel,et al.  Real-Time Door Detection Based on AdaBoost Learning Algorithm , 2009, Eurobot Conference.

[43]  Rafael Mu,et al.  Door-detection using computer vision and fuzzy logic , 2003 .

[44]  James M. Coughlan,et al.  Grouping Using Factor Graphs: An Approach for Finding Text with a Camera Phone , 2007, GbRPR.

[45]  J. Kumar,et al.  Font and Background Color Independent Text , 2007 .

[46]  Youngsu Moon,et al.  Text segmentation based on stroke filter , 2006, MM '06.

[47]  Xiaodong Yang,et al.  Computer Vision-Based Door Detection for Accessibility of Unfamiliar Environments to Blind Persons , 2010, ICCHP.