It is safe to say that computer vision holds great potential for providing a broad range of benefits to the blind in the foreseeable future. But despite the rapid advances in computer hardware and vision algorithms, robust, self-contained functional systems that can be used by the blind for ‘identify-and-locate’ tasks are not yet available. This paper describes the challenges encountered and experience gained in building such a functional ‘identify-and-locate’ system by a group of undergraduates in the Gemstone Program at the University of Maryland, College Park. The greatest resistance to getting this task off the starting block was the need to simultaneously tackle several technical challenges requiring a range of expertise. These included developing a robust and real-time computer vision algorithm, a voice recognition and speech output system, a directional feedback interface, and the means to integrate these components into a single functional unit. With access to a pool of students with differing skills and backgrounds, we overcame the initial resistance. The bulk of the project was completed over a one year period, resulting in a prototype system that the blind can use in controlled environments to reliably identify and locate objects and signs within seconds.
[1]
Robert C. Bolles,et al.
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
,
1981,
CACM.
[2]
H. Steven Colburn,et al.
Role of spectral detail in sound-source localization
,
1998,
Nature.
[3]
Bernhard P. Wrobel,et al.
Multiple View Geometry in Computer Vision
,
2001
.
[4]
G LoweDavid,et al.
Distinctive Image Features from Scale-Invariant Keypoints
,
2004
.
[5]
Hanan Samet,et al.
Foundations of Multidimensional and Metric Data Structures (The Morgan Kaufmann Series in Computer Graphics and Geometric Modeling)
,
2005
.
[6]
Roberto Manduchi,et al.
Functional Assessment of a Camera Phone-Based Wayfinding System Operated by Blind Users
,
2007
.
[7]
David Nistér,et al.
Scalable Recognition with a Vocabulary Tree
,
2006,
2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[8]
Roberto Manduchi,et al.
Cell Phone-based Wayfinding for the Visually Impaired
,
2006
.