A Compressive Sensing Approach to Describe Indoor Scenes for Blind People

This paper introduces a new portable camera-based method for helping blind people to recognize indoor objects. Unlike state-of-the-art techniques, which typically perform the recognition task by limiting it to a single predefined class of objects, we propose here a completely different alternative scheme, defined as coarse description. It aims at expanding the recognition task to multiple objects and, at the same time, keeping the processing time under control by sacrificing some information details. The benefit is to increment the awareness and the perception of a blind person to his direct contextual environment. The coarse description issue is addressed via two image multilabeling strategies which differ in the way image similarity is computed. The first one makes use of the Euclidean distance measure, while the second one relies on a semantic similarity measure modeled by means of Gaussian process estimation. To achieve fast computation capability, both strategies rely on a compact image representation based on compressive sensing. The proposed methodology was assessed on two indoor datasets representing different indoor environments. Encouraging results were achieved in terms of both accuracy and processing time.

[1]  Sazali Yaacob,et al.  Stereopsis method for visually impaired to identify obstacles based on distance , 2004, Third International Conference on Image and Graphics (ICIG'04).

[2]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[3]  Cordelia Schmid,et al.  Object Recognition by Integrating Multiple Image Segmentations , 2008, ECCV.

[4]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[5]  Wei Hu,et al.  Image inpainting via sparse representation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  L. Dandona,et al.  Revision of visual impairment definitions in the International Statistical Classification of Diseases , 2006, BMC medicine.

[7]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.

[8]  Gérard G. Medioni,et al.  Robot vision for the visually impaired , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[9]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[10]  Xiaodong Yang,et al.  Robust door detection in unfamiliar environments by combining edge and corner features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[11]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[12]  Rama Chellappa,et al.  Sparse Representations, Compressive Sensing and dictionaries for pattern recognition , 2011, The First Asian Conference on Pattern Recognition.

[13]  David Barber,et al.  Bayesian Classification With Gaussian Processes , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Ender Tekin,et al.  An algorithm enabling blind users to find and read barcodes , 2009, 2009 Workshop on Applications of Computer Vision (WACV).

[15]  Farid Melgani,et al.  Missing-Area Reconstruction in Multispectral Images Under a Compressive Sensing Perspective , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Sterling C. Johnson,et al.  Reduced hippocampal activation during episodic encoding in middle-aged individuals at genetic risk of Alzheimer's Disease: a cross-sectional study , 2006, BMC medicine.

[17]  K. Karacs,et al.  Advanced crosswalk detection for the Bionic Eyeglass , 2010, 2010 12th International Workshop on Cellular Nanoscale Networks and their Applications (CNNA 2010).

[18]  Xiaodong Yang,et al.  Robust and Effective Component-Based Banknote Recognition for the Blind , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[19]  Julio Jacobo-Berlles,et al.  Face recognition on partially occluded images using compressed sensing , 2014, Pattern Recognit. Lett..

[20]  Bernt Schiele,et al.  Multiple Object Class Detection with a Generative Model , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Iwan Ulrich,et al.  The GuideCane-applying mobile robot technologies to assist the visually impaired , 2001, IEEE Trans. Syst. Man Cybern. Part A.

[22]  Pedro Pinho,et al.  Indoor guidance system for the blind and the visually impaired , 2012 .

[23]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[24]  Shrinivas J. Pundlik,et al.  Collision Detection for Visually Impaired from a Body-Mounted Camera , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[25]  Andrew Zisserman,et al.  MLESAC: A New Robust Estimator with Application to Estimating Image Geometry , 2000, Comput. Vis. Image Underst..

[26]  Guillermo Sapiro,et al.  Sparse Representation for Computer Vision and Pattern Recognition , 2010, Proceedings of the IEEE.

[27]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[28]  René Vidal,et al.  Motion Segmentation in the Presence of Outlying, Incomplete, or Corrupted Trajectories , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Yingli Tian,et al.  A primary travelling assistant system of bus detection and recognition for visually impaired people , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[30]  Jean-Luc Starck,et al.  Sparse Solution of Underdetermined Systems of Linear Equations by Stagewise Orthogonal Matching Pursuit , 2012, IEEE Transactions on Information Theory.

[31]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[32]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[34]  Hermann Ney,et al.  Local Representations for Multi-object Recognition , 2003, DAGM-Symposium.

[35]  Luc Van Gool,et al.  Scalable multi-class object detection , 2011, CVPR 2011.

[36]  Wai Ho Li,et al.  Plane-based detection of staircases using inverse depth , 2012, ICRA 2012.

[37]  R. Kowalik,et al.  An ultrasonic obstacle detector based on phase beamforming principles , 2006, IEEE Sensors Journal.

[38]  Shuihua Wang,et al.  Camera-Based Signage Detection and Recognition for Blind Persons , 2012, ICCHP.

[39]  Farid Melgani,et al.  Gaussian Process Approach to Remote Sensing Image Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[40]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[41]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[42]  Shraga Shoval,et al.  Auditory guidance with the Navbelt-a computerized travel aid for the blind , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[43]  Trevor Darrell,et al.  Transfer learning for image classification with sparse prototype representations , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[44]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[45]  Diego López-de-Ipiña,et al.  BlindShopping: Enabling Accessible Shopping for Visually Impaired People through Mobile Technologies , 2011, ICOST.