Lightweight Active Object Retrieval with Weak Classifiers

In the last few years, there has been a steadily growing interest in autonomous vehicles and robotic systems. While many of these agents are expected to have limited resources, these systems should be able to dynamically interact with other objects in their environment. We present an approach where lightweight sensory and processing techniques, requiring very limited memory and processing power, can be successfully applied to the task of object retrieval using sensors of different modalities. We use the Hough framework to fuse optical and orientation information of the different views of the objects. In the presented spatio-temporal perception technique, we apply active vision, where, based on the analysis of initial measurements, the direction of the next view is determined to increase the hit-rate of retrieval. The performance of the proposed methods is shown on three datasets loaded with heavy noise.

[1]  Ali Farhadi,et al.  XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks , 2016, ECCV.

[2]  László Czúni,et al.  View centered video-based object recognition for lightweight devices , 2016, 2016 International Conference on Systems, Signals and Image Processing (IWSSIP).

[3]  Richard I. Hartley,et al.  Optimised KD-trees for fast image descriptor matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Subhashis Banerjee,et al.  Active recognition through next view planning: a survey , 2004, Pattern Recognit..

[5]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[6]  Steve Paschall,et al.  Fast, lightweight autonomy through an unknown cluttered environment: Distribution statement: A — Approved for public release; distribution unlimited , 2017, 2017 IEEE Aerospace Conference.

[7]  M. Tarr,et al.  Visual Object Recognition , 1996, ISTCS.

[8]  Gaurav S. Sukhatme,et al.  Active multi-view object recognition: A unifying view on online feature selection and view planning , 2016, Robotics Auton. Syst..

[9]  F. Seco,et al.  A comparison of Pedestrian Dead-Reckoning algorithms using a low-cost MEMS IMU , 2009, 2009 IEEE International Symposium on Intelligent Signal Processing.

[10]  F. Nicolls,et al.  Active object recognition using vocabulary trees , 2013, 2013 IEEE Workshop on Robot Vision (WORV).

[11]  Giorgio Metta,et al.  Active object recognition on a humanoid robot , 2012, 2012 IEEE International Conference on Robotics and Automation.

[12]  Vittorio Murino,et al.  Weighted bag of visual words for object recognition , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[13]  Chris D. Nugent,et al.  A Lightweight Hierarchical Activity Recognition Framework Using Smartphone Sensors , 2014, Sensors.

[14]  Adrien Chan-Hon-Tong,et al.  Deeply Optimized Hough Transform: Application to Action Segmentation , 2013, ICIAP.

[15]  Mubarak Shah,et al.  A probabilistic framework for object recognition in video , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[16]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[17]  László Czúni,et al.  The use of IMUs for video object retrieval in lightweight devices , 2017, J. Vis. Commun. Image Represent..

[18]  László Czúni,et al.  Lightweight mobile object recognition , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[19]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[20]  Moshe Bar,et al.  Viewpoint Dependency in Visual Object Recognition Does Not Necessarily Imply Viewer-Centered Representation , 2001, Journal of Cognitive Neuroscience.

[21]  Lucas Paletta,et al.  Active Object Recognition in Parametric Eigenspace , 1998, BMVC.

[22]  Marco La Cascia,et al.  Video Object Recognition and Modeling by SIFT Matching Optimization , 2014, ICPRAM.

[23]  Luc Van Gool,et al.  Server-side object recognition and client-side object tracking for mobile augmented reality , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[24]  D. Ruta,et al.  An Overview of Classifier Fusion Methods , 2000 .

[25]  Yaser Sheikh,et al.  Model generation for video-based object recognition , 2006, MM '06.

[26]  Jiaya Jia,et al.  Image partial blur detection and classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Ognjen Arandjelovic,et al.  Matching objects across the textured-smooth continuum , 2013, ICRA 2012.

[28]  Yiannis S. Boutalis,et al.  Accurate Image Retrieval Based on Compact Composite Descriptors and Relevance Feedback Information , 2010, Int. J. Pattern Recognit. Artif. Intell..

[29]  Paramvir Bahl,et al.  Glimpse: Continuous, Real-Time Object Recognition on Mobile Devices , 2015, SenSys.

[30]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Pierre Magnan,et al.  Analysis and reduction of signal readout circuitry temporal noise in CMOS image sensors for low-light levels , 2000 .