Seeing the trees before the forest [natural object detection]

In this paper, we propose an algorithm that detects and locates natural objects in an outdoor environment using local descriptors. Interest points inside images are detected with a difference of Gaussian (DoG) filter and are then represented using scale invariant local descriptors. Our algorithm learns objects in a weakly supervised manner by clustering similar descriptors together and using those clusters as object classifiers. The intent is to identify stable objects to be used as landmarks for simultaneous localization and mapping (SLAM) of robots. The robot milieu is first identified using a fast environment recognition algorithm and then landmarks are suggested for SLAM that are appropriate for that environment. In our experiments we test our theory on the detection of trees that belong to the plantae pinophyta (pine family). Initial results show that out of 200 test images, our classification yields 85 correct positives, 15 false negatives, 73 correct negatives and 27 false positives.

[1]  Paul A. Viola,et al.  Boosting Image Retrieval , 2004, International Journal of Computer Vision.

[2]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[3]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[4]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[5]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[7]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[9]  Daniel C. Asmar,et al.  SmartSLAM: localization and mapping across multi-environments , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[10]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  Cordelia Schmid,et al.  Indexing Based on Scale Invariant Interest Points , 2001, ICCV.

[12]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13]  P. Matsakis,et al.  The use of force histograms for affine-invariant relative position description , 2004 .

[14]  C. Schmid,et al.  Object Class Recognition Using Discriminative Local Features , 2005 .