论文信息 - Incorporating Background Invariance into Feature-Based Object Recognition

Incorporating Background Invariance into Feature-Based Object Recognition

Current feature-based object recognition methods use information derived from local image patches. For robustness, features are engineered for invariance to various transformations, such as rotation, scaling, or affine warping. When patches overlap object boundaries, however, errors in both detection and matching will almost certainly occur due to inclusion of unwanted background pixels. This is common in real images, which often contain significant background clutter, objects which are not heavily textured, or objects which occupy a relatively small portion of the image. We suggest improvements to the popular scale invariant feature transform (SIFT) which incorporate local object boundary information. The resulting feature detection and descriptor creation processes are invariant to changes in background. We call this method the background and scale invariant feature transform (BSIFT). We demonstrate BSIFT's superior performance in feature detection and matching on synthetic and natural images.

Martial Hebert | Andrew N. Stein | M. Hebert

[1] Sameer A. Nene,et al. Columbia Object Image Library (COIL100) , 1996 .

[2] David J. Fleet,et al. Probabilistic Detection and Tracking of Motion Boundaries , 2000, International Journal of Computer Vision.

[3] David G. Lowe,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[4] Jitendra Malik,et al. Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Michael Brady,et al. Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[6] Jianbo Shi,et al. Object-Specific Figure-Ground Segregation , 2003, CVPR.

[7] Cordelia Schmid,et al. Shape recognition with edge-based features , 2003, BMVC.

[8] Tony Lindeberg,et al. Shape-Adapted Smoothing in Estimation of 3-D Depth Cues from Affine Distortions of Local 2-D Brightness Structure , 1994, ECCV.

[9] Brian V. Funt,et al. A data set for color research , 2002 .

[10] Michel Vidal-Naquet,et al. Visual features of intermediate complexity and their use in classification , 2002, Nature Neuroscience.

[11] Guillermo Sapiro,et al. Robust anisotropic diffusion , 1998, IEEE Trans. Image Process..

[12] James A. Sethian,et al. Level Set Methods and Fast Marching Methods , 1999 .

[13] Marko Subasic,et al. Level Set Methods and Fast Marching Methods , 2003 .

[14] Tony Lindeberg,et al. Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[15] Adam Baumberg,et al. Reliable feature matching across widely separated views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[16] Carlo Tomasi,et al. Depth Discontinuities by Pixel-to-Pixel Stereo , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[17] C. Schmid,et al. Scale-invariant shape features for recognition of object categories , 2004, CVPR 2004.

[18] Peter Nordlund,et al. Figure-ground segmentation using multiple cues , 1998 .

[19] Bernt Schiele,et al. Analyzing appearance and contour based methods for object categorization , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[20] Wenyuan Xu,et al. Behavioral analysis of anisotropic diffusion in image processing , 1996, IEEE Trans. Image Process..

[21] Max A. Viergever,et al. Scale and the differential structure of images , 1992, Image Vis. Comput..

[22] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23] Cordelia Schmid,et al. Matching images with different resolutions , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[24] Cordelia Schmid,et al. An Affine Invariant Interest Point Detector , 2002, ECCV.

[25] Martial Hebert,et al. Shape-based recognition of wiry objects , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[26] Martial Hebert,et al. Shape-based recognition of wiry objects , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27] C. Schmid,et al. Indexing based on scale invariant interest points , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[28] Cordelia Schmid,et al. Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[29] Max A. Viergever,et al. The Gaussian scale-space paradigm and the multiscale local jet , 1996, International Journal of Computer Vision.