Geometric and photometric invariant distinctive regions detection

In this paper, we present a number of enhancements to the Kadir/Brady salient region detector which result in a significant improvement in performance. The modifications we make include: stabilising the difference between consecutive scales when calculating the inter-scale saliency, a new sampling strategy using overlap of pixels, partial volume estimation and parzen windowing. Repeatability is used as the criterion for evaluating the performance of the algorithm. We observe the repeatability for distinctive regions selected from an image and from the same image after applying a particular transformation. The transformations we use include planar rotation, pixel translation, spatial scaling, and intensity shifts and scaling. Experimental results show that the average repeatability rate is improved from 46% to approximately 78% when all the enhancements are applied. We also compare our algorithm with other region detectors on a set of sequences of real images, and our detector outperforms most of the state of the art detectors.

[1]  Andrew Zisserman,et al.  An Affine Invariant Salient Region Detector , 2004, ECCV.

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  C. Schmid,et al.  Indexing based on scale invariant interest points , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[4]  Richard Szeliski,et al.  Finding People in Repeated Shots of the Same Scene , 2006, BMVC.

[5]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[6]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Cordelia Schmid,et al.  An Affine Invariant Interest Point Detector , 2002, ECCV.

[8]  Adam Baumberg,et al.  Reliable feature matching across widely separated views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[9]  Kingkarn Sookhanaphibarn,et al.  A new feature extractor invariant to intensity, rotation, and scaling of color images , 2006, Inf. Sci..

[10]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[11]  Luc Van Gool,et al.  Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[12]  Pietro Perona,et al.  A sparse object category model for efficient learning and exhaustive recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Andrew Zisserman,et al.  Video data mining using configurations of viewpoint invariant regions , 2004, CVPR 2004.

[14]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[15]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  Bryan S. Morse,et al.  Multiscale image registration using scale trace correlation , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).