Model-based extraction of image area descriptors using a multi-scale attention operator

A model-based method for transformation-invariant area descriptor extraction is proposed in this paper in the context of object recognition and image matching. Local image descriptors are extracted in salient circular fragments of variable size, which indicate image locations with high intensity contrast, regional homogeneity and shape saliency. Three different types of descriptors - pose, intensity, and area shape - are extracted to form a single descriptor vector. The pose and intensity descriptors are made relational and normalized in order to achieve invariance to image similarity transformations and affine intensity changes. The method requires no image segmentation since the feature points are time-efficiently extracted in a multi-scale manner by analyzing circular areas of various sizes.

[1]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  M. Fatih Demirci,et al.  Object Recognition as Many-to-Many Feature Matching , 2006, International Journal of Computer Vision.

[4]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Roman M. Palenichka,et al.  Automatic Extraction of Control Points for the Registration of Optical Satellite and LiDAR Images , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[8]  Tomaso A. Poggio,et al.  Image Representations and Feature Selection for Multimedia Database Search , 2003, IEEE Trans. Knowl. Data Eng..