A Wavelet-Based Approach to Image Feature Stability Assessment

In this paper, we present a novel method for assessing image-feature stability. The method hinges on applying the discrete wavelet transform to the image features under study throughout a number of video frames in an image sequence. For purposes of stability assessment, we recover the imagefeature vectors for each video frame and then track them trough a series of consecutive frames in the image sequence. We apply the discrete wavelet transform to the time series constructed from the pairwise Euclidean distances for each of the image features under study and use the wavelet transform coefficients to assess their stability. We then recover the stable features by clustering together those time series which exhibit largely constant low-pass wavelet coefficients. We present results of the stability analysis for Harris corners, Maximally Stable Extremal Regions, and Scale Invariant Feature Transform regions extracted from two real-world video sequences. We also elaborate on the applications of our method to indexing, retrieval, and compression of stable image feature vectors.

[1]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[2]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[3]  Walter Cazzola,et al.  Image Database Retrieval Using Wavelet Packets Compressed Data , 2002 .

[4]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[5]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[6]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Wan-Chi Siu,et al.  Denoising by singularity detection , 1999, IEEE Trans. Signal Process..

[8]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[9]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[10]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[11]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  W. Heisenberg Über den anschaulichen Inhalt der quantentheoretischen Kinematik und Mechanik , 1927 .

[14]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[16]  Andrew Zisserman,et al.  Automated Scene Matching in Movies , 2002, CIVR.

[17]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[18]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[19]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[20]  Cordelia Schmid,et al.  An Affine Invariant Interest Point Detector , 2002, ECCV.

[21]  Edward H. Adelson,et al.  A multiresolution spline with application to image mosaics , 1983, TOGS.

[22]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[23]  James L. Crowley,et al.  A Representation for Shape Based on Peaks and Ridges in the Difference of Low-Pass Transform , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.