Global semantic classification of scenes using ridgelet transform

In recent years, new harmonic analysis tools providing sparse representation in high dimension space have been proposed. In particular, ridgelets and curvelets bases are similar to the sparse components of naturally occurring image data derived empirically by computational neuroscience researchers. Ridgelets take the form of basis elements which exhibit very high directional sensitivity and are highly anisotropic. The ridgelet transform have been shown to provide a sparse representation for smooth objects with straight edges. Independently, for the purpose of scene description, the shape of the Fourier energy spectra has been used as an efficient way to provide a “holistic” description of the scene picture and its semantic category. Similarly, we focus on a simple binary semantic classification (artificial vs. natural) based on various ridgelet features. The learning stage is performed on a large image database using different state of the art Linear Discriminant techniques. Classification results are compared with those resulting from the Gabor representation. Additionally, ridgelet representation provides us with a way to accurately reconstruct the original signal. Using this synthesis step, we filter the ridgelet coefficients with the discriminant vector. The resulting image identifies the elements within the scene contributing to the different perceptual dimensions.

[1]  Aude Oliva,et al.  Global semantic classification of scenes using power spectrum templates , 1999 .

[2]  A. G. Flesia,et al.  Can recent innovations in harmonic analysis `explain' key findings in natural image statistics? , 2001, Network.

[3]  Emmanuel J. Candès,et al.  The curvelet transform for image denoising , 2002, IEEE Trans. Image Process..

[4]  E. Candès New tight frames of curvelets and optimal representations of objects with C² singularities , 2002 .

[5]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[6]  Tristrom Cooke,et al.  Two Variations on Fisher's Linear Discriminant for Pattern Recognition , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Antonio Torralba,et al.  Semantic organization of scenes using discriminant structural templates , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8]  Emmanuel J. Candès Monoscale Ridgelets for the Representation of Images with Edges , 1910 .

[9]  E. Candès,et al.  New tight frames of curvelets and optimal representations of objects with piecewise C2 singularities , 2004 .

[10]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[11]  Gunnar Rätsch,et al.  Constructing Descriptive and Discriminative Nonlinear Features: Rayleigh Coefficients in Kernel Feature Spaces , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[13]  Antonio Torralba,et al.  Statistics of natural image categories , 2003, Network.

[14]  E. Candès,et al.  Curvelets: A Surprisingly Effective Nonadaptive Representation for Objects with Edges , 2000 .

[15]  Konstantinos N. Plataniotis,et al.  Face recognition using kernel direct discriminant analysis algorithms , 2003, IEEE Trans. Neural Networks.

[16]  Antonio Torralba,et al.  Scene-Centered Description from Spatial Envelope Properties , 2002, Biologically Motivated Computer Vision.