Non-linear Invertible Representation for Joint Statistical and Perceptual Feature Decorrelation

The aim of many image mappings is representing the signal in a basis of decorrelated features. Two fundamental aspects must be taken into account in the basis selection problem: data distribution and the qualitative meaning of the underlying space. The classical PCA techniques reduce the statistical correlation using the data distribution. However, in applications where human vision has to be taken into account, there are perceptual factors that make the feature space uneven, and additional interaction among the dimensions may arise. In this work a common framework is presented to analyse the perceptual and statistical interactions among the coefficients of any representation. Using a recent non-linear perception model a set of input-dependent features is obtained which simultaneously remove the statistical and perceptual correlations between coefficients. A fast method to invert this representation is also presented, so no input-dependent transform has to be stored. The decorrelating power of the proposed representation suggests that it is a promising alternative to the linear transforms used in image coding, fusion or retrieval applications.

[1]  Eero P. Simoncelli,et al.  Image compression via joint statistical characterization in the wavelet domain , 1999, IEEE Trans. Image Process..

[2]  Francesc J. Ferri,et al.  Perceptually and statistically decorrelated features for image representation: application to transform coding , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[3]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[4]  J A Solomon,et al.  Model of visual contrast gain control and pattern masking. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[5]  A. Murat Tekalp,et al.  Digital Video Processing , 1995 .

[6]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[7]  Eero P. Simoncelli,et al.  Modeling Surround Suppression in V1 Neurons with a Statistically Derived Normalization Model , 1998, NIPS.

[8]  B. Wandell Foundations of vision , 1995 .

[9]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[10]  Andrew B. Watson,et al.  Visibility of DCT basis functions: effects of contrast masking , 1994, Proceedings of IEEE Data Compression Conference (DCC'94).

[11]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[12]  Elemer E. Rosinger Non-linear partial differential equations , 1990 .

[13]  J.G. Daugman,et al.  Entropy reduction and decorrelation in visual coding by oriented neural receptive fields , 1989, IEEE Transactions on Biomedical Engineering.

[14]  Patrick C. Teo,et al.  Perceptual image distortion , 1994, Proceedings of 1st International Conference on Image Processing.

[15]  R. R. Clarke Transform coding of images , 1985 .