The structure of images

In practice the relevant details of images exist only over a restricted range of scale. Hence it is important to study the dependence of image structure on the level of resolution. It seems clear enough that visual perception treats images on several levels of resolution simultaneously and that this fact must be important for the study of perception. However, no applicable mathematically formulated theory to deal with such problems appers to exist. In this paper it is shown that any image can be embedded in a one-parameter family of derived images (with resolution as the parameter) in essentially only one unique way if the constraint that no spurious detail should be generated when the resolution is diminished, is applied. The structure of this family is governed by the well known diffusion equation (a parabolic, linear, partial differential equation of the second order). As such the structure fits into existing theories that treat the front end of the visual system as a continuous tack of homogeneous layer, characterized by iterated local processing schemes. When resolution is decreased the images becomes less articulated because the extrem (“light and dark blobs”) disappear one after the other. This erosion of structure is a simple process that is similar in every case. As a result any image can be described as a juxtaposed and nested set of light and dark blobs, wherein each blod has a limited range of resolution in which it manifests itself. The structure of the family of derived images permits a derivation of the sampling density required to sample the image at multiple scales of resolution. The natural scale along the resolution axis (leading to an informationally uniform sampling density) is logarithmic, thus the structure is apt for the description of size invariances.

[1]  A. C. Esq. XL. On contour and slope lines , 1859 .

[2]  J. Maxwell L. On hills and dales: To the editors of the Philosophical Magazine and Journal , 1870 .

[3]  H. Marko Die Systemtheorie der homogenen Schichten , 1969, Kybernetik.

[4]  R. Thom Stabilité structurelle et morphogenèse , 1974 .

[5]  M. Spivak A comprehensive introduction to differential geometry , 1979 .

[6]  Roger W. Ehrich,et al.  Representation of Random Waveforms by Relational Trees , 1976, IEEE Transactions on Computers.

[7]  G. A. Hay,et al.  A model of visual threshold detection. , 1977, Journal of theoretical biology.

[8]  D Marr,et al.  Bandpass channels, zero-crossings, and early visual information processing. , 1979, Journal of the Optical Society of America.

[9]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[10]  Azriel Rosenfeld,et al.  Segmentation and Estimation of Image Region Properties through Cooperative Hierarchial Computation , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  J. Koenderink,et al.  Invariant features of contrast detection: an explanation in terms of self-similar detector arrays. , 1982, Journal of the Optical Society of America.

[12]  Andrew P. Witkin,et al.  Scale-Space Filtering , 1983, IJCAI.

[13]  R. Röhler Ein Modell zur örtlich-zeitlichen Signalübertragung im visuellen System des Menschen auf der Basis der linearen Systemtheorie kontinuierlicher Medien , 1976, Biological Cybernetics.

[14]  J. Koenderink,et al.  The structure of two-dimensional scalar fields with applications to vision , 1979, Biological Cybernetics.

[15]  J. Koenderink,et al.  Visual detection of spatial contrast; Influence of location in the visual field, target extent and illuminance level , 1978, Biological Cybernetics.