Separating desired image and signal invariant components from extraneous variations

Images and signals can be characterized by representations invariant to time shifts, spatial shifts, frequency shifts, and scale changes as the situation dictates. Advances in time-frequency analysis and scale transform techniques have made this possible. The next step is to distinguish between invariant forms representing different classes of image or signal. Unfortunately, additional factors such as noise contamination and `style' differences complicate this. A ready example is found in text, where letters and words may vary in size and position within the image segment being examined. Examples of complicating variations include font used, corruption during fax transmission, and printer characteristics. The solution advanced in this paper is to cast the desired invariants into separate subspaces for each extraneous factor or group of factors. The first goal is to have minimal overlap between these subspaces and the second goal is to be able to identify each subspace accurately. Concepts borrowed from high-resolution spectral analysis, but adapted uniquely to this problem have been found to be useful in this context. Once the pertinent subspace is identified, the recognition of a particular invariant form within this subspace is relatively simple using well-known singular value decomposition techniques.

[1]  Leon Cohen,et al.  The scale representation , 1993, IEEE Trans. Signal Process..

[2]  Ching Y. Suen,et al.  Historical review of OCR research and development , 1992, Proc. IEEE.

[3]  V. Pisarenko The Retrieval of Harmonics from a Covariance Function , 1973 .

[4]  Harry Wechsler,et al.  A paradigm for invariant object recognition of brightness, optical flow and binocular disparity images , 1982, Pattern Recognit. Lett..

[5]  Gabriel Cristóbal,et al.  Wigner distribution representation of digital images , 1987, Pattern Recognit. Lett..

[6]  William J. Williams,et al.  Discrete scale transform for signal analysis , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[7]  Harry Wechsler,et al.  Derivation of optical flow using a spatiotemporal-Frequency approach , 1987, Comput. Vis. Graph. Image Process..

[8]  Harry Wechsler,et al.  A Theory for Invariant Object Recognition in the Frontoparallel Plane , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Harry Wechsler,et al.  Spatial/spatial-frequency representations for image segmentation and grouping , 1991, Image Vis. Comput..

[10]  William J. Williams,et al.  Reduced Interference Time-Frequency Distributions , 1992 .