论文信息 - Dynamic estimation in computational vision

Dynamic estimation in computational vision

Spatial coherence constraints are commonly used to regularize the problems of reconstructing dense visual fields like depth, shape, and motion. Recent developments in theory and practice show that the local nature of spatial coherence constraints allows us to solve single-frame reconstruction problems efficiently with, for example, multiresolution approaches. While it is reasonable to impose temporal as well as spatial coherence on the unknown for a more robust estimation through data fusion over both space and time, such temporal, multi-frame extensions of the problems have not been as widely considered, perhaps due to the different and severe computational demands imposed by the sequential arrival of the image data. We present here an efficient filtering algorithm for sequential estimation of dense visual fields, using stochastic descriptor dynamic system models to capture temporal smoothness and dynamics of the fields. Theoretically, standard Kalman filtering techniques (generalized for stochastic descriptor systems) are applicable to solving temporally-extended visual field reconstruction problems, but their implementation is practically impossible because of the high dimensionality and because the time-varying nature of such problems requires on-line propagation of large covariance matrices. By exploiting the inherent local spatial structure of the reconstruction problem, however, we have developed filtering techniques that effectively approximate the information form of the Kalman filter. This is achieved by replacing covariance propagation steps with what are essentially low-order spatial model identification steps, in which spatial models with a strictly local support are constructed based on covariance information. In effect, we are decomposing the multi-frame problem into a series of Bayesian single-frame problems, in which the spatial prior model used reflects knowledge from the previous image frames. The resulting filtering algorithm has memory and computational requirements of O(N) each for a frame of data, where N is the number of pixels in a frame, and, additionally, the filter is implementable in parallel. As low-level visual field reconstruction is often considered to be a front-end in a hierachical visual processing system and thus might be VLSI-implemented, we have also designed a square root version of the information Kalman filter as an alternative algorithm with a reduced numerical dynamic range. The square root information 3 filter features an efficient, iterative computational structure and is parallelizable as well. Experiments have shown several beneficial effects of our multi-frame formulation applied to the sequential estimation of optical flow. For example, temporal assimilation of the data makes the reconstruction more robust to noise. Also, there are cases where the classic "aperture problem" of motion vision cannot be resolved satisfactorily by spatial regularization alone but is dramatically resolved by the additional temporal coherence constraint. Thesis Supervisor: Alan S. Willsky Title: Professor, Electrical Engineering

Toshio Mike Chin

[1] A. Yuille,et al. A common theoretical framework for visual motion's spatial and temporal coherence , 1989, [1989] Proceedings. Workshop on Visual Motion.

[2] E. Asplund,et al. Inverses of Matrices $\{a_{ij}\}$ which Satisfy $a_{ij} = 0$ for $j > i+p$. , 1959 .

[3] A. Mariano. Contour Analysis: A New Approach for Melding Geophysical Fields , 1990 .

[4] Hans-Hellmut Nagel,et al. An Investigation of Smoothness Constraints for the Estimation of Displacement Vector Fields from Image Sequences , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] P. Pérez,et al. Parallel visual motion analysis using multiscale Markov random fields , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[6] Phil Greenway,et al. Temporal Regularisation Of Optical Flow , 1988, Other Conferences.

[7] J. Gillis,et al. Matrix Iterative Analysis , 1961 .

[8] Joachim Heel. Dynamic motion vision , 1990, Robotics Auton. Syst..

[9] Harry L. Van Trees,et al. Detection, Estimation, and Modulation Theory, Part I , 1968 .

[10] Ajit Singh,et al. Incremental estimation of image-flow using a Kalman filter , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[11] Charles R. Johnson,et al. Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[12] D J Heeger,et al. Model for the extraction of image flow. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[13] B. G. Schunck. The image flow constraint equation , 1986 .

[14] David G. Luenberger,et al. Time-invariant descriptor systems , 1978, Autom..

[15] Andrew Blake,et al. Visual Reconstruction , 1987, Deep Learning for EEG-Based Brain–Computer Interfaces.

[16] T. N. Stevenson,et al. Fluid Mechanics , 2021, Nature.

[17] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.

[18] A. Willsky,et al. Kalman filtering and Riccati equations for descriptor systems , 1992 .

[19] W. Eric L. Grimson,et al. An implementation of a computational theory of visual surface interpolation , 1983, Comput. Vis. Graph. Image Process..

[20] P. Anandan. Measuring Visual Motion From Image Sequences , 1987 .

[21] A.K. Jain,et al. Advances in mathematical models for image processing , 1981, Proceedings of the IEEE.

[22] Donald Geman,et al. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[23] L. Silverman,et al. Image model representation and line-by-line recursive restoration , 1976, 1976 IEEE Conference on Decision and Control including the 15th Symposium on Adaptive Processes.

[24] J.K. Aggarwal,et al. Correspondence processes in dynamic scene analysis , 1981, Proceedings of the IEEE.

[25] José L. Marroquín,et al. Random measure fields and the integration of visual information , 1992, IEEE Trans. Syst. Man Cybern..

[26] Dennis Michael Martinez. Model-based motion estimation and its application to restoration and interpolation of motion pictures , 1986 .

[27] Joseph K. Kearney,et al. Optical Flow Estimation: An Error Analysis of Gradient-Based Methods with Local Optimization , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28] Berthold K. P. Horn. Image Intensity Understanding , 1975 .

[29] Philippe Saint-Marc,et al. Adaptive Smoothing: A General Tool for Early Vision , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[30] H. C. Longuet-Higgins,et al. The interpretation of a moving retinal image , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[31] A. Willsky,et al. An estimation-based approach to the reconstruction of optical flow , 1987 .

[32] Katsushi Ikeuchi,et al. Numerical Shape from Shading and Occluding Boundaries , 1981, Artif. Intell..

[33] Michael J. Black,et al. A model for the detection of motion over time , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[34] A. Meygret,et al. Segmentation of optical flow and 3D data for the interpretation of mobile objects , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[35] Ellen C. Hildreth,et al. Computations Underlying the Measurement of Visual Motion , 1984, Artif. Intell..

[36] John W. Woods,et al. Two-dimensional discrete Markovian fields , 1972, IEEE Trans. Inf. Theory.

[37] J. D. Robbins,et al. Motion-compensated television coding: Part I , 1979, The Bell System Technical Journal.

[38] Donald B. Olson,et al. Motion and evolution of oceanic rings in a numerical model and in observations , 1990 .

[39] M. Bertero,et al. Ill-posed problems in early vision , 1988, Proc. IEEE.

[40] Y. J. Tejwani,et al. Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[41] G. Bierman. Factorization methods for discrete sequential estimation , 1977 .

[42] Shahriar Negahdaripour,et al. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2004 .

[43] A.S. Willsky,et al. Relationships between digital signal processing and control and estimation theory , 1978, Proceedings of the IEEE.

[44] Joachim Heel. Temporal surface reconstruction , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[45] J. D. Robbins,et al. Motion-compensated coding: Some new results , 1980, The Bell System Technical Journal.

[46] G. Bierman. A Comparison of Discrete Linear Filtering Algorithms , 1973, IEEE Transactions on Aerospace and Electronic Systems.

[47] A. Krener,et al. Modeling and estimation of discrete-time Gaussian reciprocal processes , 1990 .

[48] R. Weale. Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[49] A. Jazwinski. Stochastic Processes and Filtering Theory , 1970 .

[50] Gene H. Golub,et al. Matrix computations , 1983 .

[51] Demetri Terzopoulos. Regularization ofInverseVisualProblemsInvolving Discontinuities , 1986 .

[52] Fuyun Ling. Givens rotation based least squares lattice and related algorithms , 1991, IEEE Trans. Signal Process..

[53] Jerry L. Prince,et al. Motion estimation from tagged MR image sequences , 1992, IEEE Trans. Medical Imaging.

[54] F. Lewis. A survey of linear singular systems , 1986 .

[55] J. Heel. Direct Estimation of Structure and Motion from Multiple Frames , 1990 .

[56] Uwe Schwiegelshohn,et al. A Square Root and Division Free Givens Rotation for Solving Least Squares Problems on Systolic Arrays , 1991, SIAM J. Sci. Comput..

[57] Arthur Gelb,et al. Applied Optimal Estimation , 1974 .

[58] Laurent D. Cohen,et al. A finite element method applied to new active contour models and 3D reconstruction from cross sections , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[59] JOHN w. WOODS,et al. Kalman filtering in two dimensions , 1977, IEEE Trans. Inf. Theory.

[60] John W. Woods,et al. Two-dimensional Kalman filtering , 1981 .

[61] A. Willsky,et al. Solution and linear estimation of 2-D nearest-neighbor models , 1990 .

[62] Katsushi Ikeuchi,et al. Determining Surface Orientations of Specular Surfaces by Using the Photometric Stereo Method , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[63] Fred C. Schweppe,et al. Uncertain dynamic systems , 1973 .

[64] A. Bryson,et al. Discrete square root filtering: A survey of current techniques , 1971 .

[65] G. Golub,et al. Block Preconditioning for the Conjugate Gradient Method , 1985 .

[66] G. Forsythe,et al. The cyclic Jacobi method for computing the principal values of a complex matrix , 1960 .

[67] W E Grimson,et al. A computational theory of visual surface interpolation. , 1982, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[68] Berthold K. P. Horn,et al. Hill shading and the reflectance map , 1981, Proceedings of the IEEE.

[69] D. Luenberger. Dynamic equations in descriptor form , 1977 .