An optimization framework for feature extraction

In this paper, we propose a unified optimization framework for feature extraction that lets us simultaneously take into account image data and semantic knowledge: We model objects using a language that specifies both photometric and geometric constraints and defines an information-theoretic objective function that measures the fit of the models to the data. We then treat the problem of finding objects as one of generating the optimal description of the image in terms of this language. We have validated our framework by performing extensive experiments on detecting objects in aerial imagery described by simple geometric constraints and have developed two algorithms for generating optimal descriptions. The first one starts with a rough sketch of a polygonal object and deforms the initial contour to maximize the objective function, thus finding object outlines. The second one automatically extracts complex rectilinear buildings from complex aerial images.
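The first algorithm, deforming a rough polygonal sketch to maximize an objective function, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the synthetic image, the function names (`make_image`, `edge_strength`, `score`, `deform`), the greedy one-pixel vertex moves, and above all the objective, which is a plain mean gradient magnitude along the contour standing in for the information-theoretic objective described above.

```python
def make_image(size=32, box=(8, 8, 24, 24)):
    """Hypothetical test image: a bright axis-aligned box on a dark background."""
    x0, y0, x1, y1 = box
    return [[1.0 if x0 <= x < x1 and y0 <= y < y1 else 0.0
             for x in range(size)] for y in range(size)]


def edge_strength(img, x, y):
    """Central-difference gradient magnitude at pixel (x, y); 0 near the border."""
    h, w = len(img), len(img[0])
    if not (1 <= x < w - 1 and 1 <= y < h - 1):
        return 0.0
    gx = img[y][x + 1] - img[y][x - 1]
    gy = img[y + 1][x] - img[y - 1][x]
    return (gx * gx + gy * gy) ** 0.5


def score(img, poly, samples=20):
    """Stand-in objective: mean edge strength sampled along the closed polygon."""
    total, count = 0.0, 0
    n = len(poly)
    for i in range(n):
        (xa, ya), (xb, yb) = poly[i], poly[(i + 1) % n]
        for k in range(samples):
            t = k / samples
            x = round(xa + t * (xb - xa))
            y = round(ya + t * (yb - ya))
            total += edge_strength(img, x, y)
            count += 1
    return total / count


def deform(img, poly, max_iters=100):
    """Greedy hill climbing: move one vertex by one pixel whenever the score improves."""
    poly = list(poly)
    best = score(img, poly)
    for _ in range(max_iters):
        improved = False
        for i in range(len(poly)):
            for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                x, y = poly[i]
                cand = poly[:i] + [(x + dx, y + dy)] + poly[i + 1:]
                s = score(img, cand)
                if s > best:
                    poly, best, improved = cand, s, True
        if not improved:
            break  # local maximum of the stand-in objective
    return poly, best


# Rough initial sketch drawn slightly outside the true box.
image = make_image()
start = [(6, 6), (26, 6), (26, 26), (6, 26)]
initial_score = score(image, start)
outline, final_score = deform(image, start)
print(initial_score, final_score, outline)
```

Greedy local search is used here only because it is short; it stops at the first local maximum and makes no claim about how the optimization is actually carried out in the framework.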
