The detection of 2D image features using local energy

Accurate detection and localization of two-dimensional (2D) image features (or `keypoints') is important for vision tasks such as structure from motion, stereo matching, and line labeling. 2D image features are ideal for these vision tasks because 2D image features are high in information and yet they occur sparsely in typical images. Several methods for the detection of 2D image features have already been developed. However, it is di cult to assess the performance of these methods because no one has produced an adequate de nition of corners that encompasses all types of 2D luminance variations that make up 2D image features. The fact that there does not exist a consensus on the de nition of 2D image features is not surprising given the confusion surrounding the de nition of 1D image features. The general perception of 1D image features has been that they correspond to `edges' in an image and so are points where the intensity gradient in some direction is a local maximum. The Sobel [68], Canny [7] and Marr-Hildreth [37] operators all use this model of 1D features, either implicitly or explicitly. However, other pro les in an image also make up valid 1D features, such as spike and roof pro les, as well as combinations of all these feature types. Spikes and roof pro les can also be found by looking for points where the rate of change of the intensity gradient is locally maximal, as Canny did in de ning a `roof-detector' in much the same manner that he developed his `edge-detector'. While this allows the detection of a wider variety of 1D features pro les, it comes no closer to the goal of unifying these di erent feature types to an encompassing de nition of 1D features. The introduction of the local energy model of image features by Morrone and Owens [45] in 1987 provided a uni ed de nition of 1D image features for the rst iii time. They postulated that image features correspond to points in an image where there is maximal phase congruency in the frequency domain representation of the image. That is, image features correspond to points of maximal order in the phase domain of the image signal. These points of maximal phase congruency correspond to step-edge, roof, and ramp intensity pro les, and combinations thereof. They also correspond to the Mach bands perceived by humans in trapezoidal feature pro les. This thesis extends the notion of phase congruency to 2D image features. As 1D image features correspond to points of maximal 1D order in the phase domain of the image signal, this thesis contends that 2D image features correspond to maximal 2D order in this domain. These points of maximal 2D phase congruency include all the di erent types of 2D image features, including grey-level corners, line terminations, blobs, and a variety of junctions. Early attempts at 2D feature detection were simple `corner detectors' based on a model of a grey-level corner in much the same way that early 1D feature detectors were based on a model of step-edges. Some recent attempts have included more complex models of 2D features, although this is basically a more complex a priori judgement of the types of luminance pro les that are to be labeled as 2D features. This thesis develops the 2D local energy feature detector based on a new, uni ed de nition of 2D image features that marks points of locally maximum 2D order in the phase domain representation of the image as 2D image features. The performance of an implementation of 2D local energy is assessed, and compared to several existing methods of 2D feature detection. This thesis also shows that in contrast to most other methods of 2D feature detection, 2D local energy is an idempotent operator. The extension of phase congruency to 2D image features also uni es the detection of image features. 1D and 2D image features correspond to 1D and 2D order in the phase domain representation of the image respectively. This de nition imposes a hierarchy of image features, with 2D image features being a subset of 1D image features. This ordering of image features has been implied ever since 1D features were used as candidate points for 2D feature detection by Kitchen [28] and others. Local energy enables the extraction of both 1D and 2D image features in a consistent manner; 2D image features are extracted from the 1D image features using the same iv operations that are used to extract 1D image features from the input image. The consistent approach to the detection of image features presented in this thesis allows the hierarchy of primitive image features to be naturally extended to higher order image features. These higher order image features can then also be extracted from higher order image data using the same hierarchical approach. This thesis shows how local energy can be naturally extended to the detection of 1D (surface) and higher order image features in 3D data sets. Results are presented for the detection of 1D image features in 3D confocal microscope images, showing superior performance to the 3D extension of the Sobel operator [74]. v Preface Some of the work in this thesis has already been published. Most of the work in Chapters 4 to 6 appears in a technical report in the Department of Computer Science [58], and a more concise version of this work is to appear in Image and Vision Computing [59]. I am the principal contributing author for both these papers. With the exception of the 3D surface detector described in Section 7.1, all of the work presented in this thesis|including algorithms and implementations|is my own. The surface detection work has been principally performed by Chris Pudney, with my contribution being to the general methodology of implementation and the optimizations with regard to performing the FFTs and applying the energy lters. Various forms of this work have been published in the Proceedings of ANZIIS'95 [53] and Proceedings of the International Computer Science Conference [54] with another paper on this work to appear in the Journal of Assisted Confocal Microscopy. vi Acknowledgements First of all I would like to thank my supervisor Robyn Owens for her constant support and guidance throughout my candidature. She ensured that I got o on the right foot and always knew where I was, and where I was going even when I didn't, particularly at the beginning of my studies. It was always reassuring to know that she would quickly understand any problems I was grappling with and o er helpful suggestions, although the speed she did this and its apparent ease to her were a little disconcerting. Robyn also strongly encouraged me to apply to travel to Switzerland to study at ETH Z urich which was both very bene cial to my studies and a fantastic time. I would like to thank UWA and ETH Z urich for supporting my stay in Switzerland. Thanks go to everyone in the Image Sciences (BIWI) group at ETH for making my stay educational and enjoyable. Olaf, Vreni, Martin, Markus and the lunch crew, Gaudenz, Friedrich, Wolfram, Tuomo, Olof, and Marjan made for an entertaining time. Extra special thanks go to the K ublers for making me feel like part of the family during my stay in Switzerland; Olaf and Guni for showing me the beautiful Swiss countryside while unsuccessfully trying to give me wanderschaden, Dani for risking the meat with an Aussie at his birthday party BBQ, and Flo for being such a great mate, for losing the Kiwi accent (eh?), and for nally getting his revenge for numerous dumpings at Triggs Beach by taking me snowboarding on a hard-as-rock glacier. I would also like to thank the non-resident member of the K ubler household, Gaudie, for his friendship and for introducing me to the outdoor cinema, and Roman for all the basketball. I have bene ted from many conversations with Mike Robbins, Peter Kovesi and vii Chris Pudney regarding local energy. Bruce Backman was a good sounding board for ideas and always provided a di erent perspective. Friedrich Heitger was very helpful with e-mail messages when I was trying to implement his work, and while I was at ETH taught me much of what I know about lter design. Olof Henricsson and I also had many interesting discussions regarding the use of low-level features after our weekly hit of tennis. Lachlan Partington and Chris Pudney provided valuable feedback through their careful reading of a late draft of this thesis. I also thank Chris for supplying both the output for Figure 4 and numerous tips on LATEX. I would like to thank the Department of Computer Science at The University of Western Australia and the Department of Education, Employment and Training for their support during this candidature. Thanks go to my family for their support of my studies, especially my parents who put me through school and then put up with me at home for most of my tertiary studies. I am forever endebted to mywife Karenza for her love and support throughout my studies and to her I give special thanks. viii

[1]  D H HUBEL,et al.  RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT. , 1965, Journal of neurophysiology.

[2]  Ian D. Reid,et al.  Tracking foveated corner clusters using affine structure , 1993, 1993 (4th) International Conference on Computer Vision.

[3]  Robyn A. Owens,et al.  Feature detection from local energy , 1987, Pattern Recognit. Lett..

[4]  P Perona,et al.  Preattentive texture discrimination with early vision mechanisms. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[5]  Rachid Deriche,et al.  Accurate corner detection: an analytical study , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[6]  Peter Kovesi,et al.  Feature Detection Using Oriented Local Energy for 3D Confocal Microscope Images , 1995, ICSC.

[7]  Rachid Deriche,et al.  Recovering and characterizing image features using an efficient model based approach , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Michael Brady,et al.  The Curvature Primal Sketch , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  S. Sutherland Seeing things , 1989, Nature.

[10]  G. Orban,et al.  Responses of visual cortical neurons to curved stimuli and chevrons , 1990, Vision Research.

[11]  Wolfgang Förstner,et al.  A Framework for Low Level Feature Extraction , 1994, ECCV.

[12]  V. Berzins Accuracy of laplacian edge detectors , 1984 .

[13]  Peter Kovesi,et al.  A 3D local energy surface detector for confocal microscope images , 1995, Proceedings of Third Australian and New Zealand Conference on Intelligent Information Systems. ANZIIS-95.

[14]  Robyn A. Owens,et al.  2D feature detection via local energy , 1997, Image Vis. Comput..

[15]  W. Freeman Steerable filters and local analysis of image structure , 1992 .

[16]  Irwin Edward Sobel,et al.  Camera Models and Machine Perception , 1970 .

[17]  Hans Knutsson,et al.  Texture Analysis Using Two-Dimensional Quadrature Filters , 1983 .

[18]  Ramesh C. Jain,et al.  Detecting time-varying corners , 1984, Comput. Vis. Graph. Image Process..

[19]  Gerald Sommer,et al.  Junction classification by multiple orientation detection , 1994, ECCV.

[20]  Robyn Owens,et al.  The 2D Local Energy Model , 1994 .

[21]  R. Haralick Edge and region analysis for digital image data , 1980 .

[22]  Hans P. Moravec Towards Automatic Visual Obstacle Avoidance , 1977, IJCAI.

[23]  D. Burr,et al.  Feature detection in human vision: a phase-dependent energy model , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[24]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[25]  James W. Cooper,et al.  Early jump-out corner detectors , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Pietro Perona Steerable-scalable kernels for edge detection and junction analysis , 1992, Image Vis. Comput..

[27]  Olaf Kübler,et al.  Simulation of neural contour mechanisms: from simple to end-stopped cells , 1992, Vision Research.

[28]  Karl Rohr,et al.  Modelling and identification of characteristic intensity variations , 1992, Image Vis. Comput..

[29]  J. Canny Finding Edges and Lines in Images , 1983 .

[30]  Hans P. Moravec Visual Mapping by a Robot Rover , 1979, IJCAI.

[31]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[32]  Svetha Venkatesh,et al.  Edge detection is a projection , 1989, Pattern Recognit. Lett..

[33]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[34]  J. Alison Noble,et al.  Finding Corners , 1988, Alvey Vision Conference.

[35]  Rachid Deriche,et al.  3D edge detection by separable recursive filtering and edge closing , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[36]  Hans-Hellmut Nagel,et al.  Volumetric model and 3D trajectory of a moving car derived from monocular TV frame sequences of a street scene , 1981, Comput. Graph. Image Process..

[37]  Han Wang,et al.  Gray Level Corner Detection , 1998, MVA.

[38]  Steven W. Zucker,et al.  A Three-Dimensional Edge Operator , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Paul Beaudet,et al.  Rotationally invariant image operators , 1978 .

[40]  Robyn Owens,et al.  2D feature detection and identification , 1992 .

[41]  U Tiede,et al.  3-D segmentation of MR images of the head for 3-D display. , 1990, IEEE transactions on medical imaging.

[42]  Hsun K. Liu,et al.  Two and three dimensional boundary detection , 1977 .

[43]  Rüdiger von der Heydt,et al.  Detection of General Edges and Keypoints , 1992, ECCV.

[44]  Peter Kovesi,et al.  Surface Detection in 3D Confocal Microscope Images via Local Energy and Ridge Tracing , 1996 .

[45]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[46]  Gérard G. Medioni,et al.  Corner detection and curve representation using cubic B-splines , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[47]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[48]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[49]  R. Wilson,et al.  Anisotropic Nonstationary Image Estimation and Its Applications: Part I - Restoration of Noisy Images , 1983, IEEE Transactions on Communications.

[50]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[51]  D. Burr,et al.  Mach bands are phase dependent , 1986, Nature.

[52]  Svetha Venkatesh,et al.  An energy feature detection scheme , 1989 .

[53]  Robert M. Haralick,et al.  Digital Step Edges from Zero Crossing of Second Directional Derivatives , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.