Active fixation for scene exploration

It is well-known that active selection of fixation points in humans is highly context and task dependent. It is therefore likely that successful computational processes for fixation in active vision should be so too. We are considering active fixation in the context of recognition of man-made objects characterized by their shapes. In this situation the qualitative shape and type of observed junctions play an important role. The fixations are driven by a grouping strategy, which forms sets of connected junctions separated from the surrounding at depth discontinuities. We have furthermore developed a methodology for rapid active detection and classification of junctions by selection of fixation points. The approach is based on direct computations from image data and allows integration of stereo and accommodation cues with luminance information. This work form a part of an effort to perform active recognition of generic objects, in the spirit of Malik and Biederman, but on real imagery rather than on line-drawings.

[1]  Tony Lindeberg,et al.  Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention , 1993, International Journal of Computer Vision.

[2]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[3]  John K. Tsotsos,et al.  An Attentional Prototype for Early Vision , 1992, ECCV.

[4]  Azriel Rosenfeld,et al.  3-D Shape Recovery Using Distributed Aspect Matching , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  John K. Tsotsos,et al.  Efficient serial associative memory , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Kjell Brunnström,et al.  Active Detection and Classsification of Junctions by Foveation with a Head-Eye System Guided by the Scale-Space Primal Sketch , 1992, ECCV.

[7]  John K. Tsotsos,et al.  Integration of camera motion behaviours for active object recognition , 1994 .

[8]  Azriel Rosenfeld,et al.  Gray-level corner detection , 1982, Pattern Recognit. Lett..

[9]  Ruzena Bajcsy,et al.  Active and exploratory perception , 1992, CVGIP Image Underst..

[10]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[11]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[12]  Azriel Rosenfeld,et al.  From volumes to views: An approach to 3-D object recognition , 1992, CVGIP Image Underst..

[13]  John K. Tsotsos,et al.  Active object recognition , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Jan-Olof Eklundh,et al.  A head-eye system - Analysis and design , 1992, CVGIP Image Underst..

[15]  J. Koenderink,et al.  Receptive field families , 1990, Biological Cybernetics.

[16]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[17]  Jay Martin Tenenbaum,et al.  Accommodation in computer vision , 1971 .

[18]  I. Biederman Human image understanding: Recent research and a theory , 1985, Computer Vision Graphics and Image Processing.

[19]  Yiannis Aloimonos,et al.  Purposive and qualitative active vision , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[20]  Hans P. Moravec Obstacle avoidance and navigation in the real world by a seeing robot rover , 1980 .

[21]  S. Ullman Visual routines , 1984, Cognition.

[22]  Jan-Olof Eklundh,et al.  Integrating primary ocular processes , 1992, Image Vis. Comput..

[23]  Christopher M. Brown,et al.  Where to Look Next Using a Bayes Net: Incorporating Geometric Relations , 1992, ECCV.

[24]  Kjell Brunnström,et al.  Active Fixation for Junction Classification , 1993, CAIP.

[25]  Kjell Brunnström,et al.  Active fixation for scene exploration , 1996, International Journal of Computer Vision.

[26]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[27]  Irving Biederman,et al.  Human image understanding: Recent research and a theory , 1985, Comput. Vis. Graph. Image Process..

[28]  Hans-Hellmut Nagel,et al.  Volumetric model and 3D trajectory of a moving car derived from monocular TV frame sequences of a street scence , 1982, Computer Graphics and Image Processing.

[29]  Jitendra Malik,et al.  Interpreting line drawings of curved objects , 1986, International Journal of Computer Vision.

[30]  James L. Crowley,et al.  Vision as Process: Basic Research on Computer Vision Systems , 1997 .

[31]  Jan-Olof Eklundh,et al.  Shape Representation by Multiscale Contour Approximation , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[33]  Hans Knutsson,et al.  Focus of attention control , 1990 .

[34]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[35]  Wei Zhang,et al.  An extension of Marr's signature based edge classification and other methods determining diffuseness and height of edges, and bar edge width , 1993, 1993 (4th) International Conference on Computer Vision.

[36]  Kjell Brunnström,et al.  On scale and resolution in active analysis of local image structure , 1990, Image Vis. Comput..