A miniaturized space-variant active vision system: Cortex-I

We have developed a prototype for a miniaturized, active vision system with a sensor architecture based on a logarithmically structured, space-variant, pixel geometry. The central part of the image has a high resolution, and the periphery has a a smoothly falling resolution. The human visual system uses a similar image architecture. Our system integrates a miniature CCD-based camera, a novel pantilt actuator/controller, general purpose processors, a video-telephone modem and a display. Due to the ability of space-variant sensors to cover large work spaces, yet provide high acuity with an extremely small number of pixels, architectures with space-variant, active vision systems provide a potential for reductions in system size and cost of several orders of magnitude. Cortex-I takes up less than a third of a cubic foot, including camera, actuators, control, computers, and power supply, and was built for a (one-off) parts cost of roughly US $2000. In this paper, we describe several applications that we have developed for Cortex-I such as tracking moving objects, visual attention, pattern recognition (license plate reading), and video-telephone communcications (teleoperation). We report here on the design of the camera and optics (8 × 8 × 8 mm), a method to convert the uniform image to a space-variant image, and a new miniature pan-tilt actuator, the spherical pointing motor (SPM), (4 × 5 × 6 cm). Finally, we discuss applications for motion tracking and license plate reading. Potential application domains for systems of this type include vision systems for mobile robots and robot manipulators, traffic monitoring systems, security and surveillance, telerobotics, and consumer video communications. The long-range goal of this project is to demonstrate that major new applications of robotics will become feasible when small, low-cost, machine-vision systems can be mass produced. We use the term “commodity robotics” to express the expected impact of the possibilities for opening up new application niches in robotics and machine vision, for what has until now been an expensive, and therefore limited, technology.

[1]  Eric Krotkov,et al.  An agile stereo camera system for flexible image acquisition , 1988, IEEE J. Robotics Autom..

[2]  Volker Graefe,et al.  Applications of dynamic monocular machine vision , 1988, Machine Vision and Applications.

[3]  Benjamin B. Bederson,et al.  A miniature pan-tilt actuator: the spherical pointing motor , 2011, IEEE Trans. Robotics Autom..

[4]  Ping-Wen Ong,et al.  Space-variant optical character recognition , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[5]  A. Lynn Abbott,et al.  Active surface reconstruction by integrating focus, vergence, stereo, and camera calibration , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[6]  Charles M Hains,et al.  Image-processing system , 1993 .

[7]  I-Chen Wu,et al.  Machine-independent image processing: Performance of apply on diverse architectures , 1989, Comput. Vis. Graph. Image Process..

[8]  Benjamin B. Bederson,et al.  Two miniature pan-tilt devices , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[9]  Peter J. Burt,et al.  Attention mechanisms for vision in a dynamic world , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[10]  A. Lynn Abbott,et al.  Surface Reconstruction By Dynamic Integration Of Focus, Camera Vergence, And Stereo , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[11]  James J. Clark,et al.  Modal Control Of An Attentive Vision System , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[12]  Azriel Rosenfeld,et al.  Computer Vision , 1988, Adv. Comput..

[13]  Michael D. Howard,et al.  HBA Vision Architecture: Built and Benchmarked , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Dana H. Ballard,et al.  The Rochester Robot , 1988 .

[15]  Daniel E. Koditschek,et al.  Distributed real-time control of a spatial robot juggler , 1992, Computer.

[16]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[17]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[18]  Eric L. Schwartz,et al.  Applications of computer graphics and image processing to 2D and 3D modeling of the functional architecture of visual cortex , 1988, IEEE Computer Graphics and Applications.

[19]  R. Bajcsy Active perception , 1988 .

[20]  Richard S. Wallace Miniature direct drive rotary actuators , 1993, Robotics Auton. Syst..

[21]  A. L. Abbott Selective fixation control for machine vision: a survey , 1991, Conference Proceedings 1991 IEEE International Conference on Systems, Man, and Cybernetics.

[22]  Richard S. Wallace Miniature direct drive rotary actuators II: Eye, finger and leg , 1994, Robotics Auton. Syst..

[23]  Carl F. R. Weiman,et al.  Logarithmic spiral grids for image-processing and display , 1979 .

[24]  Andrea Califano,et al.  Data and model driven foveation , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[25]  Benjamin B. Bederson,et al.  Space variant image processing , 1994, International Journal of Computer Vision.

[26]  Giulio Sandini,et al.  Active vision based on space-variant sensing , 1991 .

[27]  Allen M. Waxman,et al.  Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot MAVIN , 1991, Neural Networks.

[28]  Lloyd Motz,et al.  Three-Component Optically Compensated Varifocal System* , 1962 .

[29]  Leonard Bergstein General Theory of Optically Compensated Varifocal Systems , 1958 .

[30]  Giulio Sandini,et al.  A Foveated Retina-Like Sensor Using CCD Technology , 1989, Analog VLSI Implementation of Neural Systems.

[31]  E. L. Schwartz,et al.  Spatial mapping in the primate sensory projection: Analytic structure and relevance to perception , 1977, Biological Cybernetics.

[32]  Eric L. Schwartz,et al.  Design considerations for a space-variant visual sensor with complex-logarithmic geometry , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[33]  N.R. Malik,et al.  Graph theory with applications to engineering and computer science , 1975, Proceedings of the IEEE.

[34]  Terry Caelli,et al.  On the Representation of Visual Information , 2001, IWVF.

[35]  A. Lynn Abbott,et al.  University of Illinois active vision system , 1992, Other Conferences.

[36]  Peter K. Allen,et al.  Real-time visual servoing , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.