Biological Models for Active Vision: Towards a Unified Architecture

Building a general-purpose, real-time active vision system completely based on biological models is a great challenge. We apply a number of biologically plausible algorithms which address different aspects of vision, such as edge and keypoint detection, feature extraction, optical flow and disparity, shape detection, object recognition and scene modelling into a complete system. We present some of the experiments from our ongoing work, where our system leverages a combination of algorithms to solve complex tasks.

[1]  João Rodrigues,et al.  Multi-scale keypoints in V1 and beyond: object segregation, scale selection, saliency maps and face detection. , 2006, Bio Systems.

[2]  John K. Tsotsos Neurobiological Models of Visual Attention , 2002 .

[3]  Estela Bicho,et al.  The dynamic neural field approach to cognitive robotics , 2006, Journal of neural engineering.

[4]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  David Burr,et al.  Spatiotopic perceptual maps in humans: evidence from motion adaptation , 2012, Proceedings of the Royal Society B: Biological Sciences.

[6]  François Brémond,et al.  Video understanding for complex activity recognition , 2006, Machine Vision and Applications.

[7]  Kunihiko Fukushima,et al.  Neocognitron for handwritten digit recognition , 2003, Neurocomputing.

[8]  Monique Thonnat,et al.  Ontology based complex object recognition , 2008, Image Vis. Comput..

[9]  King-Sun Fu,et al.  Syntactic Pattern Recognition And Applications , 1968 .

[10]  Li Fei-Fei,et al.  Towards total scene understanding: Classification, annotation and segmentation in an automatic framework , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Sinan Kalkan,et al.  Deep Hierarchies in the Primate Visual Cortex: What Can We Learn for Computer Vision? , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Laurent Itti,et al.  Biologically Inspired Mobile Robot Vision Localization , 2009, IEEE Transactions on Robotics.

[13]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Sanja Fidler,et al.  Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Geoffrey E. Hinton,et al.  Connectionist Architectures for Artificial Intelligence , 1990, Computer.

[16]  Hans-Hellmut Nagel,et al.  From image sequences towards conceptual descriptions , 1988, Image Vis. Comput..

[17]  Stephan K. U. Zibner,et al.  Dynamic Neural Fields as Building Blocks of a Cortex-Inspired Architecture for Robotic Scene Representation , 2011, IEEE Transactions on Autonomous Mental Development.

[18]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Michael Leyton,et al.  A Process-Grammar for Shape , 1988, Artif. Intell..

[20]  Heiko Neumann,et al.  Neural Mechanisms for the Robust Representation of Junctions , 2004, Neural Computation.

[21]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[22]  Raja Chatila,et al.  Combining Structural Descriptions and Image-based Representations for Image, Object, and Scene Recognition , 2005, IJCAI.

[23]  Joachim M. Buhmann,et al.  Regularizing phase-based stereo , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[24]  Murray Shanahan,et al.  Machine Perception using a Blackboard Architecture , 2007 .

[25]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[26]  Joachim M. Buhmann,et al.  Learning the Compositional Nature of Visual Object Categories for Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Olaf Kübler,et al.  Simulation of neural contour mechanisms: from simple to end-stopped cells , 1992, Vision Research.

[28]  John K. Tsotsos Analyzing vision at the complexity level , 1990, Behavioral and Brain Sciences.

[29]  João M. F. Rodrigues,et al.  Multi-scale cortical keypoints for realtime hand tracking and gesture recognition , 2012 .

[30]  Bernd Neumann,et al.  On scene interpretation with description logics , 2006, Image Vis. Comput..

[31]  Gordon Wyeth,et al.  Mapping a Suburb With a Single Camera Using a Biologically Inspired SLAM System , 2008, IEEE Transactions on Robotics.

[32]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.