A Proto-Object Based Dynamic Visual Saliency Model with an FPGA Implementation

The ability to attend to salient regions of a visual scene is an innate and necessary preprocessing step for both biological and engineered systems performing high-level visual tasks (e.g., object detection, tracking, and classification). Devoting computational resources only to the salient regions of the visual stimulus improves computational efficiency, in terms of both processing bandwidth and speed. In this paper, we first present a biologically plausible, bottom-up, dynamic visual saliency model based on the notion of proto-objects. The model is made dynamic by incorporating the temporal characteristics of the visual stimulus, similar to the manner in which early stages of the human visual system extract temporal information. This model outperforms state-of-the-art dynamic visual saliency models in predicting human eye fixations on a commonly used video dataset with associated eye-tracking data. Second, for this model to have practical applications, it must be capable of performing its computations in real time under low-power, small-size, and lightweight constraints. To address this, we introduce a Field-Programmable Gate Array (FPGA) implementation of the model on an Opal Kelly 7350 Kintex-7 board. This novel hardware implementation processes up to 23.35 frames per second on a 100 MHz clock, a speedup of more than 26x over the software implementation.
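The throughput figures quoted above imply a per-frame cycle budget and an upper bound on the software frame rate. The following is a minimal arithmetic sketch, not part of the reported implementation; the function names are hypothetical, and it assumes the 23.35 frames per second is sustained at the stated 100 MHz clock and that "better than 26x" means a speedup strictly greater than 26.

```python
# Figures taken from the abstract; everything else is illustrative.
FPGA_FPS = 23.35      # peak frames per second reported for the FPGA
CLOCK_HZ = 100e6      # 100 MHz clock
MIN_SPEEDUP = 26.0    # speedup over software is stated as better than 26x


def cycles_per_frame(clock_hz: float, fps: float) -> float:
    """Clock cycles available to process one frame at the given rate."""
    return clock_hz / fps


def frame_period_ms(fps: float) -> float:
    """Time budget per frame in milliseconds."""
    return 1000.0 / fps


def software_fps_upper_bound(fpga_fps: float, min_speedup: float) -> float:
    """Software frame rate implied by a speedup of at least min_speedup."""
    return fpga_fps / min_speedup


if __name__ == "__main__":
    print(f"cycles per frame:  {cycles_per_frame(CLOCK_HZ, FPGA_FPS):,.0f}")
    print(f"frame period (ms): {frame_period_ms(FPGA_FPS):.2f}")
    print(f"software fps <     {software_fps_upper_bound(FPGA_FPS, MIN_SPEEDUP):.3f}")
```

Under these assumptions the hardware has roughly 4.28 million clock cycles (about 42.8 ms) per frame, and the software baseline runs at under 0.9 frames per second.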
