论文信息 - An efficient algorithm for attention-driven image interpretation from segments

An efficient algorithm for attention-driven image interpretation from segments

In the attention-driven image interpretation process, an image is interpreted as containing several perceptually attended objects as well as the background. The process benefits greatly a content-based image retrieval task with attentively important objects identified and emphasized. An important issue to be addressed in an attention-driven image interpretation is to reconstruct several attentive objects iteratively from the segments of an image by maximizing a global attention function. The object reconstruction is a combinational optimization problem with a complexity of 2^N which is computationally very expensive when the number of segments N is large. In this paper, we formulate the attention-driven image interpretation process by a matrix representation. An efficient algorithm based on the elementary transformation of matrix is proposed to reduce the computational complexity to 3@wN(N-1)^2/2, where @w is the number of runs. Experimental results on both the synthetic and real data show a significantly improved processing speed with an acceptable degradation to the accuracy of object formulation.

[1] C. Koch,et al. Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[2] N. Suga,et al. Criticisms of 'Specific long-term memory traces in primary auditory cortex' , 2004, Nature Reviews Neuroscience.

[3] Scott B. Steinman,et al. Computational Models of Visual Attention , 2002 .

[4] Fan Chung,et al. Spectral Graph Theory , 1996 .

[5] E. Reingold,et al. Combinatorial Algorithms: Theory and Practice , 1977 .

[6] Zheru Chi,et al. Attention-driven image interpretation with application to image retrieval , 2006, Pattern Recognit..

[7] J ValdésJulio,et al. 2006 Special issue , 2006 .

[8] Pietro Perona,et al. Selective visual attention enables learning and recognition of multiple objects in cluttered scenes , 2005, Comput. Vis. Image Underst..

[9] Weisi Lin,et al. Modeling visual attention's modulatory aftereffects on visual sensitivity and quality evaluation , 2005, IEEE Transactions on Image Processing.

[10] B. S. Manjunath,et al. Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[11] Joachim M. Buhmann,et al. A minimum entropy approach to adaptive image polygonization , 2003, IEEE Trans. Image Process..

[12] Lie Lu,et al. A generic framework of user attention model and its application in video summarization , 2005, IEEE Trans. Multim..

[13] Laurent Itti,et al. Automatic foveation for video compression using a neurobiological model of visual attention , 2004, IEEE Transactions on Image Processing.

[14] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15] H. Joseph Straight. Combinatorics: An Invitation , 1993 .

[16] J Theeuwes,et al. Visual selective attention: a theoretical analysis. , 1993, Acta psychologica.

[17] Jeremy M. Wolfe,et al. The Level of Attention: Mediating Between the Stimulus and Perception , 2003 .

[18] Patrick Le Callet,et al. A coherent computational approach to model bottom-up visual attention , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Pietro Perona,et al. Is bottom-up attention useful for object recognition? , 2004, CVPR 2004.

[20] Albert Ali Salah,et al. A Selective Attention-Based Method for Visual Pattern Recognition with Application to Handwritten Digit Recognition and Face Recognition , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[21] A. D. Kraus,et al. Matrices for engineers , 1987 .

[22] Wan-Chi Siu,et al. Multimedia Information Retrieval and Management , 2003 .

[23] Kosuke Sato,et al. Real-time gesture recognition by learning and selective control of visual interest points , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24] Lei Guo,et al. Automatic attention object extraction from images , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[25] Christof Koch,et al. Modeling attention to salient proto-objects , 2006, Neural Networks.

[26] Takashi Matsuyama,et al. Multiobject Behavior Recognition by Event Driven Selective Attention Method , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[27] J. Wolfe,et al. What attributes guide the deployment of visual attention and how do they do it? , 2004, Nature Reviews Neuroscience.

[28] Zhuowen Tu,et al. Image Parsing: Unifying Segmentation, Detection, and Recognition , 2005, International Journal of Computer Vision.

[29] Gunther Heidemann,et al. Focus-of-attention from local color symmetries , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] S Ullman,et al. Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[31] Fuhui Long,et al. Fundamentals of Content-Based Image Retrieval , 2003 .

[32] Hemant D. Tagare,et al. A Maximum-Likelihood Strategy for Directing Attention during Visual Search , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[33] Chun-Jen Tsai,et al. Visual sensitivity guided bit allocation for video coding , 2006, IEEE Transactions on Multimedia.

[34] Bärbel Mertsching,et al. Data- and Model-Driven Gaze Control for an Active-Vision System , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[35] Michael Lindenbaum,et al. Attention-based dynamic visual search using inner-scene similarity: algorithms and bounds , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36] Christof Koch,et al. A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .