Extraction of salient objects based on image clustering and saliency

Over the past decades, numerous methods have been proposed on salient object detection. However, most of these methods need users’ interactions as a prerequisite to control their progress. In this paper, we propose a novel method for extraction of salient objects based on image clustering and saliency map from natural scene images. This method is a combination of image clustering, saliency map generation and automatic initialization. First, a graph based clustering method is applied to split the input image into regions. Second, a saliency map of the input image is generated using the contrast among split regions. From the split regions and generated saliency map, an adaptive threshold is defined, which classify the split regions into foreground and background. After that, the initial mask for object detection is determined using the classified foreground and background clusters and saliency values. A grab-cut with our initial mask is applied to extract the objects of interest, and the experimental results have shown that our proposed method is able to replace manual labeling of initialization in object detection.

[1]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[2]  William A. Barrett,et al.  Object-based image editing , 2002, ACM Trans. Graph..

[3]  Andrew Zisserman,et al.  An Affine Invariant Salient Region Detector , 2004, ECCV.

[4]  John K. Tsotsos,et al.  Attention based on information maximization , 2010 .

[5]  Loong Fah Cheong,et al.  Active segmentation with fixation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[6]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[7]  David Salesin,et al.  A Bayesian approach to digital matting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[8]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[9]  Fernand Meyer,et al.  Topographic distance and watershed lines , 1994, Signal Process..

[10]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[11]  Rama Chellappa,et al.  Moving object segmentation and dynamic scene reconstruction using two frames , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[12]  Gueesang Lee,et al.  Morphological gradient applied to new active contour model for color image segmentation , 2012, ICUIMC.

[13]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[14]  Yuan Yan Tang,et al.  Multiview Hessian discriminative sparse coding for image annotation , 2013, Comput. Vis. Image Underst..

[15]  Nuno Vasconcelos,et al.  Bottom-up saliency is a discriminant process , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[16]  Simone Frintrop,et al.  A Real-time Visual Attention System Using Integral Images , 2007, ICVS 2007.

[17]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[18]  Weifeng Liu,et al.  Multiview Hessian Regularization for Image Annotation , 2013, IEEE Transactions on Image Processing.

[19]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[20]  P. König,et al.  Does luminance‐contrast contribute to a saliency map for overt visual attention? , 2003, The European journal of neuroscience.

[21]  Xing Xie,et al.  Salient Region Detection Using Weighted Feature Maps Based on the Human Visual Attention Model , 2004, PCM.

[22]  Peter Auer,et al.  Object recognition using segmentation for feature detection , 2004, ICPR 2004.

[23]  William A. Barrett,et al.  Intelligent scissors for image composition , 1995, SIGGRAPH.

[24]  Matti Pietikäinen,et al.  Automatic Dynamic Texture Segmentation Using Local Descriptors and Optical Flow , 2013, IEEE Transactions on Image Processing.

[25]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[26]  Nuno Vasconcelos,et al.  Modeling, Clustering, and Segmenting Video with Mixtures of Dynamic Textures , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[28]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.

[29]  Marie-Pierre Jolly,et al.  Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.

[30]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[31]  K. Parvati,et al.  Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation , 2008 .

[32]  Pierre Baldi,et al.  Bayesian surprise attracts human attention , 2005, Vision Research.

[33]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Yoshinori Hara,et al.  On image segmentation for object-based image retrieval , 2002, Object recognition supported by user interaction for service robots.

[35]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[36]  Gareth Funka-Lea,et al.  Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[37]  Huchuan Lu,et al.  Bayesian Saliency via Low and mid Level Cues , 2022 .