An EMD based approach for Saliency Detection in Multimedia Data

The advancement of multimedia technology has increased the problem of its storage, transmission and display etc. As this multimedia data is mostly targeted to humans, multimedia researchers are working in field of incorporation of human’s perception, action related abilities. Detection of interesting content in images or videos based on human’s attention-based perspective is termed as Saliency. Various existing popular methods which have been used for saliency estimation were based on techniques like Difference of Gradients (DoG), K1 Divergence. These methods are time consuming and are applicable on selected parts of input only. This causes need to find a better approach which should be time-efficient and provides better results. In this paper Earth Mover’s Distance (EMD) based approach is adopted with bottom-up features-based saliency estimation for image and video data. Result shows that incorporation of EMD which is based on calculating histogram differences to find minimal cost incurred in transforming input data as close as output data, provides better results in terms of time consumption.

[1]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[2]  Gert Kootstra,et al.  Gestalt Principles for Attention and Segmentation in Natural and Artificial Vision Systems , 2011 .

[3]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[4]  Bin Fang,et al.  A Computational Model for Saliency Maps by Using Local Entropy , 2010, AAAI.

[5]  Yuan Yan Tang,et al.  A Visual-Attention Model Using Earth Mover's Distance-Based Saliency Measurement and Nonlinear Feature Combination , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Emmanuel Dellandréa,et al.  A People Counting System Based on Face Detection and Tracking in a Video , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[7]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[8]  M. Schlossberg Matters of Intelligence: Conceptual Structures in Cognitive Neuroscience. , 1988 .

[9]  George Ghinea,et al.  Discovering salient objects from videos using spatiotemporal salient region detection , 2015, Signal Process. Image Commun..

[10]  F. Liu,et al.  Modelling saliency attention to predict eye direction by topological structure and earth mover’s distance , 2017, PloS one.

[11]  Alan C. Bovik,et al.  Perceptual Video Processing: Seeing the Future , 2010, Proc. IEEE.

[12]  Chuancai Liu,et al.  Salient Object Detection Based on Context and Location Prior , 2015 .

[13]  Frédo Durand,et al.  Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[14]  Özgür Ulusoy,et al.  Automatic detection of salient objects and spatial relations in videos for a video database system , 2008, Image Vis. Comput..

[15]  Lin Yang,et al.  Unified Saliency Detection Model Using Color and Texture Features , 2016, PloS one.

[16]  Trevor Darrell,et al.  Fast contour matching using approximate earth mover's distance , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..