M3S-NIR: Multi-modal Multi-scale Noise-Insensitive Ranking for RGB-T Saliency Detection

RGB-Thermal saliency detection is to use thermal infrared information to assist salient object detection with visible light information. Multi-Modal Multi-Scale Noise-Insensitive Ranking (M3S-NIR), is proposed for RGB-Thermal (RGB-T) saliency detection. Given spatially aligned RGB and thermal images, M3S-NIR first segments them together into a set of multi-scale superpixels. Second, it takes these superpixels as graph nodes and performs multi-modal multi-scale manifold ranking to achieve saliency calculation, in which the cross-modal and cross-scale collaborations are performed to integrate different kinds of information. Third, to handle noises and corruptions of ranking seeds (i.e., boundary superpixels) introduced by salient objects and RGB-T alignment, M3S-NIR introduces an intermediate variable to infer the optimal ranking seeds, and formulates it as a sparse learning problem. Finally, M3S-NIR uses a unified ADMM (Alternating Direction Method of Multipliers)-based optimization framework to solve the ranking model efficiently. Extensive experiments on the benchmark dataset demonstrate the effectiveness of the proposed approach over other state-of-the-art RGB-T saliency detection methods.

[1]  Lihi Zelnik-Manor,et al.  Context-Aware Saliency Detection , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Xiaogang Wang,et al.  Saliency detection by multi-context deep learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Huchuan Lu,et al.  Salient Object Detection via Multiple Instance Learning , 2017, IEEE Transactions on Image Processing.

[4]  David Dagan Feng,et al.  Robust saliency detection via regularized random walks ranking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[6]  Huchuan Lu,et al.  Saliency Detection via Graph-Based Manifold Ranking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yizhou Yu,et al.  Visual saliency based on multiscale deep features , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Shao-Yi Chien,et al.  Real-Time Salient Object Detection with a Minimum Spanning Tree , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Huchuan Lu,et al.  Saliency detection via Cellular Automata , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  George Hripcsak,et al.  Technical Brief: Agreement, the F-Measure, and Reliability in Information Retrieval , 2005, J. Am. Medical Informatics Assoc..

[11]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Esa Rahtu,et al.  Fast and Efficient Saliency Detection Using Sparse Sampling and Kernel Density Estimation , 2011, SCIA.

[14]  Qiaosong Wang,et al.  GraB: Visual Saliency via Novel Graph Model and Background Priors , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Yunpeng Ma,et al.  A Unified RGB-T Saliency Detection Benchmark: Dataset, Baselines, Analysis and A Novel Approach , 2017, ArXiv.

[16]  Huchuan Lu,et al.  Graph-Regularized Saliency Detection With Convex-Hull-Based Center Prior , 2013, IEEE Signal Processing Letters.

[17]  Aykut Erdem,et al.  Visual saliency estimation by nonlinearly integrating features using region covariances. , 2013, Journal of vision.