Paper information - COTS: A Multipurpose RGB-D Dataset for Saliency and Image Manipulation Applications

COTS: A Multipurpose RGB-D Dataset for Saliency and Image Manipulation Applications

Many modern computer vision systems include several modules that perform different processing operations packaged as a single pipeline architecture. This generally introduces a challenge in the evaluation process since most datasets provide evaluation data for just one of the operations. In this paper, we present an RGB-D dataset that was designed from first principles to cater for applications that involve salient object detection, segmentation, inpainting and blending techniques. This addresses a gap in the evaluation of image inpainting and blending applications, which generally rely on subjective evaluation due to the lack of comparative data. A set of experiments was carried out to demonstrate how the COTS dataset can be used to evaluate these different applications. The dataset includes a variety of scenes, where each scene is captured multiple times, each time adding a new object to the previous scene. This allows for a comparative analysis at pixel level in image inpainting and blending applications. Moreover, all objects were manually labeled in order to offer the possibility of salient object detection even in scenes that contain multiple objects. An online test with 1267 participants was also carried out, and the dataset includes the click coordinates of users’ selection for every image, introducing a user interaction dimension in the same RGB-D dataset. The dataset was also validated using state-of-the-art techniques, obtaining an $F_\beta$ of 0.957 in salient object detection and a mean Intersection over Union (IoU) of 0.942 in segmentation. Results demonstrate that the COTS dataset introduces novel possibilities for the evaluation of computer vision applications.
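As a rough illustration of the measures named in the abstract, the sketch below computes $F_\beta$ and IoU for a pair of binary masks, plus a simple pixel-level error between a processed image and a reference capture of the same scene. It is not the authors' evaluation code. The function names, the default $\beta^2 = 0.3$ (a value commonly used in salient object detection work, not stated in the abstract), and the choice of mean absolute error as the pixel-level measure are all assumptions made for this example.

```python
import numpy as np


def f_beta(pred, gt, beta2=0.3):
    """F-measure between two binary masks; beta2 is beta squared (assumed 0.3)."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    tp = np.logical_and(pred, gt).sum()
    precision = tp / pred.sum() if pred.sum() else 0.0
    recall = tp / gt.sum() if gt.sum() else 0.0
    denom = beta2 * precision + recall
    return float((1 + beta2) * precision * recall / denom) if denom else 0.0


def iou(pred, gt):
    """Intersection over Union between two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    # Two empty masks are treated as a perfect match (a convention chosen here).
    return float(np.logical_and(pred, gt).sum() / union) if union else 1.0


def mean_abs_error(result, reference):
    """Mean absolute pixel difference between a processed image and a reference."""
    a = np.asarray(result, dtype=np.float64)
    b = np.asarray(reference, dtype=np.float64)
    return float(np.abs(a - b).mean())


# Hypothetical 4x4 example: the prediction covers the object plus two extra pixels.
gt_mask = np.zeros((4, 4), dtype=bool)
gt_mask[1:3, 1:3] = True
pred_mask = np.zeros((4, 4), dtype=bool)
pred_mask[1:3, 1:4] = True

print(f_beta(pred_mask, gt_mask))
print(iou(pred_mask, gt_mask))
```

In a dataset where the same scene is captured with and without an object, `mean_abs_error` could be applied between an inpainted image and the capture taken before the object was added, which is the kind of pixel-level comparison the abstract describes.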

Carl James Debono | Dylan Seychell | Matthew Sacco | Mark Bugeja | Jeremy Borg
