Segmentation of Underwater Object in Videos

Video segmentation is a necessary step for object tracking. Existing methods that are used to extract object from the background based on an intensive sequence of searching all across the frames, thus this process performs lots of searching works result with low efficiency, whereas other methods obtain segmentation by clustering pixels which resulting in over-segmentation. Inspired by breakthroughs in semantic segmentation, in this paper, we propose to combine appearance and dynamic cues, which is a common conception and plays a key role in successfully segmenting objects in videos. To implement this idea, we combine Deep Convolutional Neural Network (DCNN) and optical flow information of two continuous frames. To overcome the difficulty of segmentation of underwater object in videos induced by the presence of different types of suspension particle from like the water droplets and dust particles to the poor lighting and over lighting conditions, In this work, Contrast-Limited Adaptive Histogram Equalization (CLAHE) and a simple color resign method are used to enhance details and reduce greenish and bluish effects. Some DCNN variants are applied to semantic segmentation and achieve great efficiency. Specifically, because DCNN can obtain different spatial scale information, as a DCNN variant, DeepLab gets a good performance in semantic segmentation. By using atrous convolution, DeepLab network's filters can observe greater receptive field without reducing the feature map dimension, therefore this structure keeps global and position information. Consequently, we compromises above mentioned methods, The optical flow estimation is carried out on the image processed by the CLAHE method, and the accurate segmentation results are obtained by using the DeepLab network. Experiments show good performance of our method.

[1]  Kristen Grauman,et al.  Supervoxel-Consistent Foreground Propagation in Video , 2014, ECCV.

[2]  Kristen Grauman,et al.  FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Mei Han,et al.  Efficient hierarchical graph-based video segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Bernt Schiele,et al.  Video Segmentation with Superpixels , 2012, ACCV.

[5]  Vittorio Ferrari,et al.  Fast Object Segmentation in Unconstrained Video , 2013, 2013 IEEE International Conference on Computer Vision.

[6]  Yong Jae Lee,et al.  Key-segments for video object segmentation , 2011, 2011 International Conference on Computer Vision.

[7]  Chenliang Xu,et al.  Streaming Hierarchical Video Segmentation , 2012, ECCV.

[8]  Thomas Brox,et al.  Video Segmentation with Just a Few Strokes , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Jitendra Malik,et al.  Tracking as Repeated Figure/Ground Segmentation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Jitendra Malik,et al.  Object Segmentation by Long Term Analysis of Point Trajectories , 2010, ECCV.

[11]  Mubarak Shah,et al.  Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Jitendra Malik,et al.  Large displacement optical flow , 2009, CVPR.

[13]  Antonio Torralba,et al.  LabelMe video: Building a video database with human annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[14]  James M. Rehg,et al.  Motion Coherent Tracking with Multi-label MRF optimization , 2010, BMVC.

[15]  Michal Irani,et al.  Video Segmentation by Non-Local Consensus voting , 2014, BMVC.

[16]  James M. Rehg,et al.  Combining Self Training and Active Learning for Video Segmentation , 2011, BMVC.

[17]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[18]  William Brendel,et al.  Video object segmentation by tracking regions , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[19]  Jian Sun,et al.  Video object cut and paste , 2005, SIGGRAPH 2005.

[20]  Guillermo Sapiro,et al.  Video SnapCut: robust video object cutout using localized classifiers , 2009, ACM Trans. Graph..

[21]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Dimitris N. Metaxas,et al.  ]Video object segmentation by hypergraph cut , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Ronan Collobert,et al.  Learning to Refine Object Segments , 2016, ECCV.

[25]  Maneesh Agrawala,et al.  Interactive video cutout , 2005, SIGGRAPH 2005.