A Robust Deep Affinity Network for Multiple Ship Tracking

Multiple ship tracking (MST) is an important task in marine surveillance and ship situational awareness systems. Considerable work has been conducted on multiple object tracking in recent years, but it has focused primarily on pedestrians and automobiles, leaving a gap in studies on MST due to the particularities of complex marine scenes, such as ship scale variations, the long-tailed distribution of ships, and long-term occlusions caused by ship movements. In this article, we present a robust deep affinity network (RoDAN) for MST. To overcome the above difficulties in MST, we start with the basic deep affinity network (DAN) and improve it in three aspects: scale, region, and motion. For the scale dimension, we integrate an atrous spatial pyramid pooling (ASPP) module to improve the modeling ability for multiscale ships. For the region dimension, we propose the joint global region modeling (JGRM) module, which further strengthens the modeling ability of DAN and exploit it to overcome the long-tailed distribution property of ships. For the motion dimension, we propose the motion-matching optimization (MMO) module to fine-tune the tracking results and make our tracker more robust, less reliant on the front-end detector, and ameliorate long-term occlusions. The experimental results demonstrate that our MST method outperforms the state-of-the-art methods. In particular, it reduces the number of ID switches (IDSs) and trajectory fragmentations (FMs), achieving holistically preferable performance. Meanwhile, our method achieves a comparable speed.

[1]  L. Wixson Detecting Salient Motion by Accumulating Directionally-Consistent Flow , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Dietrich Paulus,et al.  Simple online and realtime tracking with a deep association metric , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[3]  Xinqiang Chen,et al.  Accurate Ship Tracking in Maritime Images with Kernelized Correlation Filter and Curve Fitting , 2019, 2019 International Conference on Sensing and Instrumentation in IoT Era (ISSI).

[4]  René Vidal,et al.  A Unified Algebraic Approach to 2-D and 3-D Motion Segmentation , 2004, ECCV.

[5]  Deepu Rajan,et al.  Video Processing From Electro-Optical Sensors for Object Detection and Tracking in a Maritime Environment: A Survey , 2016, IEEE Transactions on Intelligent Transportation Systems.

[6]  Konrad Schindler,et al.  Learning by Tracking: Siamese CNN for Robust Target Association , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[7]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[8]  Bernt Schiele,et al.  Multi-person Tracking by Multicut and Deep Matching , 2016, ECCV Workshops.

[9]  Vitaly Ablavsky,et al.  Background models for tracking objects in water , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[10]  Zhiqiang Shen,et al.  Learning Efficient Convolutional Networks through Network Slimming , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Feiyue Huang,et al.  Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking , 2020, ECCV.

[13]  Luca Iocchi,et al.  Automatic Maritime Surveillance with Visual Target Detection , 2011 .

[14]  Xiaoqin Zhang,et al.  Single and Multiple Object Tracking Using Log-Euclidean Riemannian Subspace and Block-Division Appearance Model , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Andrew Zisserman,et al.  Detect to Track and Track to Detect , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[16]  Horst Bischof,et al.  Semi-supervised On-Line Boosting for Robust Tracking , 2008, ECCV.

[17]  Hui Zhou,et al.  Robust Multi-Modality Multi-Object Tracking , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Nikos Paragios,et al.  Motion-based background subtraction using adaptive kernel density estimation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[19]  Guangzhong Liu,et al.  M3C: Multimodel-and-Multicue-Based Tracking by Detection of Surrounding Vessels in Maritime Environment for USV , 2019, Electronics.

[20]  Silvio Savarese,et al.  Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[22]  Mansour H. Assaf,et al.  Ship track estimation using GPS data and Kalman Filter , 2018, 2018 IEEE International Instrumentation and Measurement Technology Conference (I2MTC).

[23]  Nahum Kiryati,et al.  Piecewise-Smooth Dense Optical Flow via Level Sets , 2006, International Journal of Computer Vision.

[24]  Fredrik Gunnarsson,et al.  Distributed Recursive Gaussian Processes for RSS Map Applied to Target Tracking , 2017, IEEE Journal of Selected Topics in Signal Processing.

[25]  Oliver Sawodny,et al.  Accurate Optoacoustic and Inertial 3-D Pose Tracking of Moving Objects With Particle Filtering , 2020, IEEE Transactions on Instrumentation and Measurement.

[26]  Michael J. Black,et al.  EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation , 1996, International Journal of Computer Vision.

[27]  Hans-Peter Kriegel,et al.  Subspace clustering , 2012, WIREs Data Mining Knowl. Discov..

[28]  Bing Yan,et al.  Ship Tracking with Static Electric Field Based on Adaptive Progressive Update Extended Kalman Filter , 2018 .

[29]  Takeo Kanade,et al.  Background Subtraction for Freely Moving Cameras , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  Erhan Gundogdu,et al.  MARVEL: A Large-Scale Image Dataset for Maritime Vessels , 2016, ACCV.

[31]  Mubarak Shah,et al.  Deep Affinity Network for Multiple Object Tracking , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  James M. Rehg,et al.  Multi-object Tracking with Neural Gating Using Bilinear LSTM , 2018, ECCV.

[33]  Daniel Cremers,et al.  Motion Competition: A Variational Approach to Piecewise Parametric Motion Segmentation , 2005, International Journal of Computer Vision.

[34]  Horst Bischof,et al.  Real-Time Tracking via On-line Boosting , 2006, BMVC.

[35]  Fabio Tozeto Ramos,et al.  Simple online and realtime tracking , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[36]  Francesco Solera,et al.  Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[37]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[39]  Ehud Rivlin,et al.  Robust Fragments-based Tracking using the Integral Histogram , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[40]  W. Eric L. Grimson,et al.  Using adaptive tracking to classify and monitor activities in a site , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[41]  Jitendra Malik,et al.  Object Segmentation by Long Term Analysis of Point Trajectories , 2010, ECCV.

[42]  Nuno Vasconcelos,et al.  Layered Dynamic Textures , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Vinod Nair,et al.  An unsupervised, online learning framework for moving object detection , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[44]  Zygmunt L. Szpak,et al.  Maritime surveillance: Tracking ships inside a dynamic background using a fast level-set , 2011, Expert Syst. Appl..

[45]  Xiaowei Zhou,et al.  Moving Object Detection by Detecting Contiguous Outliers in the Low-Rank Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Laura Leal-Taixé,et al.  Tracking Without Bells and Whistles , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[47]  Zhimin Zhou,et al.  GF-4 Satellite and Automatic Identification System Data Fusion for Ship Tracking , 2019, IEEE Geoscience and Remote Sensing Letters.

[48]  Huafeng Wu,et al.  Robust Ship Tracking via Multi-view Learning and Sparse Representation , 2018, Journal of Navigation.

[49]  Jules-Raymond Tapamo,et al.  Detection and tracking of moving objects in a maritime environment using level set with shape priors , 2013, EURASIP J. Image Video Process..

[50]  Yann LeCun,et al.  Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[51]  Ajmal Mian,et al.  Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking , 2020, ECCV.

[52]  Lu Zhang,et al.  Preserving Structure in Model-Free Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[54]  Piet B. W. Schwering,et al.  Persistent maritime surveillance using multi-sensor feature association and classification , 2012, Defense + Commercial Sensing.

[55]  Philip H. S. Torr,et al.  The Eighth Visual Object Tracking VOT2020 Challenge Results , 2020, ECCV Workshops.

[56]  Wenhan Luo,et al.  Multiple object tracking: A literature review , 2014, Artif. Intell..

[57]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[58]  Guoying Zhao,et al.  Background Subtraction Based on Low-Rank and Structured Sparse Decomposition , 2015, IEEE Transactions on Image Processing.

[59]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[60]  Kevin I-Kai Wang,et al.  A Resetting Approach for INS and UWB Sensor Fusion Using Particle Filter for Pedestrian Tracking , 2020, IEEE Transactions on Instrumentation and Measurement.

[61]  Lei Xiao,et al.  Real-Time Inland CCTV Ship Tracking , 2018, Mathematical Problems in Engineering.

[62]  Rainer Stiefelhagen,et al.  Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics , 2008, EURASIP J. Image Video Process..

[63]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  Ming Yang,et al.  Game-Theoretic Multiple Target Tracking , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[65]  Mohsen Guizani,et al.  Self-Selective Correlation Ship Tracking Method for Smart Ocean System , 2019, ArXiv.

[66]  Antoni B. Chan,et al.  Joint Motion Segmentation and Background Estimation in Dynamic Scenes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[67]  Luca Iocchi,et al.  Argos - a Video Surveillance System for boat Traffic Monitoring in Venice , 2009, Int. J. Pattern Recognit. Artif. Intell..

[68]  René Vidal,et al.  A Benchmark for the Comparison of 3-D Motion Segmentation Algorithms , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.