Using machine learning for unsupervised maritime waypoint discovery from streaming AIS data

Estimating the future position of a deep sea vessel more than 24 hours in advance is a major challenge for Dutch logistics service providers (LSPs). Their unscheduled arrival in ports directly impacts scheduling and waiting times of barges, propagating throughout the entire supply chain network. To help LSPs' planners improve planning operations, we intend to capture the characteristics of maritime routes for a specific region (the North Sea connecting the Netherlands and United Kingdom) in the form of a directed graph, which can be used as a foundation for predicting destination and arrival time of each associated vessel. To create such graph we need an efficient way to extract waypoints for traffic data and this is the problem we will address in this paper. Since LSPs only use publicly available data for arrival estimation, our solution is entirely based on Automatic Identification System (AIS) data. Extracting positional information from AIS, we explore various machine learning approaches to identify clusters. We apply DBSCAN algorithm and show its advantages and disadvantages when used on AIS data. The same process is repeated using meta-heuristics, comparing clustering results generated by a genetic algorithm and by modified ant-colony optimization to those produced by DBSCAN. Finally, we present a hybrid approach and its ability to discover waypoints, highlighting the achieved improvements. To extend the problem, two constraints are added. The first is the requirement to handle large volumes of streaming AIS data on standard PC-based hardware. The second introduces the common situation of "dark areas" in a map due to problems with receiving and transmitting AIS data. The algorithm discovers route waypoints in efficient and effective ways under these constraints.

[1]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[2]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[3]  A. Harati-Mokhtari,et al.  Automatic Identification System (AIS): Data Reliability and Human Error Implications , 2007, Journal of Navigation.

[4]  M. Vespe,et al.  Unsupervised learning of maritime traffic patterns for anomaly detection , 2012 .

[5]  Wout Dullaert,et al.  Schedule Unreliability in Liner Shipping: Origins and Consequences for the Hinterland Supply Chain , 2007 .

[6]  Maria-Eugenia Iacob,et al.  Towards an Approach for Long Term AIS-Based Prediction of Vessel Arrival Times , 2016 .

[7]  Wen-Chih Peng,et al.  A Framework of Moving Behavior Modeling in the Maritime Surveillance , 2011 .

[8]  Samir Chatterjee,et al.  A Design Science Research Methodology for Information Systems Research , 2008 .

[9]  Michele Vespe,et al.  Vessel Pattern Knowledge Discovery from AIS Data: A Framework for Anomaly Detection and Route Prediction , 2013, Entropy.

[10]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[11]  Paolo Braca,et al.  Detection of malicious AIS position spoofing by exploiting radar information , 2013, Proceedings of the 16th International Conference on Information Fusion.

[12]  Helwig Hauser,et al.  Visual Analysis of Multivariate Movement Data using Interactive Difference Views , 2010, VMV.

[13]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[14]  Changqing Liu,et al.  Vessel Track Recovery With Incomplete AIS Data Using Tensor CANDECOM/PARAFAC Decomposition , 2014 .