Knowledge extraction from maritime spatiotemporal data: An evaluation of clustering algorithms on Big Data

In this paper we attempt to define the major trade routes which vessels of trade follow when travelling across the globe in a scalable, data-driven unsupervised way. For this, we exploit a large volume of historical AIS data, so as to estimate the location and connections of the major trade routes, with minimal reliance on other sources of information. We address the challenges posed due to the volume of data by leveraging distributed computing techniques and present a novel MapReduce based algorithmic approach, capable of handling skewed and nonuniform geospatial data. In the direction, we calculate and compare the performance (execution time and compression ratio) and accuracy of several mature clustering algorithms and present preliminary results.

[1]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[2]  Jing Deng,et al.  Ship trajectory prediction for intelligent traffic management using clustering and ANN , 2016, 2016 UKACC 11th International Conference on Control (CONTROL).

[3]  Rikard Laxhammar,et al.  Anomaly detection for sea surveillance , 2008, 2008 11th International Conference on Information Fusion.

[4]  Derya Birant,et al.  ST-DBSCAN: An algorithm for clustering spatial-temporal data , 2007, Data Knowl. Eng..

[5]  A. Skorodumovs,et al.  Methods for Processing and Interpretation of AIS Signals Corrupted by Noise and Packet Collisions , 2012 .

[6]  Flora D. Salim,et al.  Clustering Big Spatiotemporal-Interval Data , 2016, IEEE Transactions on Big Data.

[7]  Stan Matwin,et al.  Improving Fishing Pattern Detection from Satellite AIS Using Data Mining and Machine Learning , 2016, PloS one.

[8]  Andrew W. Moore,et al.  X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.

[9]  Jakub Montewka,et al.  A method for detecting possible near miss ship collisions from AIS data , 2015 .

[10]  Jos van Hillegersberg,et al.  Using machine learning for unsupervised maritime waypoint discovery from streaming AIS data , 2015, I-KNOW.

[11]  Michele Vespe,et al.  Vessel Pattern Knowledge Discovery from AIS Data: A Framework for Anomaly Detection and Route Prediction , 2013, Entropy.

[12]  Sanjay Garg,et al.  Development and validation of OPTICS based spatio-temporal clustering technique , 2016, Inf. Sci..

[13]  Jarke J. van Wijk,et al.  Eurographics/ Ieee-vgtc Symposium on Visualization 2009 Visualization of Vessel Movements , 2022 .

[14]  Jarke J. van Wijk,et al.  Evaluation of the Visibility of Vessel Movement Features in Trajectory Visualizations , 2011, Comput. Graph. Forum.

[15]  Fabio Mazzarella,et al.  Discovering vessel activities at sea using AIS data: Mapping of fishing footprints , 2014, 17th International Conference on Information Fusion (FUSION).