An Adaptive Hash-Based Text Deduplication for ADS-B Data-Dependent Trajectory Clustering Problem

The Automatic Dependent Surveillance-Broadcast (ADS-B) protocol is installed on aircraft as an alternative to secondary radar. This emerging technology yields a promising type of data: aircraft broadcast their status (location, velocity, etc.) within a specific area, which is very useful for air traffic management (ATM). However, few advanced studies in ATM research have so far approached this kind of data from a machine learning or data mining perspective. Locality Sensitive Hashing (LSH), meanwhile, is a data mining technique often used to find similar items in high-dimensional data, which makes it a reasonable fit for grouping similar flight paths in trajectory data. Motivated by these factors, this paper presents an adaptive LSH-based algorithm, of the kind used in near-duplicate document detection, for the problem of clustering nearest trajectories by representing each trajectory as a bag of words, a representation popular in text mining. To illustrate the proposed method, an experiment is designed and carried out over thirty successive days, using raw ADS-B data collected from FlightAware for Changi International Airport, Singapore. An evaluation based on the Silhouette score shows promising clustering performance.
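
The general idea of treating a trajectory as a bag of words and hashing it like a document can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes MinHash with banding as the LSH scheme, a fixed latitude/longitude grid as the "vocabulary", and hypothetical names and parameters (`trajectory_to_tokens`, `cell_deg`, `num_hashes`, `bands`) chosen only for illustration.

```python
import hashlib
import math
from collections import defaultdict
from itertools import combinations


def trajectory_to_tokens(points, cell_deg=0.1):
    """Turn (lat, lon) points into a set of grid-cell 'words'.

    cell_deg is an assumed grid resolution, not a value from the paper.
    """
    return {
        f"{math.floor(lat / cell_deg)}_{math.floor(lon / cell_deg)}"
        for lat, lon in points
    }


def _hash(token, seed):
    """Deterministic 64-bit hash of a token, varied by seed."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(tokens, num_hashes=32):
    """MinHash signature: the minimum hash per seed over a non-empty token set."""
    return [min(_hash(t, seed) for t in tokens) for seed in range(num_hashes)]


def lsh_buckets(signatures, bands=8):
    """Split each signature into bands; equal bands land in the same bucket."""
    rows = len(next(iter(signatures.values()))) // bands
    buckets = defaultdict(set)
    for name, sig in signatures.items():
        for b in range(bands):
            key = (b, tuple(sig[b * rows:(b + 1) * rows]))
            buckets[key].add(name)
    return buckets


def candidate_pairs(buckets):
    """Pairs of trajectories that share at least one bucket."""
    pairs = set()
    for members in buckets.values():
        pairs.update(combinations(sorted(members), 2))
    return pairs
```

Trajectories whose token sets overlap heavily (high Jaccard similarity) are likely to share at least one band and so become candidate pairs; a clustering step would then group those candidates, and a measure such as the Silhouette score could be used to assess the resulting clusters.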
