Improving maritime traffic emission estimations on missing data with CRBMs

Abstract: Maritime traffic emissions are a major concern for governments because they heavily affect air quality in coastal cities. Ships use the Automatic Identification System (AIS) to continuously report position and speed, among other features, so this data can be used to estimate emissions when combined with engine data. However, important ship features are often inaccurate or missing. State-of-the-art complex systems, such as CALIOPE at the Barcelona Supercomputing Center, are used to model air quality. These systems can benefit from AIS-based emission models, which locate pollution very precisely. Unfortunately, these models are sensitive to missing or corrupted data, and therefore need data curation techniques to significantly improve estimation accuracy. In this work, we propose a methodology for treating ship data that uses Conditional Restricted Boltzmann Machines (CRBMs) plus machine learning methods to improve the quality of the data passed to emission models; the methodology can also be applied to other GPS and time-series problems. Results show that it improves on the default methods proposed to cover missing data, and that it allows the models to detect emissions that would otherwise go undetected. In particular, using a real data set of AIS data provided by the Spanish Port Authority, we estimate that with our method the model detected 45% additional emissions, representing 152 tonnes of pollutants per week in Barcelona. We also propose new features that may enhance emission modeling.
