Machine learning model development for predicting road transport GHG emissions in Canada

Abstract Prediction of greenhouse gas (GHG) emissions is important to minimise their negative impact on climate change and global warming. In this article, we propose new models based on data mining and supervised machine learning algorithms (regression and classification) for predicting GHG emissions arising from passenger and freight road transport in Canada. Four models are investigated, namely, artificial neural network multilayer perceptron, multiple linear regression, multinomial logistic regression and decision tree models. From the results, it was found that artificial neural network multilayer perceptron model showed better predictive performance over other models. Ensemble technique (Bagging & Boosting) was applied on the developed multilayer perceptron model, which significantly improved the model’s predictive performance.

[1]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[2]  Saed Sayad Real Time Data Mining , 2011 .

[3]  Andrew Lewis,et al.  Let a biogeography-based optimizer train your Multi-Layer Perceptron , 2014, Inf. Sci..

[4]  Mohammed Erritali,et al.  A comparative study of decision tree ID3 and C4.5 , 2014 .

[5]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[6]  Marcelo Ângelo Cirillo,et al.  Data classification with binary response through the Boosting algorithm and logistic regression , 2017, Expert Syst. Appl..

[7]  Wilfried Winiwarter,et al.  Assessing the uncertainty associated with national greenhouse gas emission inventories:: a case study for Austria , 2001 .

[8]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[9]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[10]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[11]  Gene Smith,et al.  Session 5: Special Topics , 2009 .

[12]  Ahmed F. Mashaly,et al.  MLP and MLR models for instantaneous thermal efficiency prediction of solar still under hyper-arid environment , 2016, Comput. Electron. Agric..

[13]  A. Viera,et al.  Understanding interobserver agreement: the kappa statistic. , 2005, Family medicine.

[14]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[15]  C. Y. Peng,et al.  An Introduction to Logistic Regression Analysis and Reporting , 2002 .

[16]  Ridho K. Wattimena,et al.  Predicting the stability of hard rock pillars using multinomial logistic regression , 2014 .

[17]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[18]  Christian W. Dawson,et al.  An artificial neural network approach to rainfall-runoff modelling , 1998 .

[19]  Xin Yan,et al.  Linear Regression Analysis: Theory and Computing , 2009 .

[20]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .