Fuel Prediction and Reduction in Public Transportation by Sensor Monitoring and Bayesian Networks

We exploit the use of a controller area network (CAN-bus) to monitor sensors on the buses of local public transportation in a big European city. The aim is to advise fleet managers and policymakers on how to reduce fuel consumption so that air pollution is controlled and public services are improved. We deploy heuristic algorithms and exhaustive ones to generate Bayesian networks among the monitored variables. The aim is to describe the relevant relationships between the variables, to discover and confirm the possible cause–effect relationships, to predict the fuel consumption dependent on the contextual conditions of traffic, and to enable an intervention analysis to be conducted on the variables so that our goals are achieved. We propose a validation technique using Bayesian networks based on Granger causality: it relies upon observations of the time series formed by successive values of the variables in time. We use the same method based on Granger causality to rank the Bayesian networks obtained as well. A comparison of the Bayesian networks discovered against the ground truth is proposed in a synthetic data set, specifically generated for this study: the results confirm the validity of the Bayesian networks that agree on most of the existing relationships.

[1]  Dimitris Margaritis,et al.  Speculative Markov blanket discovery for optimal feature selection , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[2]  Thomas Bousonville,et al.  Estimating truck fuel consumption with machine learning using telematics, topology and weather data , 2019, 2019 International Conference on Industrial Engineering and Systems Management (IESM).

[3]  R. Bouckaert Bayesian belief networks : from construction to inference , 1995 .

[4]  David Maxwell Chickering,et al.  A Transformational Characterization of Equivalent Bayesian Network Structures , 1995, UAI.

[5]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[6]  Jörg Breitung,et al.  Testing for short- and long-run causality: A frequency-domain approach , 2006 .

[7]  D. H. Vu,et al.  A variance inflation factor and backward elimination based robust regression model for forecasting monthly electricity demand using climatic variables , 2015 .

[8]  J. Pearl,et al.  Causal Inference in Statistics: A Primer , 2016 .

[9]  Judea Pearl,et al.  The seven tools of causal inference, with reflections on machine learning , 2019, Commun. ACM.

[10]  C. Granger Investigating Causal Relations by Econometric Models and Cross-Spectral Methods , 1969 .

[11]  Luís C. Neves,et al.  Application of machine learning for fuel consumption modelling of trucks , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[12]  Sandareka Wickramanayake,et al.  Fuel consumption prediction of fleet vehicles using Machine Learning: A comparative study , 2016, 2016 Moratuwa Engineering Research Conference (MERCon).

[13]  Haytham Elghazel,et al.  A hybrid algorithm for Bayesian network structure learning with application to multi-label learning , 2014, Expert Syst. Appl..

[14]  S. Shafer All models are wrong. , 2012, Anesthesiology.

[15]  Luis F. Miranda-Moreno,et al.  Vehicular fuel consumption estimation using real-world measures through cascaded machine learning modeling , 2020 .

[16]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[17]  Chang Liu,et al.  Vehicle Fuel Consumption Prediction Method Based on Driving Behavior Data Collected from Smartphones , 2020 .

[18]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[19]  James M. Robins,et al.  On the Validity of Covariate Adjustment for Estimating Causal Effects , 2010, UAI.

[20]  Jan-Willem Romeijn,et al.  ‘All models are wrong...’: an introduction to model uncertainty , 2012 .

[21]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[22]  C. Henderson,et al.  Bayesian networks : A guide for their application in natural resource management and policy March , 2010 .

[23]  R. W. Robinson Counting unlabeled acyclic digraphs , 1977 .

[24]  D. Margaritis Learning Bayesian Network Model Structure from Data , 2003 .

[25]  Javier E. Contreras-Reyes,et al.  Assessing Granger-Causality in the Southern Humboldt Current Ecosystem Using Cross-Spectral Methods , 2020, Entropy.

[26]  Maria Samarakou,et al.  OBD-II sensor diagnostics for monitoring vehicle operation and consumption , 2020 .

[27]  Nir Friedman,et al.  Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm , 1999, UAI.

[28]  T. Grigoratos,et al.  How accurately can we measure vehicle fuel consumption in real world operation? , 2021 .

[29]  G. Box,et al.  On a measure of lack of fit in time series models , 1978 .

[30]  Pedro Larrañaga,et al.  Learning Bayesisan Networks by Genetic Algorithms: A Case Study in the Prediction of Survival in Malignant Skin Melanoma , 1997, AIME.

[31]  Zina Ben Miled,et al.  A Machine Learning Model for Average Fuel Consumption in Heavy Vehicles , 2019, IEEE Transactions on Vehicular Technology.

[32]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .