Development of Machine Learning-based Predictive Models for Air Quality Monitoring and Characterization

One of the biggest environmental problems right now is air pollution. Air quality is needed to be consistently monitored and assessed to ensure better living conditions. The U.S. Environmental Protection Agency (EPA) uses the air quality index (AQI) to standardize the air quality. However, AQI requires precise and accurate sensor readings and complex calculations, making it not feasible for portable air quality monitoring devices. The aim of this paper is to find an alternative way of monitoring and characterizing air quality through the use of integrated gas sensors and building predictive models using machine learning algorithms that can be used to obtain data-driven solutions to mitigate the risk of air pollution. The proposed methodology is implemented by building a prototype for the integrated sensors using DHT 11 temperature and relative humidity sensor, MQ2, MQ5 and MQ135 gas sensors. Five predictive models are developed in the study, k-nearest neighbors (KNN), support vector machine (SVM), Naïve-Bayesian classifier, random forest and neural network. Results show that the researchers are able to obtain an accuracy of 98.67%, 97.78%, 98.67%, 94.22%, and 99.56% for all the five models respectively, having the neural network to be the best performing model.

[1]  Juan He,et al.  Design of an intelligent indoor air quality monitoring and purification device , 2017, 2017 IEEE 3rd Information Technology and Mechatronics Engineering Conference (ITOEC).

[2]  Agostino Di Ciaccio,et al.  Computational Statistics and Data Analysis Measuring the Prediction Error. a Comparison of Cross-validation, Bootstrap and Covariance Penalty Methods , 2022 .

[3]  Alejandro H. Ballado,et al.  Microcontroller and app-based air quality monitoring system for particulate matter 2.5 (PM2.5) and particulate matter 1 (PM1) , 2017, 2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM).

[4]  You-Chiun Wang,et al.  Efficient Data Gathering and Estimation for Metropolitan Air Quality Monitoring by Using Vehicular Sensor Networks , 2017, IEEE Transactions on Vehicular Technology.

[5]  Rashmi Agrawal K-Nearest Neighbor for Uncertain Data , 2014 .

[6]  Zhe Yang,et al.  Mobile microscopy and machine learning provide accurate and high-throughput monitoring of air quality , 2017, 2017 Conference on Lasers and Electro-Optics (CLEO).

[7]  M Chiwewe Tapiwa,et al.  Machine learning based estimation of Ozone using spatio-temporal data from air quality monitoring stations , 2016 .

[8]  M. Carmen Garrido,et al.  A More Realistic K-Nearest Neighbors Method and Its Possible Applications to Everyday Problems , 2017, 2017 International Conference on Intelligent Environments (IE).

[9]  Judith Molka-Danielsen,et al.  Big Data Analytics for Air Quality Monitoring at a Logistics Shipping Base via Autonomous Wireless Sensor Network Technologies , 2017, 2017 5th International Conference on Enterprise Systems (ES).