Using Machine Learning in the Prediction of the Influence of Atmospheric Parameters on Health

Technological development has brought humanity to the era of an information society in which information is the main driver. This implies existing large amounts of data from which knowledge should be extracted. In this sense, artificial intelligence represents a trend applied in many areas of human activity. This paper is focused on ensemble modeling based on the use of several machine learning algorithms, which enable the prediction of the risk to human health due to the state of atmospheric factors. The model uses two multi-agents as a technique of emergent intelligence to make a collective decision. The first agent makes a partial decision on the prediction task by learning from the available historical data. In contrast, the second agent does the same from the data available in real-time. The proposed prediction model was evaluated in a case study related to the city of Niš, Republic of Serbia, and showed a better result than each algorithm separately. It represents a reasonable basis for further upgrading both in the scope of different groups of the atmospheric parameters and in the methodological sense, as well as technically through implementation in a practical web citizen service.

[1]  Y. Lim,et al.  Forecasting of non-accidental, cardiovascular, and respiratory mortality with environmental exposures adopting machine learning approaches , 2022, Environmental Science and Pollution Research.

[2]  R. Malekzadeh,et al.  Spatial environmental factors predict cardiovascular and all-cause mortality: Results of the SPACE study , 2022, PloS one.

[3]  Xuemei Wang,et al.  The Development and Application of Machine Learning in Atmospheric Environment Studies , 2021, Remote. Sens..

[4]  J. Botai,et al.  A Literature Review of the Impacts of Heat Stress on Human Health across Africa , 2021, Sustainability.

[5]  C. Rosendorff,et al.  The Impact of Environmental Factors on the Mortality of Patients with Chronic Heart Failure. , 2021, The American journal of cardiology.

[6]  Josue Rodolfo Cuevas Juarez,et al.  Machine Learning-Based Prediction of Air Quality , 2020, Applied Sciences.

[7]  Panayiotis E. Pintelas,et al.  Special Issue on Ensemble Learning and Applications , 2020, Algorithms.

[8]  Hongzhang Xu,et al.  Deep learning in environmental remote sensing: Achievements and challenges , 2020, Remote Sensing of Environment.

[9]  Renato Umeton,et al.  Automated machine learning: Review of the state-of-the-art and opportunities for healthcare , 2020, Artif. Intell. Medicine.

[10]  Qing Li,et al.  Air Quality Index and Air Pollutant Concentration Prediction Based on Machine Learning Algorithms , 2019, Applied Sciences.

[11]  Jian Wang,et al.  Deep learning and its application in geochemical mapping , 2019, Earth-Science Reviews.

[12]  Weijian Zhou,et al.  Severe haze in northern China: A synergy of anthropogenic emissions and atmospheric processes , 2019, Proceedings of the National Academy of Sciences.

[13]  H Murfi,et al.  Ensemble learning for predicting mortality rates affected by air quality , 2019, Journal of Physics: Conference Series.

[14]  Indri Sulistianingsih,et al.  C4.5 Algorithm Modeling For Decision Tree Classification Process Against Status UKM , 2018 .

[15]  kwang-yul kim,et al.  Analysis of source regions and meteorological factors for the variability of spring PM 10 concentrations in Seoul, Korea , 2018 .

[16]  Yuanyuan Wang,et al.  Daily air quality index forecasting with hybrid models: A case in China. , 2017, Environmental pollution.

[17]  Renjian Zhang,et al.  Roles of regional transport and heterogeneous reactions in the PM2.5 increase during winter haze episodes in Beijing. , 2017, The Science of the total environment.

[18]  G. Peters,et al.  Effects of atmospheric transport and trade on air pollution mortality in China , 2017 .

[19]  Marcella Busilacchio,et al.  Recursive neural network model for analysis and forecast of PM10 and PM2.5 , 2017 .

[20]  Jinfeng Wang,et al.  An Ensemble Spatiotemporal Model for Predicting PM2.5 Concentrations , 2017, International journal of environmental research and public health.

[21]  S. Xie,et al.  Spatial Distribution of Ozone Formation in China Derived from Emissions of Speciated Volatile Organic Compounds. , 2017, Environmental science & technology.

[22]  Chuanhua Yu,et al.  The influence of temperature on mortality and its Lag effect: a study in four Chinese cities with different latitudes , 2016, BMC Public Health.

[23]  Guangming Zeng,et al.  Land use regression models coupled with meteorology to model spatial and temporal variability of NO2 and PM10 in Changsha, China , 2015 .

[24]  Journal Ajer,et al.  Comparison of Different Classification Techniques Using WEKA for Hematological Data , 2015 .

[25]  Jingfeng Huang,et al.  A satellite-based geographically weighted regression model for regional PM2.5 estimation over the Pearl River Delta region in China , 2014 .

[26]  S. Hajat,et al.  Comparative Assessment of the Effects of Climate Change on Heat- and Cold-Related Mortality in the United Kingdom and Australia , 2014, Environmental health perspectives.

[27]  Kai Zhang,et al.  What weather variables are important in predicting heat-related mortality? A new application of statistical learning methods. , 2014, Environmental research.

[28]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[29]  K. Lazarević,et al.  The impact of the July 2007 heat wave on daily mortality in Belgrade, Serbia. , 2013, Central European journal of public health.

[30]  Yuming Guo,et al.  Global climate change: impact of diurnal temperature range on mortality in Guangzhou, China. , 2013, Environmental pollution.

[31]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[32]  Jay P Shimshack,et al.  Absolute humidity, temperature, and influenza mortality: 30 years of county-level evidence from the United States. , 2012, American journal of epidemiology.

[33]  Vijay Prakash,et al.  An efficient info-gain algorithm for finding frequent sequential traversal patterns from web logs based on dynamic weight constraint , 2012, CUBE.

[34]  A. Stohl,et al.  Atmospheric mercury observations from Antarctica: seasonal variation and source and sink region calculations , 2011 .

[35]  M. Bell,et al.  Vulnerability to temperature-related mortality in Seoul, Korea , 2011, Environmental research letters : ERL [Web site].

[36]  I. Tošić,et al.  The maximum temperatures and heat waves in Serbia during the summer of 2007 , 2011 .

[37]  Devashsish Thakur,et al.  Re optimization of ID3 and C4.5 decision tree , 2010, 2010 International Conference on Computer and Communication Technology (ICCCT).

[38]  A Biggeri,et al.  Effects of cold weather on mortality: results from 15 European cities within the PHEWE project. , 2008, American journal of epidemiology.

[39]  Scott C. Doney,et al.  Carbon source/sink information provided by column CO 2 measurements from the Orbiting Carbon Observatory , 2008 .

[40]  J. Gulliver,et al.  A review of land-use regression models to assess spatial variation of outdoor air pollution , 2008 .

[41]  César Hervás-Martínez,et al.  Data Mining Algorithms to Classify Students , 2008, EDM.

[42]  C. Stefanadis,et al.  CLimate Impacts on Myocardial infarction deaths in the Athens TErritory: the CLIMATE study , 2006, Heart.

[43]  V. Kendrovski The impact of ambient temperature on mortality among the urban population in Skopje, Macedonia during the period 1996–2000 , 2006, BMC public health.

[44]  Eric R. Ziegel,et al.  Geographically Weighted Regression , 2006, Technometrics.

[45]  Miha Vuk,et al.  ROC curve, lift chart and calibration plot , 2006, Advances in Methodology and Statistics.

[46]  Giorgio Corani,et al.  Air quality prediction in Milan: feed-forward neural networks, pruned neural networks and lazy learning , 2005 .

[47]  D. Vujović,et al.  Trends in extreme summer temperatures at Belgrade , 2005 .

[48]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[49]  Bianca Zadrozny,et al.  Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers , 2001, ICML.

[50]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[51]  F. Pope,et al.  University of Birmingham The effect of meteorological conditions and atmospheric composition in the occurrence and development of new particle formation (NPF) events in Europe , 2021 .

[52]  H. Gjoreski,et al.  Prediction of Air Pollution Concentration Using Weather Data and Regression Models , 2020 .

[53]  Andreas Jacobsen Lepperød,et al.  Air Quality Prediction with Machine Learning , 2019 .

[54]  K. Lazarević,et al.  Changes in stroke mortality trends and premature mortality due to stroke in Serbia, 1992–2013 , 2015, International Journal of Public Health.

[55]  José Hernández-Orallo,et al.  Calibration of Machine Learning Models , 2012 .

[56]  Wang Xiaohu,et al.  An Application of Decision Tree Based on ID3 , 2012 .

[57]  Hassan Haleh,et al.  A Combined Model of MCDM and Data Mining for Determining Question Weights in Scientific Exams , 2012 .

[58]  Teddy Mantoro,et al.  A Comparison Study of Classifier Algorithms for Mobile-phone's Accelerometer Based Activity Recognition , 2012 .

[59]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[60]  Gerald Benoît,et al.  Data mining , 2002, Annu. Rev. Inf. Sci. Technol..

[61]  Lloyd A. Smith,et al.  Practical feature subset selection for machine learning , 1998 .