Missing Data Imputation Through SGTM Neural-Like Structure for Environmental Monitoring Tasks

The article describes a new missing data imputation method. It is based on the use of a high-speed neural-like structure of the Successive Geometric Transformations Model. The importance of the research is based on the analysis of disadvantages of the known methods for missing data processing. Various simple and complex algorithms are analyzed, among which are the arithmetic mean algorithm, regression modeling, etc. It is shown that the above-mentioned imputation methods in data monitoring of air pollution do not allow to obtain reliable results due to the low prediction accuracy. An effective method for processing data imputation through SGTM neural-like structure is proposed. An example of filling data by forecasting CO, NO and NO2 missed parameters in data monitoring of air pollution is given. A comparison of the proposed method with the arithmetic mean algorithm is carried out. Accuracies of the data imputation by developed method and by arithmetic mean algorithm are based on calculated evaluation criteria: Root mean squared errors. Experimentally established that the data imputation method through SGTM neural-like structure has a three times higher accuracy of the data imputation than the arithmetic mean algorithm. The proposed approach can be used in various areas such as medicine, materials science, economics, science services, etc.

[1]  I. Yurchak,et al.  Neurolike networks on the basis of Geometrical Transformation Machine , 2008, 2008 International Conference on Perspective Technologies and Methods in MEMS Design.

[2]  Oleksii K. Tyshchenko,et al.  A neuro-fuzzy Kohonen network for data stream possibilistic clustering and its online self-learning procedure , 2017, Appl. Soft Comput..

[3]  Nataliia Kunanets,et al.  Recovery Gaps in Experimental Data , 2018, COLINS.

[4]  Volodymyr Lytvynenko,et al.  Technology of Gene Expression Profiles Filtering Based on Wavelet Analysis , 2018 .

[5]  Ivanna Dronyuk,et al.  Ateb-prediction simulation of traffic using OMNeT++ modeling tools , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[6]  Natalia Kryvinska,et al.  It is all about services-fundamentals, drivers, and business models , 2013, J. Serv. Sci. Res..

[7]  Oleksii K. Tyshchenko,et al.  A cascade deep neuro-fuzzy system for high-dimensional online possibilistic fuzzy clustering , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[8]  Roderick J A Little,et al.  A Review of Hot Deck Imputation for Survey Non‐response , 2010, International statistical review = Revue internationale de statistique.

[9]  Shlomo Engelberg Digital Signal Processing: An Experimental Approach , 2008 .

[10]  A. Cohen,et al.  Exposure assessment for estimation of the global burden of disease attributable to outdoor air pollution. , 2012, Environmental science & technology.

[11]  Oleksii K. Tyshchenko,et al.  A deep cascade neural network based on extended neo-fuzzy neurons and its adaptive learning algorithm , 2017, 2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON).

[12]  Oleksii K. Tyshchenko,et al.  An evolving connectionist system for data stream fuzzy clustering and its online learning , 2017, Neurocomputing.

[13]  Vasyl Teslyuk,et al.  Development and Implementation of the Technical Accident Prevention Subsystem for the Smart Home System , 2018 .

[14]  S. D. Vito,et al.  CO, NO2 and NOx urban pollution monitoring with on-field calibrated electronic nose by automatic bayesian regularization , 2009 .

[15]  Vasyl Lytvyn,et al.  Time Dependence of the Output Signal Morphology for Nonlinear Oscillator Neuron Based on Van der Pol Model , 2018 .

[16]  Pavlo Mulesa,et al.  Hybrid clustering-classification neural network in the medical diagnostics of reactive arthritis , 2016, ArXiv.

[17]  Sergii Babichev,et al.  Development of a technique for the reconstruction and validation of gene network models based on gene expression profiles , 2018 .

[18]  Ivan Izonin,et al.  Model and Principles for the Implementation of Neural-Like Structures Based on Geometric Data Transformations , 2018 .

[19]  Nataliia Kunanets,et al.  The Information Support of Virtual Research Teams by Means of Cloud Managers , 2018 .

[20]  Natalya Shakhovska,et al.  The structure of information systems for environmental monitoring , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[21]  Craig K. Enders,et al.  An introduction to modern missing data analyses. , 2010, Journal of school psychology.