Deep Learning From Spatio-Temporal Data Using Orthogonal Regularizaion Residual CNN for Air Prediction

Air pollution is harmful to human health and restricts economic development, so predicting when and where air pollution will occur is a challenging and important issue, especially in fields of urban planning, factory production and human activities. In this paper, we propose a deep Spatio-Temporal Orthogonal Regularization Residual CNN (ST-OR-ResNet) for air prediction. Deep Convolutional Neural Network (CNN) is presented to capture the complex spatio-temporal relation of the dynamic biased meteorological data. Residual learning is designed to avoid unpredictable oscillations when training the network and verifying errors. For the issue of characteristic statistical migration and saddle point proliferation in deep network, the orthogonality regularizations are designed to stabilize the back-propagation errors, utilizing various advanced analytical tools such as restricted isometry property without extra hassle. We then benchmark their effects on public real-world datasets to demonstrate that ST-OR-ResNet has better predictive performance than the state-of-the-art methods.

[1]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[2]  Luca Bertinetto,et al.  Learning feed-forward one-shot learners , 2016, NIPS.

[3]  Shiliang Pu,et al.  All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[5]  Luc Van Gool,et al.  Dynamic Filter Networks , 2016, NIPS.

[6]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[7]  S. Andradóttir A method for discrete stochastic optimization , 1995 .

[8]  M. Etkind Air pollution-an overview , 1995 .

[9]  Xiaohan Chen,et al.  Can We Gain More from Orthogonality Regularizations in Training Deep CNNs? , 2018, NeurIPS.

[10]  Haichao Zhang,et al.  Sparse Coding and its Applications in Computer Vision , 2015 .

[11]  Qingqing He,et al.  Satellite-based mapping of daily high-resolution ground PM 2.5 in China via space-time regression modeling , 2018 .

[12]  Razvan Pascanu,et al.  Natural Neural Networks , 2015, NIPS.

[13]  Yong Wang,et al.  Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation Network Speed Prediction , 2017, Sensors.

[14]  Minh N. Do,et al.  Special paraunitary matrices, Cayley transform, and multidimensional orthogonal filter banks , 2006, IEEE Transactions on Image Processing.

[15]  Koji Zettsu,et al.  Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM2.5 , 2015, Neural Computing and Applications.

[16]  Yuichi Yoshida,et al.  Spectral Norm Regularization for Improving the Generalizability of Deep Learning , 2017, ArXiv.

[17]  J. Schwartz,et al.  Incorporating local land use regression and satellite aerosol optical depth in a hybrid model of spatiotemporal PM2.5 exposures in the Mid-Atlantic states. , 2012, Environmental science & technology.

[18]  Yu Zheng,et al.  Deep Distributed Fusion Network for Air Quality Prediction , 2018, KDD.

[19]  Hanqiu Sun,et al.  Video Saliency Prediction Using Spatiotemporal Residual Attentive Networks , 2020, IEEE Transactions on Image Processing.

[20]  Jianbing Shen,et al.  Triplet Loss in Siamese Network for Object Tracking , 2018, ECCV.

[21]  Ding Liu,et al.  Learning Simple Thresholded Features With Sparse Support Recovery , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  F. Xavier Roca,et al.  Regularizing CNNs with Locally Constrained Decorrelations , 2016, ICLR.

[23]  Tianrui Li,et al.  Predicting Citywide Crowd Flows Using Deep Spatio-Temporal Residual Networks , 2017, Artif. Intell..

[24]  S. Frick,et al.  Compressed Sensing , 2014, Computer Vision, A Reference Guide.

[25]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[26]  Serge J. Belongie,et al.  Residual Networks are Exponential Ensembles of Relatively Shallow Networks , 2016, ArXiv.

[27]  Mysore G. Satish,et al.  Dispersion model evaluation of PM2.5, NOx and SO2 from point and major line sources in Nova Scotia, Canada using AERMOD Gaussian plume air dispersion model , 2013 .

[28]  Richard Baraniuk,et al.  Mad Max: Affine Spline Insights Into Deep Learning , 2018, Proceedings of the IEEE.

[29]  Ruimao Zhang,et al.  Differentiable Dynamic Normalization for Learning Deep Representation , 2019, ICML.

[30]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[31]  Takayuki Okatani,et al.  Optimization on Submanifolds of Convolution Kernels in CNNs , 2016, ArXiv.

[32]  Basura Fernando,et al.  Generalized BackPropagation, Étude De Cas: Orthogonality , 2016, ArXiv.

[33]  Venu Govindaraju,et al.  Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks , 2016, ICML.

[34]  João Gama,et al.  Predicting Taxi–Passenger Demand Using Streaming Data , 2013, IEEE Transactions on Intelligent Transportation Systems.

[35]  Steven C. Hillmer,et al.  An ARIMA-Model-Based Approach to Seasonal Adjustment , 1982 .

[36]  H. Akimoto Global Air Quality and Pollution , 2003, Science.

[37]  Haibin Ling,et al.  A Deep Network Solution for Attention and Aesthetics Aware Photo Cropping , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Jieping Ye,et al.  Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction , 2018, AAAI.

[39]  Richard G. Baraniuk,et al.  A Spline Theory of Deep Learning , 2018, ICML 2018.

[40]  Yuichi Yoshida,et al.  Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.

[41]  Jianbing Shen,et al.  Local Semantic Siamese Networks for Fast Tracking , 2019, IEEE Transactions on Image Processing.

[42]  Jieping Ye,et al.  Multi-Task Learning for Spatio-Temporal Event Forecasting , 2015, KDD.

[43]  Xianglong Liu,et al.  Orthogonal Weight Normalization: Solution to Optimization over Multiple Dependent Stiefel Manifolds in Deep Neural Networks , 2017, AAAI.

[44]  Xiaoping Liu,et al.  Satellite-based ground PM 2.5 estimation using timely structure adaptive modeling , 2016 .

[45]  Liangpei Zhang,et al.  Estimating Ground‐Level PM2.5 by Fusing Satellite and Station Observations: A Geo‐Intelligent Deep Learning Approach , 2017, 1707.03558.

[46]  David W. S. Wong,et al.  An adaptive inverse-distance weighting spatial interpolation technique , 2008, Comput. Geosci..

[47]  Armistead G Russell,et al.  Improving the Accuracy of Daily PM2.5 Distributions Derived from the Fusion of Ground-Level Measurements with Aerosol Optical Depth Observations, a Case Study in North China. , 2016, Environmental science & technology.

[48]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[49]  L. F. Abbott,et al.  Random Walk Initialization for Training Very Deep Feedforward Networks , 2014, 1412.6558.

[50]  Yonghua Wu,et al.  Assessing satellite AOD based and WRF/CMAQ output PM2.5 estimators , 2013, Defense, Security, and Sensing.

[51]  Honglak Lee,et al.  Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units , 2016, ICML.

[52]  Xiaogang Jin,et al.  Quadruplet Network With One-Shot Learning for Fast Visual Object Tracking , 2017, IEEE Transactions on Image Processing.

[53]  Trevor Darrell,et al.  Data-dependent Initializations of Convolutional Neural Networks , 2015, ICLR.