Air Quality Inference with Deep Convolutional Conditional Random Field

Conditional Random Field is a discriminative model for time series data. In this paper, we propose an improved CRF and apply it to the task of air quality inference. Different from the classical CRF, our linear chain CRF is a supervised learning based on the deep convolution neural network, which has a strong learning ability and fast processing speed for the engineering big data. Specifically, we model the state feature function and the state transition feature function with deep convolutional neural network. The parameter space can store more feature expressions learned from a large number of data. For the state transition feature function of linear conditional random field, we add the influence of input sequence on this function. Through the modelling and learning both vertex features and edge features from data, we obtain a more powerful and more efficient CRF. Experiments on natural language and air quality data show our CRF can achieve higher accuracy.

[1]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[2]  Joseph S. Scire,et al.  A User's Guide for the CALPUFF Dispersion Model , 2000 .

[3]  Xing Xie,et al.  Sensing the pulse of urban refueling behavior , 2013, UbiComp.

[4]  R. Martin Satellite remote sensing of surface air quality , 2008 .

[5]  Lothar Thiele,et al.  Participatory Air Pollution Monitoring Using Smartphones , 2012 .

[6]  Joseph P. Pinto,et al.  Ground-level nitrogen dioxide concentrations inferred from the satellite-borne Ozone Monitoring Instrument , 2008 .

[7]  Guangzhong Sun,et al.  Driving with knowledge from the physical world , 2011, KDD.

[8]  Yu Zheng,et al.  T-share: A large-scale dynamic taxi ridesharing service , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[9]  Xing Xie,et al.  An Interactive-Voting Based Map Matching Algorithm , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[10]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[11]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[12]  Xing Xie,et al.  Urban computing with taxicabs , 2011, UbiComp '11.

[13]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[14]  R. Martin,et al.  Estimating ground-level PM2.5 using aerosol optical depth determined from satellite remote sensing , 2006 .

[15]  Kun Li,et al.  MAQS: a personalized mobile sensing system for indoor air quality monitoring , 2011, UbiComp '11.

[16]  K. Pericleous,et al.  Modelling air quality in street canyons : a review , 2003 .