Crowdsourcing-based Urban Anomaly Prediction System for Smart Cities

Crowdsourcing has become an emerging data collection paradigm for smart city applications. A new category of crowdsourcing-based urban anomaly reporting systems have been developed to enable pervasive and real-time reporting of anomalies in cities (e.g., noise, illegal use of public facilities, urban infrastructure malfunctions). An interesting challenge in these applications is how to accurately predict an anomaly in a given region of the city before it happens. Prior works have made significant progress in anomaly detection. However, they can only detect anomalies after they happen, which may lead to significant information delay and lack of preparedness to handle the anomalies in an efficient way. In this paper, we develop a Crowdsourcing-based Urban Anomaly Prediction Scheme (CUAPS) to accurately predict the anomalies of a city by exploring both spatial and temporal information embedded in the crowdsourcing data. We evaluated the performance of our scheme and compared it to the state-of-the-art baselines using four real-world datasets collected from 311 service in the city of New York. The results showed that our scheme can predict different categories of anomalies in a city more accurately than the baselines.

[1]  Shou-De Lin,et al.  Inferring Air Quality for Station Location Recommendation Based on Urban Big Data , 2015, KDD.

[2]  Chao Huang,et al.  Unsupervised Interesting Places Discovery in Location-Based Social Sensing , 2016, 2016 International Conference on Distributed Computing in Sensor Systems (DCOSS).

[3]  Danai Koutra,et al.  Fast anomaly detection despite the duplicates , 2013, WWW.

[4]  Christos Faloutsos,et al.  RSC: Mining and Modeling Temporal Activity in Social Media , 2015, KDD.

[5]  Chao Huang,et al.  Exploiting spatial-temporal-social constraints for localness inference using online social media , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[6]  Yuping Zhao,et al.  A Novel Fast Anti-Collision Algorithm for RFID Systems , 2007, 2007 International Conference on Wireless Communications, Networking and Mobile Computing.

[7]  T. Minka Estimating a Dirichlet distribution , 2012 .

[8]  Tarek F. Abdelzaher,et al.  Surrogate mobile sensing , 2014, IEEE Communications Magazine.

[9]  Chao Huang,et al.  Confidence-aware truth estimation in social sensing applications , 2015, 2015 12th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON).

[10]  Yu Zheng,et al.  Detecting collective anomalies from multiple spatio-temporal datasets across different domains , 2015, SIGSPATIAL/GIS.

[11]  Nuno Constantino Castro,et al.  Time Series Data Mining , 2009, Encyclopedia of Database Systems.

[12]  Chao Huang,et al.  Topic-Aware Social Sensing with Arbitrary Source Dependency Graphs , 2016, 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).