Extracting City Traffic Events from Social Streams

Cities are composed of complex systems with physical, cyber, and social components. Existing work on extracting and understanding city events relies mainly on technology-enabled infrastructure to observe and record events. In this work, we propose an approach that leverages citizen observations of various city systems and services, such as traffic, public transport, water supply, weather, sewage, and public safety, as a source of city events. We investigate the feasibility of extracting city events from such textual streams once they have been annotated. We formalize the annotation of social streams such as microblogs as a sequence labeling problem. We present a novel process for automatically creating the training data for sequence labeling models; this process uses instance-level domain knowledge (e.g., locations in a city, possible event terms). We compare this automated annotation process to a state-of-the-art tool that requires manually created training data and show that it achieves comparable performance on annotation tasks. We then present an aggregation algorithm for extracting events from the annotated text. We carry out a comprehensive evaluation of event annotation and event extraction on a real-world dataset consisting of event reports and tweets collected over 4 months from the San Francisco Bay Area. The evaluation results are promising and provide insights into the utility of social streams for extracting city events.
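As an illustration of how instance-level domain knowledge could produce sequence labeling training data without manual annotation, the following is a minimal sketch and not the authors' implementation. It assumes a BIO tagging scheme, a greedy longest-match lookup, and two hypothetical dictionaries (`LOCATION` and `EVENT`); the function name `bio_tag` and all dictionary entries are invented for this example.

```python
from typing import Dict, List, Set


def bio_tag(tokens: List[str],
            gazetteers: Dict[str, Set[str]],
            max_len: int = 4) -> List[str]:
    """Assign BIO tags by greedy longest-match lookup in the dictionaries.

    `gazetteers` maps a label (hypothetical: "LOCATION", "EVENT") to a set
    of lowercase phrases. Tokens that match no phrase are tagged "O".
    """
    tags = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate phrase first, then shrink it.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(lowered[i:i + n])
            for label, phrases in gazetteers.items():
                if phrase in phrases:
                    tags[i] = "B-" + label
                    for j in range(i + 1, i + n):
                        tags[j] = "I-" + label
                    i += n
                    matched = True
                    break
            if matched:
                break
        if not matched:
            i += 1
    return tags


# Hypothetical dictionaries standing in for city-specific knowledge.
GAZETTEERS = {
    "LOCATION": {"bay bridge", "treasure island"},
    "EVENT": {"accident", "road closure"},
}

tokens = "Accident on Bay Bridge near Treasure Island".split()
tags = bio_tag(tokens, GAZETTEERS)
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Token and tag pairs produced this way could serve as training input for a sequence labeling model such as a conditional random field. How the actual process handles ambiguous or overlapping dictionary matches is not stated in the abstract, so the longest-match rule here is only one possible choice.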
