Leveraging spatio-temporal clustering for participatory urban infrastructure monitoring

Internet-enabled, location aware smart phones with sensor inputs have led to novel applications exploiting unprecedented high levels of citizen participation in dense metropolitan areas. Especially the possibility to make oneself heard on issues, such as broken traffic lights, potholes or garbage, has led to a high degree of participation in Urban Infrastructure Monitoring. However, duplicate reporting by citizens leads to bottlenecks in manual processing by municipal authorities. Spatio-temporal clustering can serve as an essential tool to group and rank similar reports. Current data mining techniques could be used by municipal departments for this task, but the mandatory parameter selection can be unintuitive, time consuming and error-prone. In this work, we therefore present a novel framework for clustering spatio-temporal data. We first apply an intuitive transformation of the data into a graph structure and subsequently use well-established parameter-free graph clustering techniques to detect and group spatio-temporally close reports. We evaluate our method on two real-world data-sets from different mobile issue tracking platforms. As one of the datasets includes labels for duplicate reports, we can show how our framework outperforms existing techniques in our exemplary use-case (duplicate detection).

[1]  Anand Singh Jalal,et al.  A Density Based Algorithm for Discovering Density Varied Clusters in Large Spatial Databases , 2010 .

[2]  Klemens Böhm,et al.  Statistical Selection of Congruent Subspaces for Mining Attributed Graphs , 2013, 2013 IEEE 13th International Conference on Data Mining.

[3]  Paul Brown,et al.  Fix my street or else: using the internet to voice local public service concerns , 2007, ICEGOV '07.

[4]  Min Wang,et al.  Mining Spatial-temporal Clusters from Geo-databases , 2006, ADMA.

[5]  Hojung Cha,et al.  Automatically characterizing places with opportunistic crowdsensing using smartphones , 2012, UbiComp.

[6]  M. Parimala,et al.  A Survey on Density Based Clustering Algorithms for Mining Large Spatial Databases , 2011 .

[7]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[8]  Lee Anne Fennell,et al.  Crowdsourcing Land Use , 2013 .

[9]  Dorothea Wagner,et al.  Clustering Evolving Networks , 2014, Algorithm Engineering.

[10]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[11]  Jussara M. Almeida,et al.  A comparison of Foursquare and Instagram to the study of city dynamics and urban social behavior , 2013, UrbComp '13.

[12]  Derya Birant,et al.  ST-DBSCAN: An algorithm for clustering spatial-temporal data , 2007, Data Knowl. Eng..

[13]  Slava Kisilevich,et al.  Spatio-temporal clustering , 2010, Data Mining and Knowledge Discovery Handbook.

[14]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[15]  Ines Mergel,et al.  Distributed Democracy: SeeClickFix.Com for Crowdsourced Issue Reporting , 2012 .

[16]  Mark H. Hansen,et al.  Participatory sensing - eScholarship , 2006 .

[17]  Ellie D'Hondt,et al.  Crowdsourcing of Pollution Data using Smartphones , 2010 .

[18]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.