Generating vague neighbourhoods through data mining of passive web data

ABSTRACT Neighbourhoods have been described as ‘the building blocks of public services society’. Their subjective nature, however, and the resulting difficulties in collecting data, means that in many countries there are no officially defined neighbourhoods either in terms of names or boundaries. This has implications not only for policy but also business and social decisions as a whole. With the absence of neighbourhood boundaries many studies resort to using standard administrative units as proxies. Such administrative geographies, however, often have a poor fit with those perceived by residents. Our approach detects these important social boundaries by automatically mining the Web en masse for passively declared neighbourhood data within postal addresses. Focusing on the United Kingdom (UK), this research demonstrates the feasibility of automated extraction of urban neighbourhood names and their subsequent mapping as vague entities. Importantly, and unlike previous work, our process does not require any neighbourhood names to be established a priori.

[1]  Ross Purves,et al.  Exploring place through user-generated content: Using Flickr tags to describe city cores , 2010, J. Spatial Inf. Sci..

[2]  Christopher B. Jones,et al.  A Field Based Representation for Vague Areas Defined by Spatial Prepositions , 2008 .

[3]  G. Galster The Mechanism(s) of Neighbourhood Effects: Theory, Evidence, and Policy Implications , 2012 .

[4]  Luke Thomas Clasper Exploring Vernacular Perceptions of Spatial Entities: Using Twitter Data and R for Delimiting Vague, Dinformal Neighbourhoods in Inner London, UK , 2018 .

[5]  Jacqueline Warren Mills,et al.  Geospatial Analysis: A Comprehensive Guide to Principles, Techniques, and Software Tools, Second Edition - by Michael J. de Smith, Michael F. Goodchild, and Paul A. Longley , 2008, Trans. GIS.

[6]  Michael F. Goodchild,et al.  Constructing places from spatial footprints , 2012, GEOCROWD '12.

[7]  Steven Schockaert,et al.  Vague regions in Geographic Information Retrieval , 2011, SIGSPACIAL.

[8]  Jochen L. Leidner,et al.  Detecting geographical references in the form of place names and associated spatial natural language , 2011, SIGSPACIAL.

[9]  Frank Moulaert,et al.  Can Neighbourhoods Save the City?: Community Development and Social Innovation , 2013 .

[10]  Stephan Winter,et al.  Locating place names from place descriptions , 2013, Int. J. Geogr. Inf. Sci..

[11]  Cecilia Mascolo,et al.  Hoodsquare: Modeling and Recommending Neighborhoods in Location-Based Social Networks , 2013, 2013 International Conference on Social Computing.

[12]  Scott Orford,et al.  The Relationship between Self-reported Definitions of Urban Neighbourhood and Respondent Characteristics: A Study of Cardiff, UK , 2014 .

[13]  Jochen Schaab,et al.  Automated Footprint Generation from Geotags with Kernel Density Estimation and Support Vector Machines , 2009, Spatial Cogn. Comput..

[14]  Davide Buscaldi,et al.  Approaches to disambiguating toponyms , 2011, SIGSPACIAL.

[15]  Robert J. Sampson,et al.  Moving and the Neighborhood Glass Ceiling , 2012, Science.

[16]  David J. Unwin,et al.  Defining and Delineating the Central Areas of Towns for Statistical Monitoring Using Continuous Surface Representations , 2000, Trans. GIS.

[17]  Kevin Lynch,et al.  The Image of the City , 1960 .

[18]  Alia I. Abdelmoty,et al.  Acquisition of Vernacular Place Names from Web Sources , 2008, Weaving Services and People on the World Wide Web.

[19]  Krzysztof Janowicz,et al.  An agenda for the next generation gazetteer: geographic information contribution and retrieval , 2009, GIS.

[20]  Basile Chaix,et al.  The ‘constant size neighbourhood trap’ in accessibility and health studies , 2015 .

[21]  Michael F. Goodchild,et al.  Where's Downtown?: Behavioral Methods for Determining Referents of Vague Spatial Queries , 2003 .

[22]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[23]  Alia I. Abdelmoty,et al.  Acquisition of a vernacular gazetteer from web sources , 2008, LocWeb.

[24]  Avi Arampatzis,et al.  The design and implementation of SPIRIT: a spatially aware search engine for information retrieval on the Internet , 2007, Int. J. Geogr. Inf. Sci..

[25]  Martine De Cock,et al.  Neighborhood restrictions in geographic IR , 2007, SIGIR.

[26]  Paul Clough,et al.  Identifying imprecise regions for geographic information retrieval using the web , 2005 .

[27]  Adrian Popescu,et al.  Gazetiki: automatic creation of a geographical gazetteer , 2008, JCDL '08.

[28]  Paul Brindley Generating vague geographic information through data mining of passive web data , 2016 .

[29]  Max L. Wilson,et al.  A data driven approach to mapping urban neighbourhoods , 2014, SIGSPATIAL/GIS.

[30]  Hideo Joho,et al.  Deliverable type: Contributing WP: , 2022 .