How to Draw a Neighborhood? The Potential of Big Data, Regionalization, and Community Detection for Understanding the Heterogeneous Nature of Urban Neighborhoods

How to draw neighborhood boundaries, or spatial regions in general, has been a long-standing focus in Geography. This article examines this question from a methodological perspective, often referred to as regionalization, with an empirical study of neighborhoods in New York City. I argue that methodological advances, combined with the affordances of big data, enable a different, more nuanced approach to regionalization than has been possible in the past. Conventional data sets often dictate constraints in terms of data availability and spatio-temporal granularity. However, big data is now available at much finer spatio-temporal scales and covers a wider array of aspects of social life. The emergence of these data sets supports the notion that neighborhoods can be fuzzy and highly dependent on spatio-temporal scales and socio-economic variables. As such, these new data sets can help to bring quantitative analysis in line with social theory that has long emphasized the heterogeneous nature of neighborhoods. This article uses a data set of geotagged tweets to demonstrate how different “sets” of neighborhoods may exist at different spatio-temporal scales and for different algorithms. Such varying neighborhood boundaries are not a technical problem in need of a solution but rather a reflection of the complexity of the underlying urban fabric.

[1]  Stan Openshaw,et al.  Modifiable Areal Unit Problem , 2008, Encyclopedia of GIS.

[2]  L. F. Dorward A problem in weighing , 1957 .

[3]  B. Wellman The Community Question: The Intimate Networks of East Yorkers , 1979, American Journal of Sociology.

[4]  Stan Openshaw,et al.  An Empirical Study of Some Zone-Design Criteria , 1978 .

[5]  Eshref Shevky,et al.  The social areas of Los Angeles : analysis and typology , 1949 .

[6]  Stan Openshaw,et al.  The use and definition of travel-to-work areas in Great Britain: Some comments , 1982 .

[7]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[8]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[9]  Robert W. Faris,et al.  Measuring 'neighborhood': Constructing network neighborhoods , 2012, Soc. Networks.

[10]  R. Alba A graph‐theoretic definition of a sociometric clique† , 1973 .

[11]  Katharina Anna Zweig,et al.  A fixed degree sequence model for the one-mode projection of multiplex bipartite graphs , 2013, Social Network Analysis and Mining.

[12]  Martin G. Everett,et al.  The dual-projection approach for two-mode networks , 2013, Soc. Networks.

[13]  H. Green Hinterland Boundaries of New York City and Boston in Southern New England , 1955 .

[14]  Filip Agneessens,et al.  Introduction to the special issue on advances in two-mode social networks , 2013, Soc. Networks.

[15]  S. Borgatti,et al.  LS sets, lambda sets and other cohesive subsets , 1990 .

[16]  Martin Rosvall,et al.  Multilevel Compression of Random Walks on Networks Reveals Hierarchical Organization in Large Integrated Systems , 2010, PloS one.

[17]  Chaogui Kang,et al.  Social Sensing: A New Approach to Understanding Our Socioeconomic Environments , 2015 .

[18]  D. Grigg,et al.  THE LOGIC OF REGIONAL SYSTEMS1 , 1965 .

[19]  Javier Esparrago,et al.  Traditional grassland management and surrounding land use drive the abundance of a prairie plant species in urban areas , 2015 .

[20]  K. Axhausen Social Networks, Mobility Biographies, and Travel: Survey Challenges , 2008 .

[21]  Ed Manley,et al.  Identifying functional urban regions within traffic flow , 2014 .

[22]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[24]  Ron Johnston,et al.  CHOICE IN CLASSIFICATION: THE SUBJECTIVITY OFOBJECTIVE METHODS1 , 1968 .

[25]  Matthew Zook,et al.  Beyond the geotag: situating ‘big data’ and leveraging the potential of the geoweb , 2013 .

[26]  L. Bean,et al.  The Shevky-Bell Social Areas: Confirmation of Results and a Reinterpretation , 1961 .

[27]  L. A. Brown,et al.  THE DELIMITATION OF FUNCTIONAL REGIONS, NODAL REGIONS, AND HIERARCHIES BY FUNCTIONAL DISTANCE APPROACHES* , 1971 .

[28]  R. Johnston,et al.  Grouping and Regionalizing: Some Methodological and Technical Observations , 1970 .

[29]  Alessandro Chessa,et al.  Commuter networks and community detection: A method for planning sub regional areas , 2011, ArXiv.

[30]  Mauricio Barahona,et al.  Encoding dynamics for multiscale community detection: Markov time sweeping for the map equation. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[33]  J. Haughton Irish local newspapers: A geographical study , 1950 .

[34]  Alexandre Arenas,et al.  Identifying modular flows on multilayer networks reveals highly overlapping organization in social systems , 2014, ArXiv.

[35]  Jean-Claude Thill,et al.  Social area analysis, data mining, and GIS , 2008, Comput. Environ. Urban Syst..

[36]  Bin Jiang,et al.  Volunteered Geographic Information: Towards the establishment of a new paradigm , 2015, Comput. Environ. Urban Syst..

[37]  Sungkil Lee,et al.  Real-Time Tracking of Visually Attended Objects in Virtual Environments and Its Application to LOD , 2009, IEEE Transactions on Visualization and Computer Graphics.

[38]  Barry Wellman,et al.  Geography of Twitter networks , 2012, Soc. Networks.

[39]  Emanuele Strano,et al.  The Structure of Spatial Networks and Communities in Bicycle Sharing Systems , 2013, PloS one.

[40]  S Openshaw,et al.  Algorithms for Reengineering 1991 Census Geography , 1995, Environment & planning A.

[41]  P. Rees The Factorial Ecology of Calcutta , 1969, American Journal of Sociology.

[42]  Martin G. Everett,et al.  Network analysis of 2-mode data , 1997 .

[43]  Matthew Zook,et al.  Social Media and the City: Rethinking Urban Socio-Spatial Inequality Using User-Generated Geographic Information , 2015 .

[44]  B. Wellman,et al.  Networks, Neighborhoods, and Communities , 1979 .

[45]  Yong Gao,et al.  Uncovering Patterns of Inter-Urban Trip and Spatial Interaction from Social Media Check-In Data , 2013, PloS one.

[46]  Martin Rosvall,et al.  Compression of flow can reveal overlapping modular organization in networks , 2011, ArXiv.

[47]  Justus Uitermark,et al.  How to Study the City on Instagram , 2016, PloS one.

[48]  Paul B. Slater,et al.  Comparisons of aggregation procedures for interaction data: An illustration using a college student international flow table , 1981 .

[49]  Alessandro Vespignani,et al.  The Structure of Interurban Traffic: A Weighted Network Analysis , 2005, physics/0507106.

[50]  Carl T. Bergstrom,et al.  The map equation , 2009, 0906.1405.

[51]  W. Patefield,et al.  An Efficient Method of Generating Random R × C Tables with Given Row and Column Totals , 1981 .

[52]  Stan Openshaw,et al.  A geographical solution to scale and aggregation problems in region-building, partitioning and spatial modelling , 1977 .

[53]  Wendell Bell Economic, Family, and Ethnic Status: An Empirical Test , 1955 .

[54]  Santo Fortunato,et al.  Limits of modularity maximization in community detection , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  O. Järv,et al.  Using Mobile Positioning Data to Model Locations Meaningful to Users of Mobile Phones , 2010 .

[56]  Jean-Charles Delvenne,et al.  Markov Dynamics as a Zooming Lens for Multiscale Community Detection: Non Clique-Like Communities and the Field-of-View Limit , 2011, PloS one.

[57]  B. Derudder,et al.  How international is the Annual Meeting of the Association of American Geographers? A social network analysis perspective , 2016 .

[58]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[59]  Song Gao,et al.  Discovering Spatial Interaction Communities from Mobile Phone Data , 2013 .

[60]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[61]  Xiaolong Zhu,et al.  A modified deep neural network enables identification of foliage under complex background , 2020, Connect. Sci..

[62]  Cecilia Mascolo,et al.  A multilayer approach to multiplexity and link prediction in online geo-social networks , 2016, EPJ Data Science.

[63]  B. Wellman,et al.  Imagining Twitter as an Imagined Community , 2011 .

[64]  H. Kariel,et al.  A NODAL STRUCTURE FOR A SET OF CANADIAN CITIES USING GRAPH THEORY AND NEWSPAPER DATELINES , 1977 .

[65]  Zachary P. Neal,et al.  Structural Determinism in the Interlocking World City Network , 2012 .

[66]  S. Strogatz,et al.  Redrawing the Map of Great Britain from a Network of Human Interactions , 2010, PloS one.

[67]  Zachary Neal,et al.  The backbone of bipartite projections: Inferring relationships from co-authorship, co-sponsorship, co-attendance and other co-behaviors , 2014, Soc. Networks.

[68]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[69]  R. Luce,et al.  Connectivity and generalized cliques in sociometric group structure , 1950, Psychometrika.

[70]  Matthew Zook,et al.  Artists and Bankers and Hipsters, Oh My! Mapping Tweets in the New York Metropolitan Region , 2014 .

[71]  K. Axhausen,et al.  Activity spaces: Measures of social exclusion? , 2003 .

[72]  Carlo Ratti,et al.  The impact of social segregation on human mobility in developing and industrialized regions , 2014, EPJ Data Science.

[73]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[74]  M. Dacey,et al.  A graph theory interpretation of nodal regions , 1961 .

[75]  Carlo Ratti,et al.  Exploring human movements in Singapore: a comparative analysis based on mobile phone and taxicab usages , 2013, UrbComp '13.