Detecting the Boundaries of Urban Areas in India: A Dataset for Pixel-Based Image Classification in Google Earth Engine

Urbanization often occurs in an unplanned and uneven manner, resulting in profound changes in patterns of land cover and land use. Understanding these changes is fundamental for devising environmentally responsible approaches to economic development in the rapidly urbanizing countries of the emerging world. One indicator of urbanization is built-up land cover that can be detected and quantified at scale using satellite imagery and cloud-based computational platforms. This process requires reliable and comprehensive ground-truth data for supervised classification and for validation of classification products. We present a new dataset for India, consisting of 21,030 polygons from across the country that were manually classified as “built-up” or “not built-up,” which we use for supervised image classification and detection of urban areas. As a large and geographically diverse country that has been undergoing an urban transition, India represents an ideal context to develop and test approaches for the detection of features related to urbanization. We perform the analysis in Google Earth Engine (GEE) using three types of classifiers, based on imagery from Landsat 7 and Landsat 8 as inputs. The methodology produces high-quality maps of built-up areas across space and time. Although the dataset can facilitate supervised image classification in any platform, we highlight its potential use in GEE for temporal large-scale analysis of the urbanization process. Our methodology can easily be applied to other countries and regions.

[1]  T. V. Ramachandra,et al.  Insights to urban dynamics through landscape spatial pattern analysis , 2012, Int. J. Appl. Earth Obs. Geoinformation.

[2]  M. Helbich,et al.  Spatiotemporal urbanization processes in the megacity of Mumbai, India: A Markov chains-cellular automata urban growth model , 2013 .

[3]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[4]  安藤 寛,et al.  Cross-Validation , 1952, Encyclopedia of Machine Learning and Data Mining.

[5]  P. K. Joshi,et al.  Monitoring Urban Landscape Dynamics Over Delhi (India) Using Remote Sensing (1998–2011) Inputs , 2013, Journal of the Indian Society of Remote Sensing.

[6]  Patricia Gober,et al.  Per-pixel vs. object-based classification of urban land cover extraction using high spatial resolution imagery , 2011, Remote Sensing of Environment.

[7]  K. Seto,et al.  Global forecasts of urban expansion to 2030 and direct impacts on biodiversity and carbon pools , 2012, Proceedings of the National Academy of Sciences.

[8]  A. Tatem,et al.  High Resolution Population Distribution Maps for Southeast Asia in 2010 and 2015 , 2013, PloS one.

[9]  C. Pugh,et al.  Sustainability the Environment and Urbanisation , 1996 .

[10]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[11]  M. Friedl,et al.  Mapping global urban areas using MODIS 500-m data: new methods and datasets based on 'urban ecoregions'. , 2010 .

[12]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.

[13]  D. King,et al.  Comparison of pixel- and object-based classification in land cover change mapping , 2011 .

[14]  José Antonio Lozano,et al.  Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Catherine Linard,et al.  Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data , 2015, PloS one.

[16]  T. Ramachandra,et al.  Urban sprawl: metrics, dynamics and modelling using GIS , 2004 .

[17]  Stefano Ermon,et al.  Transfer Learning from Deep Features for Remote Sensing and Poverty Mapping , 2015, AAAI.

[18]  K. Moffett,et al.  Remote Sens , 2015 .

[19]  R. Shankar,et al.  An analysis of urban growth trends in the post-economic reforms period in India , 2012 .

[20]  C. Justice,et al.  High-Resolution Global Maps of 21st-Century Forest Cover Change , 2013, Science.

[21]  H. Urdal,et al.  An urbanization bomb? Population growth and social disorder in cities , 2013 .

[22]  Forrest R. Stevens,et al.  Multitemporal settlement and population mapping from Landsat using Google Earth Engine , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[23]  John F. Mustard,et al.  How much is built? Quantifying and interpreting patterns of built space from different data sources , 2011 .

[24]  Steven E. Franklin,et al.  A comparison of pixel-based and object-based image analysis with selected machine learning algorithms for the classification of agricultural landscapes using SPOT-5 HRG imagery , 2012 .

[25]  T. Ramachandra,et al.  Characterising Urban Sprawl from Remote Sensing Data and Using Landscape Metrics , 2007 .

[26]  Deepak Khare,et al.  Monitoring and modelling of urban sprawl using remote sensing and GIS techniques , 2008, Int. J. Appl. Earth Obs. Geoinformation.

[27]  A. Dewan,et al.  Land use and land cover change in Greater Dhaka, Bangladesh: Using remote sensing to promote sustainable urbanization , 2009 .

[28]  D. Civco,et al.  Mapping urban areas on a global scale: which of the eight maps now available is more accurate? , 2009 .

[29]  C. Elvidge,et al.  Spatial analysis of global urban extent from DMSP-OLS night lights , 2005 .

[30]  John Langford,et al.  Beating the hold-out: bounds for K-fold and progressive cross-validation , 1999, COLT '99.

[31]  T. Esch,et al.  Monitoring urbanization in mega cities from space , 2012 .

[32]  Atiqur Rahman,et al.  Monitoring Urban Sprawl Using Remote Sensing and GIS Techniques of a Fast Growing Urban Centre, India , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[33]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[34]  L. Adair,et al.  Quantifying the urban environment: a scale measure of urbanicity outperforms the urban-rural dichotomy. , 2007, Social science & medicine.

[35]  Johannes Schlesinger,et al.  Using Crowd-Sourced Data to Quantify the Complex Urban Fabric - OpenStreetMap and the Urban-Rural Index , 2015, OpenStreetMap in GIScience.

[36]  Jay Gao,et al.  Use of normalized difference built-up index in automatically mapping urban areas from TM imagery , 2003 .

[37]  N. Pettorelli,et al.  Using the satellite-derived NDVI to assess ecological responses to environmental change. , 2005, Trends in ecology & evolution.

[38]  B. Bhatta,et al.  Urban sprawl measurement from remote sensing data , 2010 .

[39]  Sujatha Srinivasan,et al.  Urban India 2011: Evidence , 2012 .

[40]  Carla E. Brodley,et al.  The Effect of Instance-Space Partition on Significance , 2001, Machine Learning.

[41]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[42]  Weifeng Li,et al.  Comparing Machine Learning Classifiers for Object-Based Land Cover Classification Using Very High Resolution Imagery , 2014, Remote. Sens..

[43]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[44]  David Potere,et al.  Comparison of Global Urban Maps , 2009 .

[45]  Massimiliano Pittore,et al.  Performance Evaluation of Machine Learning Algorithms for Urban Pattern Recognition from Multi-spectral Satellite Images , 2014, Remote. Sens..

[46]  M. McKinney,et al.  Urbanization, Biodiversity, and Conservation , 2002 .

[47]  Kunal Sen,et al.  What has luck got to do with it? A regional analysis of poverty and agricultural growth in rural India , 2003 .

[48]  Yang Shao,et al.  Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points , 2012 .

[49]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[50]  Hs Sudhira,et al.  Population crunch in India: is it urban or still rural? , 2012 .

[51]  B. Johnson,et al.  Integrating OpenStreetMap crowdsourced data and Landsat time-series imagery for rapid land use/land cover (LULC) mapping: Case study of the Laguna de Bay area of the Philippines , 2016 .

[52]  A. Tatem,et al.  High Resolution Population Maps for Low Income Nations: Combining Land Cover and Census in East Africa , 2007, PloS one.

[53]  Waqar Ahmad,et al.  A COMPARISON OF OBJECT-ORIENTED AND PIXEL-BASED CLASSIFICATION METHODS FOR MAPPING LAND COVER IN NORTHERN AUSTRALIA. , 2005 .

[54]  B. Bhatta Analysis of urban growth pattern using remote sensing and GIS: a case study of Kolkata, India , 2009 .

[55]  E. Glaeser,et al.  A World of Cities: The Causes and Consequences of Urbanization in Poorer Countries , 2013 .

[56]  A. Frenkel,et al.  Measuring Urban Sprawl: How Can We Deal with It? , 2008 .

[57]  S. Bhaskaran,et al.  Per-pixel and object-oriented classification methods for mapping urban features using Ikonos satellite data , 2010 .

[58]  P. Longley,et al.  Global Mapping Of Human Settlement: Experiences, Datasets, and Prospects , 2010 .

[59]  Ryosuke Shibasaki,et al.  Development of a New Ground Truth Database for Global Urban Area Mapping from a Gazetteer , 2011, Remote. Sens..

[60]  C. Prakasam,et al.  Land use and land cover change detection through remote sensing approach: a case study of Kodaikanal Taluk, Tamil Nadu. , 2010 .

[61]  Kent B. Barnes,et al.  SPRAWL DEVELOPMENT: ITS PATTERNS, CONSEQUENCES, AND MEASUREMENT , 2012 .

[62]  P. Fan,et al.  Measuring urban sprawl and its drivers in large Chinese cities: The case of Hangzhou , 2013 .

[63]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[64]  Jacinto Estima,et al.  Investigating the Potential of OpenStreetMap for Land Use/Land Cover Production: A Case Study for Continental Portugal , 2015, OpenStreetMap in GIScience.

[65]  Christopher D. Elvidge,et al.  Area and position accuracy of DMSP nighttime lights data , 2004 .

[66]  Nathaniel Baum-Snow,et al.  Did Highways Cause Suburbanization , 2007 .

[67]  H. Zhang,et al.  Impacts of land use/land cover change and socioeconomic development on regional ecosystem services: The case of fast-growing Hangzhou metropolitan area, China , 2013 .

[68]  Paolo Gamba,et al.  Scaling up to National/Regional Urban Extent Mapping Using Landsat Data , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[69]  Mariana Belgiu,et al.  Comparing supervised and unsupervised multiresolution segmentation approaches for extracting buildings from very high resolution imagery , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[70]  Annemarie Schneider,et al.  Monitoring land cover change in urban and peri-urban areas using dense time stacks of Landsat satellite data and a data mining approach , 2012 .

[71]  Martin Herold,et al.  The spatiotemporal form of urban growth: measurement, analysis and modeling , 2003 .

[72]  Stefan W. Maier,et al.  Comparing object-based and pixel-based classifications for mapping savannas , 2011, Int. J. Appl. Earth Obs. Geoinformation.