A multilevel stratified spatial sampling approach based on terrain knowledge for the quality assessment of OpenStreetMap dataset in Hong Kong

With the development of Volunteered Geographical Information (VGI) data, the OpenStreetMap has high research value in terms of project activity, social influence, urban development, application scope, and historical richness and the number of buildings or roads is increasing every day. However, how to evaluate the quality of a large amount OpenStreetMaps efficiently and accurately is still not fully understood. This article presents the development of an approach regarding multilevel stratified spatial sampling based on slope knowledge and official 1:1000 thematic maps as the reference dataset for OpenStreetMap data quality inspection of Hong Kong. This multilevel stratified spatial sampling plan is as follows: (1) The terrain characteristics of Hong Kong are fully considered by dividing grids into quality estimate strata based on the slope information; (2) Spatial sampling for the selection of grids or objects is used; (3) A more reliable sampling subset is made, regarding the representation of the entire OpenStreetMap dataset of Hong Kong. This sampling plan displays a 10% higher sampling accuracy, but without increasing the sample size, particularly as regards building completeness inspection compared with simple random sampling and systematic random sampling. This research promotes further applications of the Open‐Street‐Map dataset, thus enabling us to have a better understanding of the OpenStreetMap data quality in urban areas.

[1]  Yongze Song,et al.  Modeling of spatial stratified heterogeneity , 2022, GIScience & Remote Sensing.

[2]  Candan Gokceoglu,et al.  A Convolutional Neural Network Architecture for Auto-Detection of Landslide Photographs to Assess Citizen Science and Volunteered Geographic Information Data Quality , 2019, ISPRS Int. J. Geo Inf..

[3]  Cláudia M. Viana,et al.  The Value of OpenStreetMap Historical Contributions as a Source of Sampling Data for Multi-Temporal Land Use/Cover Maps , 2019, ISPRS Int. J. Geo Inf..

[4]  Qi Zhou,et al.  An Analysis of the Evolution, Completeness and Spatial Patterns of OpenStreetMap Building Data in China , 2019, ISPRS Int. J. Geo Inf..

[5]  Massimo Martinelli,et al.  Volunteered Geographic Information for Enhanced Marine Environment Monitoring , 2018, Applied Sciences.

[6]  Maria A. Brovelli,et al.  A New Method for the Assessment of Spatial Accuracy and Completeness of OpenStreetMap Building Footprints , 2018, ISPRS Int. J. Geo Inf..

[7]  Hansi Senaratne,et al.  A review of volunteered geographic information quality assessment methods , 2017, Int. J. Geogr. Inf. Sci..

[8]  Tonglin Zhang,et al.  A measure of spatial stratified heterogeneity , 2016 .

[9]  Wenzhong Shi,et al.  A Multilevel Stratified Spatial Sampling Approach for the Quality Assessment of Remote-Sensing-Derived Products , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[10]  Alexander Zipf,et al.  OpenStreetMap in GIScience: Experiences, Research, and Applications , 2015, Lecture Notes in Geoinformation and Cartography.

[11]  Alexander Zipf,et al.  An Introduction to OpenStreetMap in Geographic Information Science: Experiences, Research, and Applications , 2015, OpenStreetMap in GIScience.

[12]  Mahmoud Reza Delavar,et al.  A Quality Study of the OpenStreetMap Dataset for Tehran , 2014, ISPRS Int. J. Geo Inf..

[13]  Pascal Neis,et al.  Quality assessment for building footprints data on OpenStreetMap , 2014, Int. J. Geogr. Inf. Sci..

[14]  David L. Tulloch Crowdsourcing geographic knowledge: volunteered geographic information (VGI) in theory and practice , 2014, Int. J. Geogr. Inf. Sci..

[15]  Robert Hecht,et al.  Measuring Completeness of Building Footprints in OpenStreetMap over Space and Time , 2013, ISPRS Int. J. Geo Inf..

[16]  M. Triglav-Čekada,et al.  Using volunteered geographical information to map the November 2012 floods in Slovenia , 2013 .

[17]  Qingwu Hu,et al.  QUALITY ANALYSIS OF OPEN STREET MAP DATA , 2013 .

[18]  Martin Herold,et al.  A global land-cover validation data set, II: augmenting a stratified sampling design to estimate accuracy by region and land-cover class , 2012 .

[19]  Zhenhua Wang,et al.  Fuzzy acceptance sampling plans for inspection of geospatial data with ambiguity in quality characteristics , 2012, Comput. Geosci..

[20]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[21]  Stephen V. Stehman,et al.  Impact of sample size allocation when using stratified random sampling to estimate accuracy and area of land-cover change , 2012 .

[22]  Kevin Curran,et al.  OpenStreetMap , 2012, Int. J. Interact. Commun. Syst. Technol..

[23]  Zhenhua Wang,et al.  Designing a two-rank acceptance sampling plan for quality inspection of geospatial data products , 2011, Comput. Geosci..

[24]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[25]  Robert Haining,et al.  Sample surveying to estimate the mean of a heterogeneous surface: reducing the error variance through zoning , 2010, Int. J. Geogr. Inf. Sci..

[26]  David J. Selkowitz,et al.  A spatially stratified, multi-stage cluster sampling design for assessing accuracy of the Alaska (USA) National Land Cover Database (NLCD) , 2010 .

[27]  Xiaoying Zheng,et al.  Geographical Detectors‐Based Health Risk Assessment and its Application in the Neural Tube Defects Study of the Heshun Region, China , 2010, Int. J. Geogr. Inf. Sci..

[28]  Stephen V. Stehman,et al.  Sampling designs for accuracy assessment of land cover , 2009 .

[29]  Jinfeng Wang,et al.  A knowledge‐based similarity classifier to stratify sample units to improve the estimation precision , 2009 .

[30]  Budiman Minasny,et al.  The variance quadtree algorithm: Use for spatial sampling design , 2007, Comput. Geosci..

[31]  Alan H. Strahler,et al.  Validation of the global land cover 2000 map , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Alfred Stein,et al.  An overview of spatial sampling procedures and experimental design of spatial studies for ecosystem comparisons , 2003 .

[33]  Jinfeng Wang,et al.  Spatial sampling design for monitoring the area of cultivated land , 2002 .

[34]  Stephen V. Stehman,et al.  Practical Implications of Design-Based Sampling Inference for Thematic Map Accuracy Assessment , 2000 .

[35]  J. Scepan,et al.  Thematic validation of high-resolution Global Land-Cover Data sets , 1999 .

[36]  Stephen V. Stehman,et al.  Basic probability sampling designs for thematic map accuracy assessment , 1999 .

[37]  Thomas J. Stohlgren,et al.  Assessing the accuracy of Landsat Thematic Mapper classification using double sampling , 1998 .

[38]  MICHAEL F. GOODCHILD,et al.  A Simple Positional Accuracy Measure for Linear Features , 1997, Int. J. Geogr. Inf. Sci..

[39]  Stephen V. Stehman,et al.  Thematic map accuracy assessment from the perspective of finite population sampling , 1995 .

[40]  S. V. Stehman,et al.  Comparison of systematic and random sampling for estimating the accuracy of maps generated from remotely sensed data , 1992 .