Assessment of Urban Land-Cover Classification: Comparison Between Pixel and Object Scales

The need for reliable and exhaustive data on land use is a major issue in planning policies and monitoring of land take. In this work, we evaluate urban building classification resulting from deep learning based approaches using as input SPOT 6/7 satellite images (at a spatial resolution of 1.5m) and national databases. In addition to assessing the classifier behaviour on urban land cover, the objective here is to compare the deep learning results at both pixel-based and object-based scale to qualify the overall results. Standard evaluation metrics (such as F-score) have shown better scores at the object-based assessment level (median F-score = 0.78) than the pixel ones (median F-score = 0.68). Identifying the impact of object characteristics using the object-based level of analysis has also revealed that beyond a surface area of 100 m2, objects are much better detected (median F-score > 0.91). It is the same with a high urban density (median F-score > 0.97). The accuracy of the intersections evaluated from the Intersection over Union also follows these trends as the entities area increases.